News

This is implementation of TF-IDF algorithm in Scala and Spark. Input - folder with text files. Each row of each file contains id and a text of a document separeted by "\t". Output - folder with text ...