[1] G. Erin, Processing time of TFIDF and Naive Bayes on Spark 2.0, Hadoop 2.6 and Hadoop 2.7: Which Tool Is More Efficient?, MSc Thesis, National College of Ireland, Dublin, 2016. ⇒52
[2] K. Rattanaopas, S. Kaewkeeree, Improving Hadoop MapReduce performance with data compression: A study using wordcount job, 2017 14th International Conference on Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI-CON), IEEE, 2017, pp. 564-567. DOI: 10.1109/ECTICon.2017.8096300 ⇒52
[3] K. M. Lee, C. S. Han, K. I. Kim, S. H. Lee, Word recommendation for English composition using big corpus data processing, Cluster Computing (2019), 1911-1924. ⇒56, 65
[4] M. Kontagora, H. Gonzalez-Velez, Benchmarking a MapReduce Environment on a Full Virtualisation Platform, The 4th International Conference on Complex, Intelligent and Software Intensive Systems, pp. 433-438. DOI: 10.1109/CISIS.2010.45 ⇒62
[5] M. Bartík, S. Ubik, P. Kubalík, LZ4 compression algorithm on FPGA, 2015 IEEE International Conference on Electronics, Circuits, and Systems (ICECS), IEEE, 2015. DOI: 10.1109/ICECS.2015.7440278 ⇒63
[6] R. Y. Rubinstein, D. P. Kroese, Simulation and the Monte Carlo method, Vol. 10, John Wiley & Sons, 2016. DOI: 10.1002/9781118631980 ⇒63
[7] R. Lenhardt, J. Alakuijala, Gipfeli - high speed compression algorithm, 2012 Data Compression Conference, IEEE, 2012, pp. 109-118. DOI: 10.1109/DCC.2012.19 ⇒62
[8] H. Karloff, S. Suri, S. Vassilvitskii, A model of computation for MapReduce, Proceedings of the Twenty-First Annual ACM-SIAM Symposium on Discrete Algorithms, Society for Industrial and Applied Mathematics, 2010. DOI: 10.1137/1.9781611973075.76 ⇒53
[9] Apache Hadoop, Apache, https://hadoop.apache.org/ ⇒52
[10] Apache Spark, Apache, https://spark.apache.org/ ⇒52, 55
[11] E. Brill, A simple rule-based part of speech tagger, Proceedings of the Third Conference on Applied Natural Language Processing, Association for Computational Linguistics, 1992. DOI: 10.3115/974499.974526 ⇒52
[12] Apache YARN, Apache, https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/YARN.html ⇒53
[13] Apache HDFS docs, https://hadoop.apache.org/docs/r1.2.1/ ⇒53
[14] Hadoop Native Library, https://hadoop.apache.org/docs/current/hadoop-project-dist/hadoop-common/NativeLibraries.html ⇒61
[15] Project repository, https://gitlab.com/thelfter/word-prediction ⇒64
[16] Spark SQL, https://spark.apache.org/docs/latest/sql-programming-guide.html ⇒55
[17] Stanford part-of-speech tagger, https://nlp.stanford.edu/software/tagger.html ⇒57
[18] Wikipedia dumps, https://dumps.wikimedia.org/ ⇒63