Leveraging Traditional Machine Learning and TF-IDF for Robust Fake News Detection

Qabas A Hameed
International Journal of Computational and Electronic Aspects in Engineering
Volume 6: Issue 3, July 2025, pp 143-152


Author's Information
Qabas A Hameed1 
Corresponding Author
1Department of Computer Science, College of Computer Science and Mathematics, Tikrit University, Iraq
qabas.a.hameed@tu.edu.iq

Research Paper -- Peer Review
First online on – 12 July 2025

Open Access article under Creative Commons License

Cite this article –Qabas A Hameedl, “Leveraging Traditional Machine Learning and TF-IDF for Robust Fake News Detection”, International Journal of Computational and Electronic Aspects in Engineering, RAME Publishers, Volume 6, Issue 3, pp. 143-152, 2025.
https://doi.org/10.26706/ijceae.6.3.20250605


Abstract:-
Social media platforms have created space for fabricated news to circulate broadly that currently has become a leading problem distorts public opinion and disrupts political discourse and credibility of online information. The research analyzes how traditional machine learning (ML) models detect fake news from the identification of text content with engineered language features. The experiment proves that traditional methods provide competitive performance with less computational power and maintaining greater interpretability if augmented by strong preprocessing and feature extraction methods. A comprehensive preprocessing pipeline consisted of text normalization followed by stop word removal stemming and n-gram modeling and TF-IDF vectorization to transform raw text to numerical features. The performance was assessed by stratified cross-validation and held-out test set on three supervised ML algorithms, i.e., Logistic Regression (LR), Support Vector Machine (SVM), and Random Forest (RF). The performance assessment included accuracy along with precision and recall and F1-score and ROC-AUC. The results showed that Logistic Regression provided maximum accuracy (98.1%) and F1-score but Random Forest provided maximum recall rate (98.5%) and therefore it was better at detecting actual fake news. The SVM model provided well-balanced outcomes but it was computationally expensive. Traditional ML models show high effectiveness in detecting fake news with proper feature engineering techniques.
Index Terms:-
Traditional Machine Learning; TF-IDF; Social media; Support Vector Machine (SVM).
REFERENCES
  1. H. Allcott and M. Gentzkow, "Social Media and Fake News in the 2016 Election %J Journal of Economic Perspectives," vol. 31, no. 2, pp. 211–36, 2017.

  2. D. S. J. I. J. o. C. Cherian and E. A. i. Engineering, "Image Caption Generator Using CNN and LSTM," vol. 3, no. 2, 2022.

  3. S. Vosoughi, D. Roy, and S. J. s. Aral, "The spread of true and false news online," vol. 359, no. 6380, pp. 1146-1151, 2018.

  4. N. Grinberg, K. Joseph, L. Friedland, B. Swire-Thompson, and D. J. S. Lazer, "Fake news on Twitter during the 2016 US presidential election," vol. 363, no. 6425, pp. 374-378, 2019.

  5. A. Naaman, "Using Machine Learning Models to Evaluate the Performance of Website in Iraqi Universities," International Journal of Computational and Electronic Aspects in Engineering, vol. 3, 12/09 2022.

  6. D. M. Lazer et al., "The science of fake news," vol. 359, no. 6380, pp. 1094-1096, 2018.

  7. G. Pennycook, J. McPhetres, Y. Zhang, J. G. Lu, and D. G. J. P. s. Rand, "Fighting COVID-19 misinformation on social media: Experimental evidence for a scalable accuracy-nudge intervention," vol. 31, no. 7, pp. 770-780, 2020.

  8. K. Shu, A. Sliva, S. Wang, J. Tang, and H. J. A. S. e. n. Liu, "Fake news detection on social media: A data mining perspective," vol. 19, no. 1, pp. 22-36, 2017.

  9. X. Zhou and R. J. A. C. S. Zafarani, "A survey of fake news: Fundamental theories, detection methods, and opportunities," vol. 53, no. 5, pp. 1-40, 2020.

  10. Hiyam Hatem, "Improved Deep Learning Models for Plants Diseases Detection for Smart Farming," International Journal of Computational and Electronic Aspects in Engineering, vol. 6, no. 1, 2025.

  11. N. K. Conroy, V. L. Rubin, Y. J. P. o. t. a. f. i. s. Chen, and technology, "Automatic deception detection: Methods for finding fake news," vol. 52, no. 1, pp. 1-4, 2015.

  12. V. L. Rubin, N. Conroy, Y. Chen, and S. Cornwell, "Fake news or truth? using satirical cues to detect potentially misleading news," in Proceedings of the second workshop on computational approaches to deception detection, 2016, pp. 7-17.

  13. M. Burzo, M. Abouelenien, V. Perez-Rosas, and R. Mihalcea, "Multimodal deception detection," in The Handbook of Multimodal-Multisensor Interfaces: Signal Processing, Architectures, and Detection of Emotion and Cognition-Volume 2, 2018, pp. 419-453.

  14. N. Ruchansky, S. Seo, and Y. Liu, "Csi: A hybrid deep model for fake news detection," in Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, 2017, pp. 797-806.

  15. W. J. I. J. o. C. Razzaq and E. A. i. Engineering, "Categorization of Carcinogenic Abnormalities in Digital Mastography Using Deep Learning Algorithms," vol. 4, no. 4, 2023.

  16. M. Sudhakar and K. P. Kaliyamurthie, "Detection of fake news from social media using support vector machine learning algorithms," Measurement: Sensors, vol. 32, p. 101028, 2024/04/01/ 2024.

  17. D. A. Broniatowski et al., "Weaponized health communication: Twitter bots and Russian trolls amplify the vaccine debate," vol. 108, no. 10, pp. 1378-1384, 2018.

  18. M. Potthast, J. Kiesel, K. Reinartz, J. Bevendorff, and B. J. a. p. a. Stein, "A stylometric inquiry into hyperpartisan and fake news," 2017.

  19. A. Bovet and H. A. J. N. c. Makse, "Influence of fake news in Twitter during the 2016 US presidential election," vol. 10, no. 1, p. 7, 2019.

  20. C. Shao, G. L. Ciampaglia, O. Varol, K.-C. Yang, A. Flammini, and F. J. N. c. Menczer, "The spread of low-credibility content by social bots," vol. 9, no. 1, p. 4787, 2018.

  21. K. Shu, D. Mahudeswaran, S. Wang, D. Lee, and H. J. B. d. Liu, "Fakenewsnet: A data repository with news content, social context, and spatiotemporal information for studying fake news on social media," vol. 8, no. 3, pp. 171-188, 2020.

  22. K. Xu, F. Wang, H. Wang, and B. Yang, "Detecting fake news over online social media via domain reputations and content understanding," Tsinghua Science and Technology, vol. 25, pp. 20-27, 02/01 2020.

  23. K. Sharma et al., "Combating fake news: A survey on identification and mitigation techniques," vol. 10, no. 3, pp. 1-42, 2019.

  24. R. Baly et al., "What was written vs. who read it: News media profiling using text analysis and social media context," 2020.

  25. X. Qiu, T. Sun, Y. Xu, Y. Shao, N. Dai, and X. J. S. C. t. s. Huang, "Pre-trained models for natural language processing: A survey," vol. 63, no. 10, pp. 1872-1897, 2020.

  26. A. A. Gillespie and S. Magor, "Tackling online fraud," in ERA Forum, 2020, vol. 20, pp. 439-454: Springer.

  27. N. Diakopoulos, D. J. N. m. Johnson, and society, "Anticipating and addressing the ethical implications of deepfakes in the context of elections," vol. 23, no. 7, pp. 2072-2098, 2021.

  28. A. Marwick, R. J. N. Y. D. Lewis, and S. R. Institute, "Media manipulation and disinformation online," vol. 359, pp. 1146-1151, 2017.

  29. R. J. P. Binns and technology, "Algorithmic accountability and public reason," vol. 31, no. 4, pp. 543-556, 2018.

  30. R. K. Kaliyar, A. Goswami, P. Narang, and S. J. C. S. R. Sinha, "FNDNet–a deep convolutional neural network for fake news detection," vol. 61, pp. 32-44, 2020.

  31. D. Li, H. Guo, Z. Wang, and Z. J. I. a. Zheng, "Unsupervised fake news detection based on autoencoder," vol. 9, pp. 29356-29365, 2021.

  32. F. A. Ozbay, B. J. P. A. s. m. Alatas, and i. applications, "Fake news detection within online social media using supervised artificial intelligence algorithms," vol. 540, p. 123174, 2020.

  33. Mustafa K. Alasadi, "Method for Mining the Opinion Leaders in Social Networks," International Journal of Computational and Electronic Aspects in Engineering, vol. 5, no. 4, 2024.

  34. N. Cavus, M. Göksu, and B. Oktekin, "Real-time fake news detection in online social networks: FANDC Cloud-based system," Scientific Reports, vol. 14, 10/29 2024.

  35. S. Gilda, "Notice of Violation of IEEE Publication Principles: Evaluating machine learning algorithms for fake news detection," in 2017 IEEE 15th Student Conference on Research and Development (SCOReD), 2017, pp. 110-115.

  36. A. Bhardwaj, S. Bharany, and S. J. H. Kim, "Fake social media news and distorted campaign detection framework using sentiment analysis & machine learning," vol. 10, 2024.

  37. J. Jouhar, A. Pratap, N. Tijo, and M. Mony, "Fake News Detection using Python and Machine Learning," Procedia Computer Science, vol. 233, pp. 763-771, 2024/01/01/ 2024.

  38. A. Jain, A. Shakya, H. Khatter, and A. Gupta, A smart System for Fake News Detection Using Machine Learning. 2019, pp. 1-4.

  39. To view full paper, Download here


Publishing with