Sentiment Analysis Using Machine Learning Methods on News Texts
The categorization of intricate emotional phenomena in news texts permits the assessment of the impact of such news on society. In this context, it is evident that the existing body of research on emotion analysis in Turkish texts is insufficient. The objective of this study is to identify the emotions of "fear," "happiness," "anger," "sadness," and "surprise" in Turkish news texts using machine learning methods. In consideration of the Turkish language structure, the adjectives and nouns were parsed on a specially prepared corpus and dictionary groups specific to this study were created with the use of the TF-IDF and Double Normalization methods. Subsequently, these lexicon groups were employed for the classification of sentiment in news texts using a range of machine learning techniques, including Multilayer Perceptron (MLP), Support Vector Machines (SVM), Random Forest (RF), LightGBM, XGBoost, GPT-2, BERT, DistilBERT, and XLM ROBERTa. The evaluation metrics employed were F1-score, accuracy, recall, and precision. In the training phase of the model, the dictionary group created with Double Normalization, the GPT-2 model, the Grid Search optimization method, and cross-validation were employed in order to achieve the highest F1-score of 0.99 and accuracy of 0.99. It is anticipated that the methods and dataset utilized in this study will contribute to the existing literature on sentiment classification in Turkish texts and provide insights that will inform future similar studies.
Kazan, H. (2021). Dijital habercilik bağlamında trajik haberlerde etik ve bireyler üzerindeki etkisi üzerine bir araştırma. International Journal of Social Inquiry, 14(2), 661-692.
Birjali, M., Kasri, M., & Beni-Hssane, A. (2021). A comprehensive survey on sentiment analysis: Approaches, challenges and trends. Knowledge-Based Systems, 226, 107134.
Mäntylä, M. V., Graziotin, D., & Kuutila, M. (2018). The evolution of sentiment analysis—A review of research topics, venues, and top cited papers. Computer Science Review, 27, 16-32.
Acheampong, F. A., Wenyu, C., & Nunoo‐Mensah, H. (2020). Text‐based emotion detection: Advances, challenges, and opportunities. Engineering Reports, 2(7), e12189.
Arazy, O., & Woo, C. (2007). Enhancing information retrieval through statistical natural language processign: a study of collocation indexing. Mis Quarterly, 525-546.
Türkmenoglu, C., & Tantug, A. C. (2014, June). Sentiment analysis in Turkish media. In International Conference on Machine Learning (ICML).
Akba, F., Uçan, A., Sezer, E. A., & Sever, H. (2014, July). Assessment of feature selection metrics for sentiment analyses: Turkish movie reviews. In 8th European Conference on Data Mining (Vol. 191, No. 2002, pp. 180-184).
Velioğlu, R., Yıldız, T., & Yıldırım, S. (2018, September). Sentiment analysis using learning approaches over emojis for Turkish tweets. In 2018 3rd International Conference on Computer Science and Engineering (UBMK) (pp. 303-307). IEEE.
Shehu, H. A., & Tokat, S. (2020). A hybrid approach for the sentiment analysis of Turkish Twitter data. In Artificial Intelligence and Applied Mathematics in Engineering Problems: Proceedings of the International Conference on Artificial Intelligence and Applied Mathematics in Engineering (ICAIAME 2019) (pp. 182-190). Springer International Publishing.
Onan, A. (2017). Twitter mesajlari üzerinde makine öğrenmesi yöntemlerine dayali duygu analizi. Yönetim Bilişim Sistemleri Dergisi, 3(2), 1-14.
Demirci, G. M., Keskin, Ş. R., & Doğan, G. (2019, December). Sentiment analysis in Turkish with deep learning. In 2019 IEEE international conference on big data (big data) (pp. 2215-2221). IEEE.
Asghar, M. Z., Subhan, F., Imran, M., Kundi, F. M., Shamshirband, S., Mosavi, A., ... & Varkonyi-Koczy, A. R. (2019). Performance evaluation of supervised machine learning techniques for efficient detection of emotions from online content. arXiv preprint arXiv:1908.01587.
Nasir, A. F. A., Nee, E. S., Choong, C. S., Ghani, A. S. A., Majeed, A. P. A., Adam, A., & Furqan, M. (2020, February). Text-based emotion prediction system using machine learning approach. In IOP Conference Series: Materials Science and Engineering (Vol. 769, No. 1, p. 012022). IOP Publishing.
Hasan, M., Rundensteiner, E., & Agu, E. (2019). Automatic emotion detection in text streams by analyzing twitter data. International Journal of Data Science and Analytics, 7, 35-51.
Güven, Z. A., Diri, B., & Çakaloğlu, T. (2018, April). Classification of TurkishTweet emotions by n-stage Latent Dirichlet Allocation. In 2018 Electric Electronics, Computer Science, Biomedical Engineerings' Meeting (EBBT) (pp. 1-4). IEEE.
Balli, C., Guzel, M. S., Bostanci, E., & Mishra, A. (2022). Sentimental analysis of Twitter users from Turkish content with natural language processing. Computational Intelligence and Neuroscience, 2022(1), 2455160.
Boynukalin, Z., & Karagoz, P. (2013, September). Emotion analysis on Turkish texts. In Information Sciences and Systems 2013: Proceedings of the 28th International Symposium on Computer and Information Sciences (pp. 159-168). Cham: Springer International Publishing.
Demirci, S. (2015). Emotion analysis on Turkish tweets (Master's thesis, Middle East Technical University).
Tocoglu, M. A., & Alpkocak, A. (2018). TREMO: A dataset for emotion analysis in Turkish. Journal of Information Science, 44(6), 848-860.
Toçoğlu, M. A., & Alpkocak, A. (2019). Lexicon-based emotion analysis in Turkish. Turkish Journal of Electrical Engineering and Computer Sciences, 27(2), 1213-1227.
Tocoglu, M. A., Ozturkmenoglu, O., & Alpkocak, A. (2019). Emotion analysis from Turkish tweets using deep neural networks. IEEE Access, 7, 183061-183069.
López-Chau, A., Valle-Cruz, D., & Sandoval-Almazán, R. (2020). Sentiment analysis of Twitter data through machine learning techniques. Software engineering in the era of cloud computing, 185-209.
Nasir, A. F. A., Nee, E. S., Choong, C. S., Ghani, A. S. A., Majeed, A. P. A., Adam, A., & Furqan, M. (2020, February). Text-based emotion prediction system using machine learning approach. In IOP Conference Series: Materials Science and Engineering (Vol. 769, No. 1, p. 012022). IOP Publishing.
Yohanes, D., Putra, J. S., Filbert, K., Suryaningrum, K. M., & Saputri, H. A. (2023). Emotion Detection in Textual Data using Deep Learning. Procedia Computer Science, 227, 464-473.
Sam Abraham, S., VL, L., & P Gangan, M. (2022). Readers’ affect: Predicting and understanding readers’ emotions with deep learning. Journal of Big Data, 9(1), 1-31.
Komprehend, Text Analysis APIs, 2020. [Çevrimiçi]. Available: https://komprehend.io/#emotion.
Devlin, J. (2018). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805.
Cortes, C. (1995). Support-Vector Networks. Machine Learning.
Murphy, K. P. (2006). Naive bayes classifiers. University of British Columbia, 18(60), 1-8.
Breiman, L. (2001). Random forests. Machine learning, 45, 5-32.
Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., & Sutskever, I. (2019). Language models are unsupervised multitask learners. OpenAI blog, 1(8), 9.
GuolinKe, Q. M., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., & Liu, T. Y. (2017). Lightgbm: A highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst, 30, 52.
Copyright (c) 2024 International Journal of Engineering and Computer Science

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.