Sentiment Analysis Framework for Telugu Text Based on Novel Contrived Passive Aggressive with Fuzzy Weighting Classifier (CPSC-FWC)

G. Janardana Naidu; M. Seshashayee

doi:10.15622/ia.23.1.2

Authors: Janardana Naidu G.¹, Seshashayee M.¹
Affiliations:
1. Gandhi Institute of Technology and Management GITAM (Deemed to be University)
Issue: Vol. 23, No. 1 (2024)
Pages: 39-64
Section: Artificial intelligence, knowledge and data engineering
URL: https://journal-vniispk.ru/2713-3192/article/view/267187
DOI: https://doi.org/10.15622/ia.23.1.2
ID: 267187

Abstract

Natural language processing (NLP) is a subfield of artificial intelligence concerned with how algorithms can interact with people in their own languages. Sentiment analysis is one of its most widely used applications, and it extends to evaluating sentiment in Telugu. Several unsupervised machine-learning algorithms, such as k-means clustering with cuckoo search, have been applied to Telugu text. However, these techniques struggle to cluster data with varying cluster sizes and densities, and they suffer from slow search speeds and poor convergence accuracy. To address these shortcomings, this study develops an ML-based sentiment analysis system for Telugu text. First, in the pre-processing stage, the proposed Linear Pursuit Algorithm (LPA) removes white space, punctuation, and stop words. Next, a Conditional Random Field with Lexicon weighting is proposed for POS tagging. Finally, a Contrived Passive Aggressive with Fuzzy Weighting Classifier (CPSC-FWC) is proposed to classify the sentiment of the Telugu text. The proposed method yields good results in terms of accuracy, precision, recall, and F1-score.
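
The pre-processing and classification stages described above can be illustrated with a minimal Python sketch. It is not the authors' LPA or CPSC-FWC: the cleaning step is a plain punctuation and stop-word filter, the classifier is the standard passive-aggressive (PA-I) update without any fuzzy weighting, and the CRF-based POS tagging stage is omitted. The stop-word list, the toy sentences, and all function names are assumptions made for this example.

```python
import unicodedata
from collections import Counter

# Hypothetical stop-word list, for illustration only (not from the paper).
STOP_WORDS = {"ఈ", "ఆ", "ఒక", "మరియు"}


def preprocess(text, stop_words=STOP_WORDS):
    """Strip punctuation, collapse white space, and drop stop words."""
    # Replace each Unicode punctuation character (category P*) with a space.
    cleaned = "".join(
        " " if unicodedata.category(ch).startswith("P") else ch for ch in text
    )
    # str.split() with no argument also collapses runs of white space.
    return [tok for tok in cleaned.split() if tok not in stop_words]


class PassiveAggressive:
    """Standard binary PA-I learner over sparse bag-of-words features."""

    def __init__(self, C=1.0):
        self.C = C    # cap on the size of each update step
        self.w = {}   # sparse weight vector: token -> weight

    def score(self, feats):
        return sum(self.w.get(f, 0.0) * v for f, v in feats.items())

    def predict(self, feats):
        return 1 if self.score(feats) >= 0 else -1

    def update(self, feats, y):
        # Hinge loss; the learner stays passive when the margin is >= 1.
        loss = max(0.0, 1.0 - y * self.score(feats))
        norm_sq = sum(v * v for v in feats.values())
        if loss > 0.0 and norm_sq > 0.0:
            tau = min(self.C, loss / norm_sq)  # PA-I step size
            for f, v in feats.items():
                self.w[f] = self.w.get(f, 0.0) + tau * y * v


def featurize(text):
    """Bag-of-words counts over the pre-processed tokens."""
    return Counter(preprocess(text))


def train(examples, epochs=3, C=1.0):
    model = PassiveAggressive(C=C)
    for _ in range(epochs):
        for text, label in examples:
            model.update(featurize(text), label)
    return model


# Toy labelled data (+1 positive, -1 negative), invented for this sketch.
TOY_DATA = [
    ("ఈ సినిమా చాలా బాగుంది!", 1),
    ("ఈ సినిమా చెత్తగా ఉంది.", -1),
]

if __name__ == "__main__":
    model = train(TOY_DATA)
    print(model.predict(featurize("పాట బాగుంది")))
```

The online update changes the weights only when an example falls inside the margin, which is what makes the learner "passive" on well-classified inputs and "aggressive" otherwise. A fuzzy weighting scheme such as the one the paper names would presumably modify this step, but the abstract does not say how.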

Keywords

machine learning, natural language processing, polarity, sentiment analysis, Telugu

About the authors

G. Janardana Naidu

Gandhi Institute of Technology and Management GITAM (Deemed to be University)

Corresponding author
Email: jana.766@gmail.com
Gandhi Nagar, Rushikonda -

M. Seshashayee

Gandhi Institute of Technology and Management GITAM (Deemed to be University)

Email: smaruvad@gitam.edu
Gandhi Nagar, Rushikonda -
