SENTIMENT ANALYSIS OF INDONESIA’S DIGITAL WALLET USING COMBINATION MACHINE LEARNING AND EMOTICON WEIGHT
Bina Nusantara University
Indonesia
Bina Nusantara University
Indonesia
Abstract
Opinions on social media can be used to determine user sentiment by using sentiment analysis concept. Sentiment analysis requires several important stages, namely, preprocessing, feature extraction and classification method stages. The preprocessing stage was carried out to eliminate inconsistent data. In previous research, punctuation marks removal was applied at the preprocessing stage which can eliminate the emoticon position. Emoticons are a combination of punctuation marks. According to previous research, the emoticon feature has no contribution in sentiment analysis. There is another suggestion to maintain an emoticon position like converting an emoticon into a more relevant word such as :) into a “smile”. However, the feature of emoticon weights has not been considered in the sentiment analysis process. In order to consider the role of emoticons and to improve sentiment analysis performance, we propose using a combination of machine learning and emoticon weights. We perform emoticon weight based on probability and sentiment score. Each probability value and sentiment score of the emoticon will be normalized using the z-score method. There are several machine learning methods that have the best classification success rates, namely, Naïve Bayes and SVM. Based on the evaluation results of the proposed model, the best accuracy is 87% - 89% when using the combination of machine learning and emoticon sentiment score. Based on the results also show that the emoticon sentiment score has a significant effect on the accuracy of sentiment analysis.
Keywords
References
A. J. Levitin, "Pandora's Digital Box: The Promise and Perils of Digital Wallets," U. Pa. L. Rev., pp. 305-376, 2017.
Bank Indonesia, "Payment System License Information," 14 December 2019. [Online]. Available: https://www.bi.go.id/id/sistem-pembayaran/informasi-perizinan/uang-elektronik/penyelenggara-berizin/Contents/Default.aspx.
V. D. Devita, "Who is the E-wallet Application with the Most Users in Indonesia?," 12 December 2020. [Online]. Available: https://iprice.co.id/trend/insights/e-wallet-terbaik-di-indonesia/.
E. Cambria, D. Das, S. Bandyopadhyay and A. Feraco, A Practical Guide to Sentiment Analysis, Switzerland: Springer International Publishing, 2017.
B. Liu, "Sentiment Analysis and Opinion Mining," Synthesis lectures on human language technologies, vol. 5, no. 1, pp. 1-167, 2012.
N. I. Prabaningtyas, I. Surjandari and E. Laoh, "Mining Customers Opinion on Services and Applications of Mobile Payment Companies in Indonesia Using Sentiment Analysis Approach," in The 16th International Conference on Service Systems and Service Management (ICSSSM), 2019.
V. S. Shirsat, R. S. Jagdale and S. N. Deshmukh, "Sentence Level Sentiment Identification and Calculation from News Articles Using Machine Learning Techniques," in International Conference Computing, Communication and Signal Processing (ICCASP 2018), Singapore, 2019.
M. I. Zul, F. Yulia and D. Nurmalasari, "Social Media Sentiment Analysis Using K-Means and Naïve Bayes Algorithm," in 2018 2nd International Conference on Electrical Engineering and Informatics (ICon EEI), Batam, 2018.
T. Hu, H. Guo, H. Sun, T. V. T. Nguyen and J. Luo, "Spice Up Your Chat: The Intentions and Sentiment Effects of Using Emojis," in In Eleventh International AAAI Conference on Web and Social Media, Canada, 2017.
H. Wang and J. A. Castanon, "Sentiment Expression via Emoticons on Social Media," in IEEE International Conference on Big Data (Big Data), 2015.
I. P. Windasari, F. N. Uzzi and K. I. Satoto, "Sentiment analysis on Twitter posts: An analysis of positive or negative opinion on GoJek," in 2017 4th international conference on information technology, computer, and electrical engineering (ICITACEE), Semarang, 2017.
P. S. Dandannavar, S. R. Mangalwede and S. B. Deshpande, "Emoticons and Their Effects on Sentiment Analysis of Twitter Data," in EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing, 2020.
R. Garreta and G. Moncecchi, Learning scikit-learn: machine learning in python, Birmingham: Packt Publishing Ltd, 2013.
G. Ignatow and R. Mihalcea, An introduction to text mining: Research design, data collection, and analysis, New Delhi: Sage Publications, 2017.
P. K. Novak, J. Smailovic, B. Sluban and I. Mozetic, "Sentiment of emojis," PLoS ONE, vol. 10, no. 12, 2015.
D. H. Wahid and S. N. Azhari, "Peringkasan Sentimen Esktraktif di Twitter Menggunakan Hybrid TF-IDF dan Cosine Similarity," IJCCS (Indonesian Journal of Computing and Cybernetics Systems), pp. 207-218, 2016.
F. Z. Tala, "A study of stemming effects on information retrieval in Bahasa Indonesia," M.S. thesis. M.Sc. Thesis. Master of Logic Project. Institute for Logic, Language and Computation. Universiteti van Amsterdam The Netherlands., 2003.
A. Librian, "JSastrawi," 18 July 2020. [Online]. Available: https://github.com/jsastrawi/jsastrawi/.
F. K. Chopra and R. Bhatia, "Sentiment analyzing by dictionary based approach," International Journal of Computer Applications, vol. 152, no. 5, pp. 32-34, 2016.
J. Han, M. Kamber and J. Pei, "Data mining concepts and techniques third edition," The Morgan Kaufmann Series in Data Management Systems, vol. 5, no. 4, pp. 83-124, 2011.
K. Wegrzyn-Wolska, L. Bougueroua, H. Yu and J. Zhong, "Explore the effects of emoticons on Twitter sentiment analysis," Comput. Sci. Inf. Technol, vol. 2, pp. 65-77, 2016.
R. Pal, U. Pawar, K. Zambare and V. Hole, "Predicting Stock Market Movement Based on Twitter Data and News Articles Using Sentiment Analysis and Fuzzy Logic," in International Conference on Computer Networks and Inventive Communication Technologies, 2019.