Access the full text.
Sign up today, get DeepDyve free for 14 days.
Synthesis Lectures on Human Language Technologies, 3
Nawaf Abdulla, N. Ahmed, M. Shehab, M. Al-Ayyoub, M. Al-Kabi, Saleh Al-Rifai (2014)
Towards Improving the Lexicon-Based Approach for Arabic Sentiment AnalysisInt. J. Inf. Technol. Web Eng., 9
International Journal of Advanced Computer Science and Applications, 8
I. Guellil, A. Faical (2017)
Bilingual lexicon for Algerian Arabic dialect treatment in social media. In: WiNLP: women and underrepresented minorities in natural language processing (co-located with ACL 2017)
Nora Al-Twairesh, Hend Al-Khalifa, A. Al-Salman, Y. Al-Ohali (2017)
AraSenTi-Tweet: A Corpus for Arabic Sentiment Analysis of Saudi Tweets
M'hamed Mataoui, Omar Zelmati, Madiha Boumechache (2016)
A Proposed Lexicon-Based Sentiment Analysis Approach for the Vernacular Algerian ArabicRes. Comput. Sci., 110
Sadam Al-Azani, El-Sayed El-Alfy (2017)
Using Word Embedding and Ensemble Learning for Highly Imbalanced Data Sentiment Analysis in Short Arabic Text
Walaa Medhat, A. Hassan, H. Korashy (2014)
Sentiment analysis algorithms and applications: A surveyAin Shams Engineering Journal, 5
Maite Taboada, Julian Brooke, Milan Tofiloski, Kimberly Voll, Manfred Stede (2011)
Lexicon-Based Methods for Sentiment AnalysisComputational Linguistics, 37
Information Processing and Management
Haiyun Peng, E. Cambria, A. Hussain (2017)
A Review of Sentiment Analysis Research in Chinese LanguageCognitive Computation, 9
Mohammed Rushdi-Saleh, M. Martín-Valdivia, L. López, José Ortega (2011)
OCA: Opinion corpus for ArabicJournal of the American Society for Information Science and Technology, 62
Ghadah Alwakid, T. Osman, T. Hughes-Roberts (2017)
Challenges in Sentiment Analysis for Arabic Social Networks
International Journal of Advanced Computer Science and Applications, 6
Mohamed Elarnaoty, S. Abdelrahman, A. Fahmy (2012)
A Machine Learning Approach For Opinion Holder Extraction In Arabic LanguageArXiv, abs/1206.1011
Muhammad Abdul-Mageed, Mona Diab, Sandra Kübler (2014)
SAMAR: Subjectivity and sentiment analysis for Arabic social mediaComput. Speech Lang., 28
Journal of King Saud University-Computer and Information Sciences
Khalid Khalifa, N. Omar (2014)
A Hybrid method using Lexicon-based Approach and Naive Bayes Classifier for Arabic Opinion Question AnsweringJ. Comput. Sci., 10
International Journal of Scientific and Engineering Research, 7
International Journal of Social Network Mining, 2
International Science Index, 9
This paper aims to propose an approach to automatically annotate a large corpus in Arabic dialect. This corpus is used in order to analyse sentiments of Arabic users on social medias. It focuses on the Algerian dialect, which is a sub-dialect of Maghrebi Arabic. Although Algerian is spoken by roughly 40 million speakers, few studies address the automated processing in general and the sentiment analysis in specific for Algerian.Design/methodology/approachThe approach is based on the construction and use of a sentiment lexicon to automatically annotate a large corpus of Algerian text that is extracted from Facebook. Using this approach allow to significantly increase the size of the training corpus without calling the manual annotation. The annotated corpus is then vectorized using document embedding (doc2vec), which is an extension of word embeddings (word2vec). For sentiments classification, the authors used different classifiers such as support vector machines (SVM), Naive Bayes (NB) and logistic regression (LR).FindingsThe results suggest that NB and SVM classifiers generally led to the best results and MLP generally had the worst results. Further, the threshold that the authors use in selecting messages for the training set had a noticeable impact on recall and precision, with a threshold of 0.6 producing the best results. Using PV-DBOW led to slightly higher results than using PV-DM. Combining PV-DBOW and PV-DM representations led to slightly lower results than using PV-DBOW alone. The best results were obtained by the NB classifier with F1 up to 86.9 per cent.Originality/valueThe principal originality of this paper is to determine the right parameters for automatically annotating an Algerian dialect corpus. This annotation is based on a sentiment lexicon that was also constructed automatically.
International Journal of Web Information Systems – Emerald Publishing
Published: Oct 15, 2019
Keywords: Arabic sentiment analysis; Algerian dialect; Sentiment lexicon; Sentiment corpus; Doc2vec
Read and print from thousands of top scholarly journals.
Already have an account? Log in
Bookmark this article. You can see your Bookmarks on your DeepDyve Library.
To save an article, log in first, or sign up for a DeepDyve account if you don’t already have one.
Copy and paste the desired citation format or use the link below to download a file formatted for EndNote
Access the full text.
Sign up today, get DeepDyve free for 14 days.
All DeepDyve websites use cookies to improve your online experience. They were placed on your computer when you launched this website. You can change your cookie settings through your browser.