Find in Library
Search millions of books, articles, and more
Indexed Open Access Databases
Presenting a Text Mining Algorithm to Identify Emotion in Persian Corpus
oleh: Masoud Garshasbi, Anahid Rais-Rohani, Mohammadreza Kabaranzadeh Ghadim
Format: | Article |
---|---|
Diterbitkan: | University of Tehran 2018-06-01 |
Deskripsi
The literature regarding Persian text mining indicates that most studies are conducted to detect polarity of opinions on social websites. The aim of this research is presenting an algorithm to identify emotion implemented in the text based on the following six main emotions of happiness, sadness, fear, anger, surprise and disgust. In this research, the emotion will be examined based on unsupervised lexicon method. Identifying emotions conveyed by the texts based on a single emotional word will produce low accuracy because the intervening boosters and negating words can influence the emotion of the text too. Therefore, the algorithm has been implemented in six approaches with different features. In the first approach, the algorithm is capable of detecting only one emotional word in a sentence, and then it improves to detect boosters and negating and stop word list as well. The results of running the algorithm on two domains of data showed that the more features used in the algorithm, the more accurate the algorithm becomes and that the most effective factor is part of speech.