Find in Library
Search millions of books, articles, and more
Indexed Open Access Databases
Word Sense Disambiguation Using Clustered Sense Labels
oleh: Jeong Yeon Park, Hyeong Jin Shin, Jae Sung Lee
Format: | Article |
---|---|
Diterbitkan: | MDPI AG 2022-02-01 |
Deskripsi
Sequence labeling models for word sense disambiguation have proven highly effective when the sense vocabulary is compressed based on the thesaurus hierarchy. In this paper, we propose a method for compressing the sense vocabulary without using a thesaurus. For this, sense definitions in a dictionary are converted into sentence vectors and clustered into the compressed senses. First, the very large set of sense vectors is partitioned for less computational complexity, and then it is clustered hierarchically with awareness of homographs. The experiment was done on the English Senseval and Semeval datasets and the Korean Sejong sense annotated corpus. This process demonstrated that the performance greatly increased compared to that of the uncompressed sense model and is comparable to that of the thesaurus-based model.