Find in Library
Search millions of books, articles, and more
Indexed Open Access Databases
Improving Persian Named Entity Recognition Through Multi Task Learning
oleh: Mohammad Hadi Bokaei, Abdolah Sepahvand, Mohammad Nouri
Format: | Article |
---|---|
Diterbitkan: | Iran Telecom Research Center 2021-06-01 |
Deskripsi
Named Entity Recognition is a challenging task, specially for low resource languages, such as Persian, due to the lack of massive gold data. As developing manually-annotated datasets is time consuming and expensive, we use a multitask learning (MTL) framework to exploit different datasets to enrich the extracted features and improve the accuracy of recognizing named entities in Persian news articles. Highly motivated auxiliary tasks are chosen to be included in a deep learning based structure. Additionally, we investigate the effect of chosen datasets on performance of the model. Our best model significantly outperformed the state of the art model by , according to F1 score in the phrase level.