A Metamodel Enabled Approach for Discovery of Coherent Topics in Short Text Microblogs
by: Herman Wandabwa, Muhammad Asif Naeem, Russel Pears, Farhaan Mirza
| Format: | Article |
|---|---|
| Published: | IEEE 2018-01-01 |
Description
Comprehending social media discussions in short text microblogs is fundamental for knowledge-based applications such as recommender systems. Twitter, for example, provides rich real-time information in keeping with its streaming nature. Making sense of such data without automated support is not feasible because of its vast size and streaming nature. The problem becomes more complex when the data in question have low topical diversity. Therefore, an automatic method for understanding textual patterns in such topically constrained data needs to be developed. A major challenge in building such a system lies in its ability to comprehend the nature of the data with regard to the diversity of word structure correlations, vocabulary sparsity, and distinguishing factors in the generated topics. In this paper, we present a novel semi-supervised approach called metamodel enabled latent Dirichlet allocation to address this challenge. Compared to state-of-the-art approaches, our model incorporates a domain-specific metamodel. The metamodel is defined as a set of topic label vectors derived from long texts to guide the learning process in shorter texts.