Verteilte Korpusabfragesysteme (Distributed Corpus Query Systems)
by: Tobias Roth
Format: | Article |
---|---|
Published: | Bern Open Publishing 2009-04-01 |
Description
Distributed text corpora have seen little use so far. The Swiss Text Corpus (CHTK) and its partner projects set up a distributed corpus for German ("Korpus C4"), virtually merging parts of their corpus data and making them available through a common query platform. Based on experience gained during this project, we propose a possible path towards a more standardised interface for distributed corpus queries. Such an interface should make it easier to integrate new as well as existing corpora into distributed corpus systems. Special attention is paid to problems such as responsibility assignment, performance, user management, format unification and metadata synchronisation.