A generic formalism to represent linguistic corpora in RDF and OWL/DL
This paper describes POWLA, a generic formalism to represent linguistic corpora by means of RDF and OWL/DL. Unlike earlier approaches in this direction, POWLA is not tied to a specific selection of annotation layers, but rather, it is designed to support any kind of text-oriented annotation. POWLA inherits its generic character from the underlying data model PAULA (Dipper, 2005; Chiarcos et al., 2009) that is based on early sketches of the ISO TC37/SC4 Linguistic Annotation Framework (Ide and Romary, 2004). As opposed to existing standoff XML linearizations for such generic data models, it uses RDF as representation formalism and OWL/DL for validation. The paper discusses advantages of this approach, in particular with respect to interoperability and queriability, which are illustrated for the MASC corpus, an open multi-layer corpus of American English (Ide et al., 2008).
Author: | Christian Chiarcos |
---|---|
Frontdoor URL | https://opus.bibliothek.uni-augsburg.de/opus4/104219 |
URL: | https://aclanthology.org/L12-1548/ |
ISBN: | 978-2-9517408-7-7 |
Parent Title (English): | Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12), Istanbul, Turkey, May 21-27, 2012 |
Publisher: | European Language Resources Association (ELRA) |
Place of publication: | Paris |
Editor: | Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Mehmet Uğur Doğan, Bente Maegaard, Joseph Mariani, Asuncion Moreno, Jan Odijk, Stelios Piperidis |
Type: | Conference Proceeding |
Language: | English |
Year of first Publication: | 2012 |
Release Date: | 2023/05/16 |
First Page: | 3205 |
Last Page: | 3212 |
Institutes: | Philologisch-Historische Fakultät; Philologisch-Historische Fakultät / Angewandte Computerlinguistik; Philologisch-Historische Fakultät / Angewandte Computerlinguistik / Lehrstuhl für Angewandte Computerlinguistik (ACoLi) |