Linking discourse marker inventories

  • The paper describes the first comprehensive edition of machine-readable discourse marker lexicons. Discourse markers such as and, because, but, though or thereafter are essential communicative signals in human conversation, as they indicate how an utterance relates to its communicative context. As much of this information is implicit or expressed differently in different languages, discourse parsing, context-adequate natural language generation and machine translation are considered particularly challenging aspects of Natural Language Processing. Providing this data in machine-readable, standard-compliant form will thus facilitate such technical tasks, and moreover, allow to explore techniques for translation inference to be applied to this particular group of lexical resources that was previously largely neglected in the context of Linguistic Linked (Open) Data.

Download full text files

Export metadata

Statistics

Number of document requests

Additional Services

Share in Twitter Search Google Scholar
Metadaten
Author:Christian ChiarcosORCiDGND, Maxim Ionov
URN:urn:nbn:de:bvb:384-opus4-1039971
Frontdoor URLhttps://opus.bibliothek.uni-augsburg.de/opus4/103997
ISBN:978-3-95977-199-3OPAC
ISSN:1868-8969OPAC
Parent Title (English):3rd Conference on Language, Data and Knowledge (LDK 2021), September 1–3, 2021, Zaragoza, Spain
Publisher:Schloss Dagstuhl – Leibniz-Zentrum für Informatik
Place of publication:Saarbrücken
Editor:Dagmar Gromann, Gilles Sérasset, Thierry Declerck, John P. McCrae, Jorge Gracia, Julia Bosque-Gil, Fernando Bobillo, Barbara Heinisch
Type:Conference Proceeding
Language:English
Year of first Publication:2021
Publishing Institution:Universität Augsburg
Release Date:2023/05/16
First Page:40:1
Last Page:40:15
Series:OASIcs ; 93
DOI:https://doi.org/10.4230/OASIcs.LDK.2021.40
Institutes:Philologisch-Historische Fakultät
Philologisch-Historische Fakultät / Angewandte Computerlinguistik
Philologisch-Historische Fakultät / Angewandte Computerlinguistik / Lehrstuhl für Angewandte Computerlinguistik (ACoLi)
Dewey Decimal Classification:4 Sprache / 40 Sprache / 400 Sprache
Licence (German):CC-BY 4.0: Creative Commons: Namensnennung (mit Print on Demand)