Prosody in automatic speech processing
- Automatic speech processing (ASP) is understood as covering word recognition, the processing of higher linguistic components (syntax, semantics, and pragmatics), and the processing of computational paralinguistics (CP), which deals with speaker states and traits. This chapter attempts to track the role of prosody in ASP from the word level up to CP. A short history of the field from 1980 to 2020 distinguishes the early years (until 2000)—when the prosodic contribution to the modelling of linguistic phenomena, such as accents, boundaries, syntax, semantics, and dialogue acts, was the focus—from the later years, when the focus shifted to paralinguistics; prosody ceased to be visible. Different types of predictor variables are addressed, among them high-performance power features as well as leverage features, which can also be employed in teaching and therapy.
Author: | Anton BatlinerGND, Bernd Möbius |
---|---|
URN: | urn:nbn:de:bvb:384-opus4-712875 |
Frontdoor URL | https://opus.bibliothek.uni-augsburg.de/opus4/71287 |
ISBN: | 9780198832232OPAC |
Parent Title (English): | The Oxford handbook of language prosody |
Publisher: | Oxford University Press |
Place of publication: | Oxford |
Editor: | C. Gussenhoven, A. Chen |
Type: | Part of a Book |
Language: | English |
Year of first Publication: | 2020 |
Embargo Date: | 2022/12/31 |
Publishing Institution: | Universität Augsburg |
Release Date: | 2020/02/24 |
First Page: | 633 |
Last Page: | 645 |
DOI: | https://doi.org/10.1093/oxfordhb/9780198832232.013.42 |
Institutes: | Fakultät für Angewandte Informatik |
Fakultät für Angewandte Informatik / Institut für Informatik | |
Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Embedded Intelligence for Health Care and Wellbeing | |
Dewey Decimal Classification: | 0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik |
Licence (German): | Deutsches Urheberrecht |