TY  - CONF
A1  - da Silva Cardoso, Heike
A1  - Wolska, Magdalena
A2  - Volodina, Elena
A2  - Borin, Lars
A2  - Pilán, Ildikó
T1  - Misspellings in responses to listening comprehension questions: prospects for scoring based on phonetic normalization
T2  - Proceedings of the 4th workshop on NLP for Computer Assisted Language Learning at NODALIDA 2015, 11th May 2015, Vilnius, Lithuania
N2  - Automated scoring systems which evaluate content require robust ways of dealing with form errors. The work presented in this paper is set in the context of scoring learners’ responses to listening comprehension items included in a placement test of German as a foreign language. Based on a corpus of over 3000 responses to 17 questions, by test takers of different language proficiencies, we perform a quantitative analysis of the diversity in misspellings. We evaluate the performance of an off-the-shelf open source spell-checker on our data showing that around 45% of the reported non-word errors are not correctly accounted for, that is, they are either falsely identified as misspelt or the spell-checker is unable to identify the intended word. We propose to address misspellings in computer-based scoring of constructed response items by means of phonetic normalization. Learner responses transcribed into Soundex codes and into two encodings borrowed from historical linguistics (ASJP and Dolgopolsky’s sound classes) are compared to transcribed reference answers using string distance measures. We show that reliable correlation with teachers’ scores can be obtained, however, similarity thresholds are item-specific.
Y1  - 2024
UR  - https://opus.bibliothek.uni-augsburg.de/opus4/frontdoor/index/index/docId/113442
UR  - https://nbn-resolving.org/urn:nbn:de:bvb:384-opus4-1134421
UR  - https://ep.liu.se/en/conference-article.aspx?series=ecp&issue=114&Article_No=2
SN  - 978-91-7519-036-5
SN  - 1650-3686
SP  - 1
EP  - 10
PB  - Linköping University Electronic Press
CY  - Linköping
ER  - 