CarMem: enhancing long-term memory in LLM voice assistants through category-bounding

Kirmayr, Johannes; Stappen, Lukas; Schneider, Phillip; Matthes, Florian; André, Elisabeth

In today’s assistant landscape, personalisation enhances interactions, fosters long-term relationships, and deepens engagement. However, many systems struggle with retaining user preferences, leading to repetitive user requests and disengagement. Furthermore, the unregulated and opaque extraction of user preferences in industry applications raises significant concerns about privacy and trust, especially in regions with stringent regulations like Europe. In response to these challenges, we propose a long-term memory system for voice assistants, structured around predefined categories. This approach leverages Large Language Models to efficiently extract, store, and retrieve preferences within these categories, ensuring both personalisation and transparency. We also introduce a synthetic multi-turn, multi-session conversation dataset (CarMem), grounded in real industry data, tailored to an in-car voice assistant setting. Benchmarked on the dataset, our system achieves an F1-score of .78 to .95 in preference extraction, depending on category granularity. Our maintenance strategy reduces redundant preferences by 95% and contradictory ones by 92%, while the accuracy of optimal retrieval is at .87. Collectively, the results demonstrate the system’s suitability for industrial applications.

Metadata
Author:	Johannes Kirmayr, Lukas Stappen, Phillip Schneider, Florian Matthes, Elisabeth André
URN:	urn:nbn:de:bvb:384-opus4-1282671
Frontdoor URL:	https://opus.bibliothek.uni-augsburg.de/opus4/128267
URL:	https://aclanthology.org/2025.coling-industry.29/
ISBN:	979-8-89176-197-1
Parent Title (English):	Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025): industry track, January 19–24, 2025, Abu Dhabi, UAE
Publisher:	Association for Computational Linguistics (ACL)
Place of publication:	Stroudsburg, PA
Editor:	Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert, Kareem Darwish, Apoorv Agarwal
Type:	Conference Proceeding
Language:	English
Date of Publication (online):	2026/02/18
Year of first Publication:	2025
Publishing Institution:	Universität Augsburg
Release Date:	2026/02/18
First Page:	343
Last Page:	357
Institutes:	Fakultät für Angewandte Informatik
	Fakultät für Angewandte Informatik / Institut für Informatik
	Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Menschzentrierte Künstliche Intelligenz
Dewey Decimal Classification:	0 Computer science, information and general works / 00 Computer science, knowledge and systems / 004 Data processing; computer science
Licence:	CC-BY 4.0: Creative Commons: Attribution

Open Access
