G-MACT at SemEval-2025 Task 8: exploring planning and tool use in question answering over tabular data

Zhou, Wei; Mesgar, Mohsen; Friedrich, Annemarie; Adel, Heike

This paper describes our system submitted to SemEval-2025 Task 8, “Question Answering over Tabular Data.” The shared task focuses on real-life table question answering (TQA) involving extremely large tables, with the additional challenge of interpreting complex questions. To address these issues, we leverage a framework of Multi-Agent Collaboration with Tool use (MACT), a method that combines planning and tool use. The planning module breaks down a complex question by designing a step-by-step plan. This plan is translated into Python code by a coding model, and a Python interpreter executes the code to generate an answer. Our system demonstrates competitive performance in the shared task and is ranked 5th out of 38 in the open-source model category. We provide a detailed analysis of our model, evaluating the effectiveness and the efficiency of each component, and identify common error patterns. Our paper offers essential insights and recommendations for future advancements in developing TQA systems.
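The plan, code, execute flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`plan_question`, `generate_code`, `execute`, `answer_question`), the toy table, and the hard-coded plan and code are all hypothetical stand-ins. In a real system the two stubs would presumably call a planning model and a coding model.

```python
from typing import Callable

# A table is represented here as a list of row dictionaries (an assumption
# made to keep the sketch dependency-free).
Table = list[dict[str, object]]


def plan_question(question: str) -> list[str]:
    """Hypothetical planner stub; a real system would prompt a planning model."""
    return [
        "Keep only the rows where 'country' equals 'AT'.",
        "Sum the 'sales' column of the remaining rows.",
    ]


def generate_code(plan: list[str]) -> str:
    """Hypothetical coder stub; a real system would prompt a coding model
    with the plan and the table schema."""
    return (
        "rows = [r for r in table if r['country'] == 'AT']\n"
        "answer = sum(r['sales'] for r in rows)\n"
    )


def execute(code: str, table: Table) -> object:
    """Run the generated code with the table in scope and return `answer`.

    Note: exec() on model-generated code is unsafe outside a sandbox.
    """
    namespace: dict[str, object] = {"table": table}
    exec(code, namespace)
    return namespace["answer"]


def answer_question(
    question: str,
    table: Table,
    planner: Callable[[str], list[str]] = plan_question,
    coder: Callable[[list[str]], str] = generate_code,
) -> object:
    """Chain the three stages: plan, translate the plan to code, execute."""
    plan = planner(question)
    code = coder(plan)
    return execute(code, table)


demo_table: Table = [
    {"country": "AT", "sales": 10},
    {"country": "DE", "sales": 7},
    {"country": "AT", "sales": 5},
]
print(answer_question("What are the total sales in AT?", demo_table))  # prints 15
```

Passing the planner and coder as parameters keeps the three stages separable, which is one way to evaluate each component in isolation.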

Metadata
Author:	Wei Zhou, Mohsen Mesgar, Annemarie Friedrich, Heike Adel
URN:	urn:nbn:de:bvb:384-opus4-1264073
Frontdoor URL:	https://opus.bibliothek.uni-augsburg.de/opus4/126407
URL:	https://aclanthology.org/2025.semeval-1.100/
ISBN:	979-8-89176-273-2
Parent Title (English):	Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), 31 July - 1 August 2025, Vienna, Austria
Publisher:	Association for Computational Linguistics (ACL)
Place of publication:	Stroudsburg, PA
Editor:	Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Type:	Conference Proceeding
Language:	English
Date of Publication (online):	2025/11/20
Year of first Publication:	2025
Publishing Institution:	Universität Augsburg
Release Date:	2025/11/26
First Page:	726
Last Page:	742
Institutes:	Fakultät für Angewandte Informatik
	Fakultät für Angewandte Informatik / Institut für Informatik
	Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Computerlinguistik
Dewey Decimal Classification:	0 Computer science, information & general works / 00 Computer science, knowledge & systems / 004 Data processing & computer science
Licence:	CC-BY 4.0: Creative Commons: Attribution

Open Access
