arXiv:2505.17565v2
Abstract: Table question answering (TQA) focuses on answering questions based on tabular data. Developing TQA systems targets effective interaction with tabular data for tasks such as cell retrieval and data analysis. While recent work has leveraged fine-tuning to improve TQA systems, existing approaches often under-utilize available data and neglect the potential of post-training for further gains. In this work, we introduce p2-TQA, a process-based preference learning framework for TQA post-training. p2-TQA automatically constructs process-based preference data via a table-specific pipeline, eliminating the need for manual or costly data collection. It then optimizes models through contrastive learning on the collected data. Experiments show that p2-TQA effectively improves TQA models by up to 5% on in-domain datasets and 2.4% on out-of-domain datasets with only 8,000 training instances. Furthermore, models enhanced with p2-TQA achieve competitive results against larger, more complex state-of-the-art TQA systems, while maintaining up to five times higher efficiency.
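
The abstract does not say which contrastive objective p2-TQA uses or how a preference record is laid out. As a rough illustration only, the sketch below assumes a DPO-style pairwise loss (a common choice for preference learning, swapped in here as an assumption) applied to one pair of reasoning steps. The `StepPreference` fields, the function names, and the `beta` value are all hypothetical and do not come from the paper.

```python
import math
from dataclasses import dataclass


@dataclass
class StepPreference:
    """Hypothetical record for one process-based preference pair."""
    question: str
    table: list[list[str]]   # rows of cell strings
    prefix_steps: list[str]  # reasoning steps shared by both candidates
    chosen_step: str         # next step judged better
    rejected_step: str       # next step judged worse


def softplus(x: float) -> float:
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def dpo_pair_loss(logp_chosen: float, logp_rejected: float,
                  ref_logp_chosen: float, ref_logp_rejected: float,
                  beta: float = 0.1) -> float:
    """DPO-style loss for one pair (assumed objective, not confirmed by the source).

    Inputs are log-probabilities of the chosen and rejected steps under the
    model being trained and under a frozen reference model.
    """
    margin = beta * ((logp_chosen - ref_logp_chosen)
                     - (logp_rejected - ref_logp_rejected))
    # -log(sigmoid(margin)) equals softplus(-margin).
    return softplus(-margin)


if __name__ == "__main__":
    pair = StepPreference(
        question="Which city has the larger population?",
        table=[["city", "population"], ["A", "120"], ["B", "95"]],
        prefix_steps=["Locate the population column."],
        chosen_step="Compare 120 and 95; A is larger.",
        rejected_step="Compare the city names alphabetically.",
    )
    # Illustrative log-probabilities; in practice these come from the models.
    print(pair.chosen_step, dpo_pair_loss(-0.5, -2.0, -1.0, -1.0))
```

Under this assumed objective, the loss equals log 2 when the trained model and the reference model agree on both steps, and it falls as the trained model shifts probability toward the chosen step relative to the reference.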

The p2-TQA framework improves table question answering (TQA) models through process-based preference learning, with gains of up to 5% on in-domain datasets and 2.4% on out-of-domain datasets from only 8,000 training instances. Models enhanced with p2-TQA are competitive with larger, more complex state-of-the-art TQA systems while being up to five times more efficient.

p2-TQA: A Process-based Preference Learning Framework for Self-Improving Table Question Answering Models
