FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning
FGTR achieves fine-grained multi-table retrieval via hierarchical LLM reasoning, improving the F_2 metric by 18% on Spider and 21% on BIRD.
Key Findings
Methodology
This paper introduces a hierarchical multi-table query method based on large language models (LLMs) called FGTR. FGTR employs a human-like reasoning strategy to first identify relevant schema elements and then retrieve corresponding cell contents, ultimately constructing a concise and accurate sub-table that aligns with the given query. The method consists of two stages: schema retrieval and cell retrieval. Through this decompositional reasoning approach, FGTR effectively transfers the textual reasoning strengths of LLMs to the domain of complex two-dimensional tabular data.
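The two-stage pipeline described above can be sketched as follows. This is a minimal illustration under assumed data structures, not the paper's implementation: `ask_llm` is a hypothetical placeholder for an actual LLM call, replaced here by naive token overlap so the sketch is runnable.

```python
def ask_llm(prompt, candidates):
    """Placeholder for an LLM selection call: naively keeps candidates
    that share a token with the prompt, standing in for real reasoning."""
    words = set(prompt.lower().split())
    return [c for c in candidates if set(str(c).lower().split()) & words]

def retrieve_subtable(query, tables):
    """tables: {name: {"columns": [...], "rows": [{col: val, ...}, ...]}}"""
    # Stage 1: schema retrieval - pick query-relevant tables and columns.
    relevant = {}
    for name, t in tables.items():
        cols = ask_llm(query, t["columns"])
        if cols:
            relevant[name] = cols
    # Stage 2: cell retrieval - keep only rows whose selected cells
    # match the query, yielding a concise sub-table per table.
    subtables = {}
    for name, cols in relevant.items():
        rows = [
            {c: r[c] for c in cols}
            for r in tables[name]["rows"]
            if ask_llm(query, [r[c] for c in cols])
        ]
        subtables[name] = {"columns": cols, "rows": rows}
    return subtables
```

The schema stage prunes the search space before any cell content is touched, which is what makes the second stage cheap relative to encoding the whole table.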
Key Results
- On the Spider dataset, FGTR improves the F_2 metric by 18% compared to previous state-of-the-art methods. This demonstrates FGTR's significant advantage in fine-grained retrieval, enabling better identification and extraction of query-relevant table content.
- On the BIRD dataset, FGTR achieves a 21% improvement in the F_2 metric. This further validates FGTR's effectiveness in multi-table retrieval tasks, especially in handling complex cross-table reasoning tasks.
- Ablation studies confirm the contribution of each component of FGTR to overall performance, particularly the key role of hierarchical reasoning strategies in enhancing retrieval precision and efficiency.
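For reference, the F_2 metric reported above is the F_beta score with beta = 2, which weights recall more heavily than precision. A short sketch; the cell-level evaluation granularity is an assumption for illustration, not stated by the paper:

```python
def f_beta(precision, recall, beta=2.0):
    """F_beta score; beta=2 (the F_2 reported above) weights recall
    four times as heavily as precision in the denominator."""
    if precision == 0 and recall == 0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)

def cell_f2(retrieved, gold):
    """F_2 over sets of retrieved vs. gold items (assumed granularity)."""
    retrieved, gold = set(retrieved), set(gold)
    tp = len(retrieved & gold)
    p = tp / len(retrieved) if retrieved else 0.0
    r = tp / len(gold) if gold else 0.0
    return f_beta(p, r, beta=2.0)
```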
Significance
The introduction of FGTR provides a new paradigm for multi-table retrieval, significantly improving both accuracy and efficiency. By retrieving at a fine granularity, FGTR prunes redundant information, which improves computational efficiency and supports complex reasoning. Beyond its academic value, the method offers practical guidance for industrial systems that handle large-scale tabular data.
Technical Contribution
FGTR's technical contribution lies in its innovative hierarchical reasoning strategy, which simulates human cognitive processes to achieve precise information localization in complex multi-table environments. Compared to existing single-table retrieval methods, FGTR excels in handling cross-table relationships, optimizing the retrieval of crucial joining keys, and supporting complex cross-table reasoning.
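As an illustration of the cross-table step, once per-table sub-tables and a shared join key have been retrieved, the final evidence can be assembled with an inner join. This is a sketch under assumed row-dict structures, not the paper's code:

```python
def inner_join(left_rows, right_rows, key):
    """Inner-join two lists of row dicts on a shared key column."""
    # Index the right side by join-key value for O(1) lookups.
    index = {}
    for r in right_rows:
        index.setdefault(r[key], []).append(r)
    joined = []
    for l in left_rows:
        for r in index.get(l[key], []):
            joined.append({**l, **r})  # merge matching rows
    return joined
```

Because FGTR retrieves the join key explicitly, only the already-pruned sub-tables participate in the join, rather than the full source tables.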
Novelty
FGTR is the first to extend fine-grained retrieval to multi-table contexts, guided by a hierarchical reasoning strategy that directs LLMs to perform logical inference. This approach not only enhances retrieval efficiency but also achieves superior performance over existing methods in complex cross-table retrieval tasks.
Limitations
- FGTR may face computational resource constraints when dealing with extremely large-scale tabular data, especially when extensive inter-table joins are required.
- The method relies heavily on the reasoning capabilities of the underlying LLM; a weaker or inadequately trained model degrades retrieval performance.
- In certain specific domains or datasets, FGTR may require additional parameter tuning to achieve optimal results.
Future Work
Future research directions include further optimizing FGTR's computational efficiency, particularly its application on large-scale datasets. Additionally, exploring the integration of FGTR with other information retrieval technologies could enhance its applicability across different fields and tasks. Investigating how to improve FGTR's performance under limited resources is also a worthwhile direction.
AI Executive Summary
With the rapid advancement of large language models (LLMs), there has been an increasing focus on LLM-based table retrieval. However, existing studies primarily concentrate on single-table queries, implementing them through similarity matching after encoding the entire table. These methods often result in low accuracy due to their coarse-grained encoding, which incorporates much query-irrelevant data, and are inefficient when dealing with large tables, failing to fully utilize the reasoning capabilities of LLMs. Moreover, multi-table queries remain under-explored in retrieval tasks. To address this, the paper proposes a hierarchical multi-table query method based on LLMs: Fine-Grained Multi-Table Retrieval (FGTR), a new retrieval paradigm that employs a human-like reasoning strategy. Through hierarchical reasoning, FGTR first identifies relevant schema elements and then retrieves the corresponding cell contents, ultimately constructing a concise and accurate sub-table that aligns with the given query. To comprehensively evaluate the performance of FGTR, two new benchmark datasets based on Spider and BIRD were constructed. Experimental results show that FGTR outperforms previous state-of-the-art methods, improving the F_2 metric by 18% on Spider and 21% on BIRD, demonstrating its effectiveness in enhancing fine-grained retrieval and its potential to improve end-to-end performance on table-based downstream tasks.
Table reasoning tasks involve reasoning over structured tables to derive answers; such tables are ubiquitous in healthcare, finance, and corporate reporting. Unlike textual data, tabular data has a two-dimensional structure with complex logical relationships among rows, columns, and cell contents, posing significant challenges for NLP. Common tasks include table question answering, table fact verification, table-to-text generation, and table semantic parsing. Because retrieved tables serve as critical inputs for reasoning, effective table retrieval is pivotal: reducing redundant table content directly improves model accuracy, computational efficiency, and complex reasoning capability.
In natural language processing (NLP), LLMs have made significant strides, opening new frontiers for table reasoning tasks. However, applying them to structured data still faces persistent challenges. First, despite their proficiency in conventional text tasks, LLMs' context-window constraints become critical when processing large-scale tables containing thousands of rows or columns. Common mitigation strategies, such as segmenting the table or extracting content under a fixed token budget, risk information loss and a corresponding degradation of reasoning accuracy. Second, LLMs demonstrate constrained capabilities in comprehending tabular structures and the logical relationships within them.
Hence, it is necessary to overcome these issues through fine-grained multi-table retrieval, since table-based reasoning is a complex task requiring human-like cognition and fine-grained comprehension. Existing studies, however, exhibit two main limitations. First, current approaches primarily encode the entire table, or its rows and columns, to retrieve question-relevant contents and construct sub-tables. Such methods depend heavily on holistic table representations, resulting in limited understanding of table structure and low encoding efficiency. In contrast, as the paper's figure illustrates, humans typically first attend to the table's schema, including column names, attributes, and sample entries, to rapidly grasp its informational framework and overall theme; on this basis, they then focus on the columns the question requires, locate the specific cell contents, and obtain the rows needed to solve the problem. Second, existing fine-grained table retrieval methods oversimplify real-world scenarios by restricting retrieval to a single table, neglecting the fact that in practice the required information is often distributed across multiple tables and must be integrated through inter-table joins and relationships.
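The token-budget mitigation for large tables mentioned earlier can be sketched as follows; the whitespace-based token count and row-wise truncation are simplifying assumptions, and the silently dropped rows make the information-loss risk concrete.

```python
def linearize_row(row):
    """Flatten a row dict into a 'col: val | col: val' string."""
    return " | ".join(f"{k}: {v}" for k, v in row.items())

def truncate_to_budget(rows, token_budget):
    """Greedily keep whole linearized rows until the budget is hit.
    Token cost is approximated by whitespace-split word count."""
    kept, used = [], 0
    for row in rows:
        cost = len(linearize_row(row).split())
        if used + cost > token_budget:
            break  # everything past this point is lost to the model
        kept.append(row)
        used += cost
    return kept
```

If the answer happens to live in a truncated row, no amount of downstream reasoning can recover it, which is precisely the failure mode fine-grained retrieval is meant to avoid.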
Deep Dive
Abstract
With the rapid advancement of large language models (LLMs), growing efforts have been made on LLM-based table retrieval. However, existing studies typically focus on single-table queries, implementing them by similarity matching after encoding the entire table. These methods usually result in low accuracy due to their coarse-grained encoding, which incorporates much query-irrelevant data, and are also inefficient when dealing with large tables, failing to fully utilize the reasoning capabilities of LLMs. Further, multi-table query is under-explored in retrieval tasks. To this end, we propose a hierarchical multi-table query method based on LLMs: Fine-Grained Multi-Table Retrieval (FGTR), a new retrieval paradigm that employs a human-like reasoning strategy. Through hierarchical reasoning, FGTR first identifies relevant schema elements and then retrieves the corresponding cell contents, ultimately constructing a concise and accurate sub-table that aligns with the given query. To comprehensively evaluate the performance of FGTR, we construct two new benchmark datasets based on Spider and BIRD. Experimental results show that FGTR outperforms previous state-of-the-art methods, improving the F_2 metric by 18% on Spider and 21% on BIRD, demonstrating its effectiveness in enhancing fine-grained retrieval and its potential to improve end-to-end performance on table-based downstream tasks.
References (20)
Is Table Retrieval a Solved Problem? Exploring Join-Aware Multi-Table Retrieval
P. Chen, Yi Zhang, Dan Roth
Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs
Yury Malkov, Dmitry A. Yashunin
PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training
Zihui Gu, Ju Fan, N. Tang et al.
Binding Language Models in Symbolic Languages
Zhoujun Cheng, Tianbao Xie, Peng Shi et al.
TabFact: A Large-scale Dataset for Table-based Fact Verification
Wenhu Chen, Hongmin Wang, Jianshu Chen et al.
Table-Critic: A Multi-Agent Framework for Collaborative Criticism and Refinement in Table Reasoning
Peiying Yu, Guoxin Chen, Jingjing Wang
Automatic Metadata Extraction for Text-to-SQL
Vladislav Shkapenyuk, D. Srivastava, T. Johnson et al.
Compositional Semantic Parsing on Semi-Structured Tables
Panupong Pasupat, Percy Liang
TaPERA: Enhancing Faithfulness and Interpretability in Long-Form Table QA by Content Planning and Execution-based Reasoning
Yilun Zhao, Lyuhao Chen, Arman Cohan et al.
Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs
Jinyang Li, Binyuan Hui, Ge Qu et al.
Spider: A Large-Scale Human-Labeled Dataset for Complex and Cross-Domain Semantic Parsing and Text-to-SQL Task
Tao Yu, Rui Zhang, Kai-Chou Yang et al.
Piece of Table: A Divide-and-Conquer Approach for Selecting Sub-Tables in Table Question Answering
Wonjin Lee, Kyumi Kim, Sungjae Lee et al.
RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL
Haoyang Li, Jing Zhang, Cuiping Li et al.
HeLM: Highlighted Evidence augmented Language Model for Enhanced Table-to-Text Generation
Junyi Bian, Xiaolei Qin, Wuhe Zou et al.
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning
Yuan Sui, Jiaru Zou, Mengyu Zhou et al.
Open Domain Question Answering over Tables via Dense Retrieval
Jonathan Herzig, Thomas Müller, Syrine Krichene et al.
Large Language Models are Versatile Decomposers: Decomposing Evidence and Questions for Table-based Reasoning
Yunhu Ye, Binyuan Hui, Min Yang et al.
TaPas: Weakly Supervised Table Parsing via Pre-training
Jonathan Herzig, Pawel Krzysztof Nowak, Thomas Müller et al.
Fantastic Tables and Where to Find Them: Table Search in Semantic Data Lakes
M. Christensen, Aristotelis Leventidis, Matteo Lissandrini et al.