This is a curated list of awesome table structure recognition (TSR) research, including state-of-the-art (SOTA) models, influential papers, popular datasets, and open-source code. It is continuously updated.
If you find a relevant academic paper that is not yet included in this list, please submit an update request. Contributions from everyone are welcome.

| Name | Tables | Data Type | TD | TSR | TCR | Year | 
|---|---|---|---|---|---|---|
| ICDAR2013 | 156 | Digital | √ | √ | √ | 2013 | 
| SciTSR | 15K | Digital | × | √ | √ | 2019 | 
| TableBank | 417K | Digital | √ | × | × | 2020 | 
| TableBank | 145K | Digital | × | √ | × | 2020 | 
| PubTabNet | 1M+ | Digital | × | √ | √ | 2020 | 
| PubTables-1M | 1M+ | Digital | √ | √ | × | 2021 | 
| FinTabNet | 91596 | Digital | × | √ | √ | 2021 | 
| WTW | 14581 | Both | × | √ | × | 2021 | 
| SynthTabNet | 600K | Digital | × | √ | √ | 2022 | 
| TabRecSet | 38177 | Both | √ | √ | √ | 2023 | 
| iFLYTAB | 12104 | Both | √ | √ | × | 2023 | 

- TD: Table Detection
- TSR: Table Structure Recognition
- TCR: Table Content Recognition
- Both: the dataset includes both digital and physical data

| Method Type | Method Name | Citation Count | Year | Venue | Open Source |
|---|---|---|---|---|---|
| Bottom-up | ReS2TIM | 37 | 2019 | ICDAR | √ |
| Bottom-up | CascadeTabNet | 182 | 2020 | CVPR | √ |
| Bottom-up | TabStruct-Net | 87 | 2020 | ECCV | |
| Bottom-up | LGPMA | 63 | 2021 | ICDAR | √ |
| Bottom-up | Cycle-CenterNet | 39 | 2021 | ICCV | |
| Bottom-up | TGRNet | 42 | 2021 | ICCV | √ |
| Bottom-up | FLAG-Net | 30 | 2021 | MM | |
| Bottom-up | GTE | 116 | 2021 | WACV | |
| Bottom-up | NCGM | 25 | 2022 | CVPR | |
| Bottom-up | LORE | 7 | 2023 | AAAI | |
| Image2Markup | EDD | 171 | 2020 | ECCV | √ |
| Image2Markup | TableMaster | 34 | 2021 | ICDAR | √ |
| Image2Markup | TableFormer | 41 | 2022 | CVPR | |
| Image2Markup | VAST | 12 | 2023 | CVPR | |
| Image2Markup | UniTable | | 2024 | | √ |
| Image2Markup | OmniParser | | 2024 | CVPR | √ |
| Split-and-Merge Based | SPLERGE | 91 | 2019 | ICDAR | √ |
| Split-and-Merge Based | SEM | 40 | 2022 | PR | √ |
| Split-and-Merge Based | TSRFormer | 20 | 2022 | MM | |
| Split-and-Merge Based | RobusTabNet | 32 | 2023 | PR | |
| Split-and-Merge Based | SEMv2 | 3 | 2024 | PR | √ |
| Split-and-Merge Based | TSRFormer-DQ-DETR | | 2024 | PR | |
| Others | TableNet | 188 | 2019 | ICDAR | √ |
| Others | DETR | 60 | 2022 | CVPR | √ |
| Others | TRACE | 3 | 2023 | ICDAR | √ |

- OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition (Paper, code)

- SEMv2: Table Separation Line Detection Based on Conditional Convolution (Paper, code)
- Robust table structure recognition with dynamic queries enhanced detection transformer (Paper)

- Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling (Paper)

- LORE: Logical Location Regression Network for Table Structure Recognition (Paper)

- Robust Table Detection and Structure Recognition from Heterogeneous Document Images (Paper)
- Scene table structure recognition with segmentation collaboration and alignment (Paper)

- TableVLM: Multi-modal Pre-training for Table Structure Recognition (Paper)

- TRACE: Table Reconstruction Aligned to Corner and Edges (Paper)
- Aligning benchmark datasets for table structure recognition (Paper)
- Optimized Table Tokenization for Table Structure Recognition (Paper)

- Neural Collaborative Graph Machines for Table Structure Recognition (Paper)
- TableFormer: Table Structure Understanding with Transformers (Paper)
- PubTables-1M: Towards comprehensive table extraction from unstructured documents (Paper, code)

- Split, Embed and Merge: An accurate table structure recognizer (Paper)

- Visual Understanding of Complex Table Structures from Document Images (Paper)

- TSRFormer: Table Structure Recognition with Transformers (Paper)

- Parsing Table Structures in the Wild (Paper)
- TGRNet: A Table Graph Reconstruction Network for Table Structure Recognition (Paper, code)

- Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context (Paper)

- Show, Read and Reason: Table Structure Recognition with Flexible Context Aggregator (Paper)

- LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment (Paper)
- PingAn-VCGroup’s Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML (Paper, code)
- TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables (Paper)

- CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents (Paper, code)

- Image-based table recognition: data, model, and evaluation (Paper)
- Table Structure Recognition using Top-Down and Bottom-Up Cues (Paper)

- TableBank: Table Benchmark for Image-based Table Detection and Recognition (Paper)

- Challenges in end-to-end neural scientific table recognition (Paper)
- Deep Splitting and Merging for Table Structure Decomposition (Paper)
- DeepTabStR: Deep Learning based Table Structure Recognition (Paper)
- Rethinking Table Recognition using Graph Neural Networks (Paper)
- ReS2TIM: Reconstruct Syntactic Structures from Table Images (Paper)
- TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images (Paper, code, model)

- UniTable: Towards a Unified Framework for Table Structure Recognition via Self-Supervised Pretraining (Paper)

- Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations (Paper, code)