Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions docling/backend/msexcel_backend.py
Original file line number Diff line number Diff line change
Expand Up @@ -256,8 +256,8 @@ def _convert_workbook(self, doc: DoclingDocument) -> DoclingDocument:

self.parents[0] = doc.add_group(
parent=None,
label=GroupLabel.SECTION,
name=f"sheet: {name}",
label=GroupLabel.SHEET,
name=name,
content_layer=self._get_sheet_content_layer(sheet),
)
doc = self._convert_sheet(doc, sheet, page_no)
Expand Down
24 changes: 12 additions & 12 deletions tests/data/groundtruth/docling_v2/1706.03762_main.tex.md
Original file line number Diff line number Diff line change
Expand Up @@ -336,26 +336,26 @@ Variations on the Transformer architecture. Unlisted values are identical to tho
| 0pt2.0ex | 2*$N$ | 2*$d_{\text{model}}$ | 2*$d_{\text{ff}}$ | 2*$h$ | 2*$d_k$ | 2*$d_v$ | 2*$P_{drop}$ | 2*$\epsilon_{ls}$ | train | PPL | BLEU | params |
|----------------|---------|---------------------------------------------|---------------------|---------|-----------|-----------|----------------|---------------------|---------|-------|--------|--------------|
| | | | | | | | | | steps | (dev) | (dev) | $\times10^6$ |
| 0pt2.0ex base | 6 | 512 | 2048 | 8 | 64 | 64 | 0.1 | 0.1 | 100K | 4.92 | 25.8 | 65 |
| 0pt2.0ex 4*(A) | | | | 1 | 512 | 512 | | | | 5.29 | 24.9 | |
| | | | | 4 | 128 | 128 | | | | 5.00 | 25.5 | |
| | | | | 16 | 32 | 32 | | | | 4.91 | 25.8 | |
| | | | | 32 | 16 | 16 | | | | 5.01 | 25.4 | |
| 0pt2.0ex 2*(B) | | | | | 16 | | | | | 5.16 | 25.1 | 58 |
| | | | | | 32 | | | | | 5.01 | 25.4 | 60 |
| 0pt2.0ex base | 6 | 512 | 2048 | 8 | 64 | 64 | 0.1 | 0.1 | 100K | 4.92 | 25.8 | 65 |
| 0pt2.0ex 4*(A) | | | | 1 | 512 | 512 | | | | 5.29 | 24.9 | |
| | | | | 4 | 128 | 128 | | | | 5.00 | 25.5 | |
| | | | | 16 | 32 | 32 | | | | 4.91 | 25.8 | |
| | | | | 32 | 16 | 16 | | | | 5.01 | 25.4 | |
| 0pt2.0ex 2*(B) | | | | | 16 | | | | | 5.16 | 25.1 | 58 |
| | | | | | 32 | | | | | 5.01 | 25.4 | 60 |
| 0pt2.0ex 7*(C) | 2 | | | | | | | | | 6.11 | 23.7 | 36 |
| | 4 | | | | | | | | | 5.19 | 25.3 | 50 |
| | 8 | | | | | | | | | 4.88 | 25.5 | 80 |
| | | 256 | | | 32 | 32 | | | | 5.75 | 24.5 | 28 |
| | | 1024 | | | 128 | 128 | | | | 4.66 | 26.0 | 168 |
| | | 256 | | | 32 | 32 | | | | 5.75 | 24.5 | 28 |
| | | 1024 | | | 128 | 128 | | | | 4.66 | 26.0 | 168 |
| | | | 1024 | | | | | | | 5.12 | 25.4 | 53 |
| | | | 4096 | | | | | | | 4.75 | 26.2 | 90 |
| 0pt2.0ex 4*(D) | | | | | | | 0 | | | 5.77 | 24.6 | |
| 0pt2.0ex 4*(D) | | | | | | | 0.0 | | | 5.77 | 24.6 | |
| | | | | | | | 0.2 | | | 4.95 | 25.5 | |
| | | | | | | | | 0 | | 4.67 | 25.3 | |
| | | | | | | | | 0.0 | | 4.67 | 25.3 | |
| | | | | | | | | 0.2 | | 5.47 | 25.7 | |
| 0pt2.0ex (E) | | 7cpositional embedding instead of sinusoids | | 4.92 | 25.7 | | | | | | | |
| 0pt2.0ex big | 6 | 1024 | 4096 | 16 | | | 0.3 | | 300K | 4.33 | 26.4 | 213 |
| 0pt2.0ex big | 6 | 1024 | 4096 | 16 | | | 0.3 | | 300K | 4.33 | 26.4 | 213 |
| | | | | | | | | | | | | |

To evaluate the importance of different components of the Transformer, we varied our base model in different ways, measuring the change in performance on English-to-German translation on the development set, newstest2013. We used beam search as described in the previous section, but no checkpoint averaging. We present these results in Table
Expand Down
16 changes: 8 additions & 8 deletions tests/data/groundtruth/docling_v2/2203.01017v2.md
Original file line number Diff line number Diff line change
Expand Up @@ -184,16 +184,16 @@ Structure. As shown in Tab. 2, TableFormer outperforms all SOTA methods across d

| Model | Dataset | Simple | TEDS Complex | All |
|-------------|-----------|----------|----------------|-------|
| EDD | PTN | 91.1 | 88.7 | 89.9 |
| EDD | PTN | 91.1 | 88.7 | 89.9 |
| GTE | PTN | - | - | 93.01 |
| TableFormer | PTN | 98.5 | 95.0 | 96.75 |
| EDD | FTN | 88.4 | 92.08 | 90.6 |
| EDD | FTN | 88.4 | 92.08 | 90.6 |
| GTE | FTN | - | - | 87.14 |
| GTE (FT) | FTN | - | - | 91.02 |
| TableFormer | FTN | 97.5 | 96.0 | 96.8 |
| EDD | TB | 86.0 | - | 86 |
| TableFormer | TB | 89.6 | - | 89.6 |
| TableFormer | STN | 96.9 | 95.7 | 96.7 |
| TableFormer | FTN | 97.5 | 96.0 | 96.8 |
| EDD | TB | 86.0 | - | 86.0 |
| TableFormer | TB | 89.6 | - | 89.6 |
| TableFormer | STN | 96.9 | 95.7 | 96.7 |

Table 2: Structure results on PubTabNet (PTN), FinTabNet (FTN), TableBank (TB) and SynthTabNet (STN).

Expand All @@ -215,9 +215,9 @@ Table 4: Results of structure with content retrieved using cell detection on Pub

| Model | Simple | TEDS Complex | All |
|-------------|----------|----------------|-------|
| Tabula | 78 | 57.8 | 67.9 |
| Tabula | 78.0 | 57.8 | 67.9 |
| Traprange | 60.8 | 49.9 | 55.4 |
| Camelot | 80 | 66 | 73 |
| Camelot | 80.0 | 66.0 | 73.0 |
| Acrobat Pro | 68.9 | 61.8 | 65.3 |
| EDD | 91.2 | 85.4 | 88.3 |
| TableFormer | 95.4 | 90.1 | 93.6 |
Expand Down
6 changes: 3 additions & 3 deletions tests/data/groundtruth/docling_v2/2305.03393_main.tex.md
Original file line number Diff line number Diff line change
Expand Up @@ -169,14 +169,14 @@ HPO performed in OTSL and HTML representation on the same transformer-based Tabl
| 2*c]@c@# enc-layers | 2*c]@c@# dec-layers | 2*Language | 3c|TEDs | 2*mAP (0.75) | 2*Inference time (secs) | | |
|------------------------|------------------------|--------------|----------------|------------------|-----------------------------|-------|------|
| 4-6 | | | 1c|simple | 1c|complex | all | | |
| 2*6 | 2*6 | OTSL | 0.965 | 0.934 | 0.955 | 0.88 | 2.73 |
| 2*6 | 2*6 | OTSL | 0.965 | 0.934 | 0.955 | 0.88 | 2.73 |
| | | HTML | 0.969 | 0.927 | 0.955 | 0.857 | 5.39 |
| 2*4 | 2*4 | OTSL | 0.938 | 0.904 | 0.927 | 0.853 | 1.97 |
| | | HTML | 0.952 | 0.909 | 0.938 | 0.843 | 3.77 |
| 2*2 | 2*4 | OTSL | 0.923 | 0.897 | 0.915 | 0.859 | 1.91 |
| | | HTML | 0.945 | 0.901 | 0.931 | 0.834 | 3.81 |
| 2*4 | 2*2 | OTSL | 0.952 | 0.92 | 0.942 | 0.857 | 1.22 |
| | | HTML | 0.944 | 0.903 | 0.931 | 0.824 | 2 |
| | | HTML | 0.944 | 0.903 | 0.931 | 0.824 | 2 |
| | | | | | | | |

### Quantitative Results
Expand All @@ -190,7 +190,7 @@ TSR and cell detection results compared between OTSL and HTML on the PubTabNet[P
| 2*Data set | 2*Language | 3c|TEDs | 2*mAP(0.75) | 2*Inference time (secs) | | |
|----------------|--------------|----------------|-----------------|-----------------------------|-------|------|
| 3-5 | | 1c|simple | 1c|complex | all | | |
| 2*PubTabNet | OTSL | 0.965 | 0.934 | 0.955 | 0.88 | 2.73 |
| 2*PubTabNet | OTSL | 0.965 | 0.934 | 0.955 | 0.88 | 2.73 |
| | HTML | 0.969 | 0.927 | 0.955 | 0.857 | 5.39 |
| 2*FinTabNet | OTSL | 0.955 | 0.961 | 0.959 | 0.862 | 1.85 |
| | HTML | 0.917 | 0.922 | 0.92 | 0.722 | 3.26 |
Expand Down
14 changes: 7 additions & 7 deletions tests/data/groundtruth/docling_v2/2412.19437_main.tex.md
Original file line number Diff line number Diff line change
Expand Up @@ -822,16 +822,16 @@ In addition, we perform language-modeling-based evaluation for Pile-test and use
| | TriviaQA (EM) | 5-shot | 80.0 | 71.9 | 82.7 | 82.9 | | |
| | NaturalQuestions (EM) | 5-shot | 38.6 | 33.2 | 41.5 | 40.0 | | |
| | AGIEval (EM) | 0-shot | 57.5 | 75.8 | 60.6 | 79.6 | | |
| 75.4 | 4*Code | HumanEval (Pass@1) | 0-shot | 43.3 | 53.0 | 54.9 | 65.2 | |
| 75.4 | 4*Code | HumanEval (Pass@1) | 0-shot | 43.3 | 53.0 | 54.9 | 65.2 | |
| | MBPP (Pass@1) | 3-shot | 65.0 | 72.6 | 68.4 | 75.4 | | |
| | LiveCodeBench-Base (Pass@1) | 3-shot | 11.6 | 12.9 | 15.5 | 19.4 | | |
| | CRUXEval-I (EM) | 2-shot | 52.5 | 59.1 | 58.5 | 67.3 | | |
| | CRUXEval-O (EM) | 2-shot | 49.8 | 59.9 | 59.9 | 69.8 | | |
| (EM) | 3*Math | GSM8K (EM) | 8-shot | 81.6 | 88.3 | 83.5 | 89.3 | |
| (EM) | 3*Math | GSM8K (EM) | 8-shot | 81.6 | 88.3 | 83.5 | 89.3 | |
| | MATH (EM) | 4-shot | 43.4 | 54.4 | 49.0 | 61.6 | | |
| | MGSM (EM) | 8-shot | 63.6 | 76.2 | 69.9 | 79.8 | | |
| | CMath (EM) | 3-shot | 78.7 | 84.5 | 77.3 | 90.7 | | |
| 90.1 | 7*Chinese | CLUEWSC (EM) | 5-shot | 82.0 | 82.5 | 83.0 | 82.7 | |
| 90.1 | 7*Chinese | CLUEWSC (EM) | 5-shot | 82.0 | 82.5 | 83.0 | 82.7 | |
| | C-Eval (EM) | 5-shot | 81.4 | 89.2 | 72.5 | 90.1 | | |
| | CMMLU (EM) | 5-shot | 84.0 | 89.5 | 73.7 | 88.8 | | |
| | CMRC (EM) | 1-shot | 77.4 | 75.8 | 76.0 | 76.3 | | |
Expand Down Expand Up @@ -1169,8 +1169,8 @@ DeepSeek-V3and Qwen2.5-72B exhibit similar performance levels, indicating that b
| Qwen2.5-72B-Instruct | 81.2 | 49.1 |
| LLaMA-3.1 405B | 69.3 | 40.5 |
| GPT-4o-0513 | 80.4 | 51.1 |
| Claude-Sonnet-3.5-1022 | 85.2 | 52 |
| DeepSeek-V3 | 85.5 | 70 |
| Claude-Sonnet-3.5-1022 | 85.2 | 52.0 |
| DeepSeek-V3 | 85.5 | 70.0 |
| | | |

English open-ended conversation evaluations.
Expand Down Expand Up @@ -1208,9 +1208,9 @@ Therefore, we employ DeepSeek-V3along with voting to offer self-feedback on open
| GPT-4o-0513 | 96.6 | 70.4 | 86.7 | 84.9 | 84.7 |
| GPT-4o-0806 | 96.1 | 76.1 | 88.1 | 86.6 | 86.7 |
| GPT-4o-1120 | 95.8 | 71.3 | 86.2 | 85.2 | 84.6 |
| Claude-3.5-sonnet-0620 | 96.4 | 74 | 81.6 | 84.7 | 84.2 |
| Claude-3.5-sonnet-0620 | 96.4 | 74.0 | 81.6 | 84.7 | 84.2 |
| Claude-3.5-sonnet-1022 | 96.4 | 79.7 | 91.1 | 87.6 | 88.7 |
| DeepSeek-V3 | 96.9 | 79.8 | 87 | 84.3 | 87 |
| DeepSeek-V3 | 96.9 | 79.8 | 87.0 | 84.3 | 87.0 |
| DeepSeek-V3 (maj@6) | 96.9 | 82.6 | 89.5 | 89.2 | 89.6 |
| | | | | | |

Expand Down
2 changes: 1 addition & 1 deletion tests/data/groundtruth/docling_v2/escaped_characters.md.md
Original file line number Diff line number Diff line change
Expand Up @@ -41,4 +41,4 @@ The pipe symbol (| or `|` ) only needs to be escaped in tables.

## Link

[& < > " '](https://en.wikipedia.org/wiki/Albert_Einstein)
[& < > " '](https://en.wikipedia.org/wiki/Albert_Einstein)
Loading
Loading