Skip to content

Releases: docling-project/docling-eval

v1.2.0

24 Apr 12:52

Choose a tag to compare

Feature

  • CVAT submission delivery improvements (#211) (356c8df)

v1.1.1

14 Apr 15:25

Choose a tag to compare

Fix

v1.1.0

13 Apr 16:21

Choose a tag to compare

Feature

  • Dev/add datasetrecord (#207) (412c43a)
  • Flat-layout CVAT campaign tools and resilient shard writing (#206) (5c9f3fa)
  • New dataset builder - DoclingSDGDatasetBuilder (#205) (e761bcc)

Fix

  • PIL Image Memory Leaks in Dataset Builders (#194) (55fd3eb)

v1.0.1

11 Mar 16:38

Choose a tag to compare

Fix

  • Remove hard pinning of docling-parse (#203) (901814d)

v1.0.0

11 Mar 15:42

Choose a tag to compare

Feature

  • Parallelize the evaluation of tables and cache the loading of external predictions (#190) (9d04a56)
  • Regression tests for CVAT to Docling conversion (#193) (8a10188)
  • CVAT box rotation support, structural cleanup (#191) (db068e9)
  • Improvements in user experience: Performance, error handling, logging (#189) (a850784)
  • Visualizer tool and command for datasets (#186) (373f959)
  • Extend the evaluators to support external predictions stored in files (#185) (53dbd95)
  • Convert Docling JSON inputs to image streams in FileDatasetBuilder (#184) (15888fd)
  • Allow subset to split routing in CVAT to HF exporter (#182) (ebb8800)
  • Ingest CVAT assets and filter submissions (#180) (b55b2ea)
  • Runtime optimizations for MultiLabelConfusionMatrix (#175) (5084a4d)
  • Add more fine-grained control in the DoclingEvalCOCOExporter (#149) (8f33420)
  • Remove legacy CvatDatasetBuilder code, use modernized code (#174) (693c224)
  • Introduce the PixelLayoutEvaluator to produce confusion matrices for the multi-label layout analysis (#173) (a79bac5)
  • Review-bundle builder, fixes for GraphCell with merged elements and more (#172) (21341ce)

Fix

  • Correct import path for TableStructureModel (#199) (a7e74a3)
  • Fix the reporting of doc_id, true_md, pred_md in markdown_text_evaluator.py (#196) (3ce7591)
  • PixelLayoutEvaluator: Set all-pixels background in case of a missing prediction and evaluate (#183) (4314091)
  • Fix empty prediction handling in markdown evaluator (#177) (9b6df83)
  • Consistenty and perf improvements (#171) (8fb3a16)

Breaking

  • CvatDatasetBuilder now requires modern CVAT folder structure and uses convert_cvat_folder_to_docling() internally. (693c224)

v0.10.0

05 Nov 18:26

Choose a tag to compare

Feature

  • Extend the CLI for create-eval to receive the vlm-options and max_new_tokens parameters when the provider is GraniteDocling (#164) (8be2e83)
  • Harmonizing pic classes for cvat to docling conversion (#167) (740157d)
  • Add more specific validation for reading-order, enhance validation report (5e5f2db)
  • Integrate textline_cells based OCR evaluation (#156) (3a9543c)

Fix

  • Validation fixes for list item impurity check (#169) (74e7b3e)
  • Don't report content-layer group violation multiple times (cb71009)
  • Handle merged elements regarding inclusion, don't flag single element pages (c10fdfd)
  • Missing transform to storage_scale for some items and table cells (1eb6b4e)
  • More CVAT validation and docling conversion fixes (#163) (6f59c7a)
  • Better control over scaling in CVAT transform, fixes for OCR (#162) (ef17b5a)
  • Fixes for CVAT validation, OCR in CVAT pipeline, logging, and more (#161) (80e449d)

Performance

v0.9.0

01 Oct 03:42

Choose a tag to compare

Feature

  • Exposed forced-ocr-option (#157) (ac21644)
  • Implementation of table structure conversion from CVAT to DoclingDocument (208cd14)

v0.8.1

16 Sep 08:23

Choose a tag to compare

Fix

  • Ocr visualization and add ocr recognition metrics (#144) (d63a439)

v0.8.0

02 Sep 21:18
758f6dc

Choose a tag to compare

What's Changed

  • feat: Extend the Consolidator to export Latex files alongside the excel report by @nikos-livathinos in #143
  • feat: Extend the DoclingEvalCOCOExporter to export a parquet dataset in COCO format by @nikos-livathinos in #145
  • feat: Several fixes and campaign tools extensions by @cau-git in #150
  • feat: Add Table structure evaluations for TEDS by @praveenmidde in #94

Full Changelog: v0.7.0...v0.8.0

v0.7.0

30 Jul 08:06

Choose a tag to compare

Feature

Fix

  • Prevent crash from invalid bbox coordinates in HTML export (#142) (c31b107)