Docling-eval

Evaluate Docling on various datasets.

Features

Run evaluations from the CLI. The help output lists the available tasks, modalities, and benchmarks:

docling-eval % poetry run evaluate --help

 Usage: evaluate [OPTIONS]

╭─ Options ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ *  --task            -t      [create|evaluate|visualize]                                 Evaluation task [default: None] [required]                 │
│ *  --modality        -m      [end-to-end|layout|table_structure|code_transcription|math  Evaluation modality [default: None] [required]             │
│                              _transcription|reading_order|markdown_text|captioning|bbox                                                             │
│                              es_text]                                                                                                               │
│ *  --benchmark       -b      [DPBench|OmniDocBench|WordScape|PubLayNet|DocLayNetV1|DocL  Benchmark name [default: None] [required]                  │
│                              ayNetV2|FUNSD|Pub1M|PubTabNet|FinTabNet|WikiTabNet]                                                                    │
│ *  --output-dir      -o      PATH                                                        Output directory [default: None] [required]                │
│    --input-dir       -i      PATH                                                        Input directory [default: None]                            │
│    --converter_type  -c      [Docling|SmolDocling]                                       Type of document converter [default: Docling]              │
│    --split           -s      TEXT                                                        Dataset split [default: test]                              │
│    --artifacts-path  -a      PATH                                                        Load artifacts from local path [default: None]             │
│    --max-items       -n      INTEGER                                                     How many items to load from the original dataset           │
│                                                                                          [default: 1000]                                            │
│    --help                                                                                Show this message and exit.                                │
╰─────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
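The options above can be combined into one invocation per task. The following is a minimal sketch, not part of docling-eval itself: `build_command` is a hypothetical helper, the output directory is a made-up path, and running the three tasks in the order create, evaluate, visualize is an assumption based only on the task names. Only flags that appear in the help output are used; whether a given modality and benchmark pair is accepted should be checked against the benchmark documentation.

```python
import shlex

# Task names copied from the `evaluate --help` output.
TASKS = ("create", "evaluate", "visualize")


def build_command(task, modality, benchmark, output_dir, max_items=None):
    """Return the argv list for one `poetry run evaluate` invocation.

    Hypothetical helper for illustration; it only assembles arguments
    and does not run anything.
    """
    if task not in TASKS:
        raise ValueError(f"unknown task: {task!r}")
    argv = [
        "poetry", "run", "evaluate",
        "-t", task,             # --task
        "-m", modality,         # --modality
        "-b", benchmark,        # --benchmark
        "-o", str(output_dir),  # --output-dir
    ]
    if max_items is not None:
        argv += ["-n", str(max_items)]  # --max-items
    return argv


if __name__ == "__main__":
    # Assumed workflow: one command per task, same modality and benchmark.
    # "./benchmarks/dpbench-layout" is an illustrative path.
    for task in TASKS:
        argv = build_command(task, "layout", "DPBench", "./benchmarks/dpbench-layout")
        print(shlex.join(argv))
```

The printed lines can be pasted into a shell, or each argv list can be passed to `subprocess.run` from the repository root.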

Benchmarks

DP-Bench benchmarks: Text, layout, reading order and table structure evaluation on the DP-Bench dataset.
OmniDocBench benchmarks: Text, layout, reading order and table structure evaluation on the OmniDocBench dataset.
DocLayNetV1 benchmarks: Text and layout evaluation on the DocLayNet v1.2 dataset.
FinTabNet benchmarks: Table structure evaluation on the FinTabNet dataset.
PubTabNet benchmarks: Table structure evaluation on the PubTabNet dataset.
Pub1M benchmarks: Table structure evaluation on the Pub1M dataset.

Contributing

Please read Contributing to Docling for details.

License

The Docling codebase is under the MIT license. For individual model usage, please refer to the model licenses found in the original packages.

IBM ❤️ Open Source AI

Docling-eval has been brought to you by IBM.
