Perform various operations on ALTO xml files
Project description
ALTO Tools
:snake: tools for performing various operations on ALTO XML files
Installation
Clone the repository, enter it and run
pip install .
Usage
alto-tools <INPUT> [OPTION]
INPUT
should be the path to an ALTO file or directory containing ALTO files.
Output is sent to stdout
.
OPTION | Description |
---|---|
-t --text |
Extract UTF-8 encoded text content |
-c --confidence |
Extract mean OCR word confidence score |
-i --illustrations |
Extract bounding box coordinates of <Illustration> elements |
-g --graphics |
Extract bounding box coordinates of <GraphicalElement> elements |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
alto-tools-0.1.0.tar.gz
(9.5 kB
view details)
Built Distribution
File details
Details for the file alto-tools-0.1.0.tar.gz
.
File metadata
- Download URL: alto-tools-0.1.0.tar.gz
- Upload date:
- Size: 9.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f95c288388015835c38e2d86afb31491be9adae9b60ea2fc0f3927ffaa8cfef7 |
|
MD5 | 3979b7f66432b28401610be1037f86cd |
|
BLAKE2b-256 | ac70135a76f2d514242093f32972ecfcee8fa5fe14618f6d69302ef68d717363 |
File details
Details for the file alto_tools-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: alto_tools-0.1.0-py3-none-any.whl
- Upload date:
- Size: 9.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9015c74d0bd089da52ae6d8d69903c2b562061ef21803b1f39cec1587a468691 |
|
MD5 | 3a460e80393f33bc46b47cb3e3cd3f90 |
|
BLAKE2b-256 | 50a8397269efadf94ece214951c2678b06afd290fad091f5cf48c0cff5c92c13 |