Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Published April 7, 2024 | Version v2
Software Open

Code and Data for "Historical Insights from Sacrobosco Tables" Project

  • 1. ROR icon Technische Universität Berlin
  • 2. ROR icon Max Planck Institute for the History of Science
  • 3. ROR icon Freie Universität Berlin

Description

This repository contains code and data for the project:

"Insightful analysis of historical sources at scales beyond human capabilities using unsupervised Machine Learning and Explainable AI".

  • sacrobosco_tables.zip: Contains the pre-processed Sacrobosco Tables Data. See folder raw/ for the original scans and processed/ for the binarized pages used as input for the model.

 

  • page_data.zip: Contains three folders of training and evaluation material: patches_annotated/ are the ground annotations of digit patches and  contrast_annotated/ contains non-digit page content. The folder eval_annotated contains pages that have been annotated in full used for evaluation of full pages.

 

  • code.zip: Contains the main code functionalities, including data processing, model training, inference and analysis and visualization code to generate case study analyses presented in our paper. See README.md for details on code execution and setup of the project repository.

 

Notes

This work was partly funded by the German Ministry for Education and Research (under refs 01IS14013A-E, 01GQ1115, 01GQ0850, 01IS18056A, 01IS18025A and 01IS18037A) and BBDC/BZML and BIFOLD.
Furthermore KRM was partly supported by the Institute of Information & Communications Technology Planning & Evaluation (IITP) grants funded by the Korea Government (MSIT) (No. 2019-0-00079, Artificial Intelligence Graduate School Program, Korea University and No. 2022-0-00984, Development of Artificial Intelligence Technology for Personalized Plug-and-Play Explanation and Verification of Explanation). Finally, the Sphere project is also supported by the Max Planck Institute for the History of Science.

Files

code.zip

Files (21.2 GB)

Name Size Download all
md5:0837094c2ae9b54188d56356352c6ffd
209.4 MB Preview Download
md5:78e07a243eaa290bfec3ed4f696ffbd4
5.1 GB Preview Download
md5:25532816efb307d16b07aa23e2b1fb3a
15.9 GB Preview Download

Additional details

Related works

Is derived from
Dataset: 10.5281/zenodo.5767439 (DOI)