This document summarizes progress made on the File Information Tool Set (FITS) for analyzing video files, including:
1) FITS is being enhanced to better support video file characterization by assessing current metadata support, identifying essential video metadata, implementing support for additional metadata, and testing the changes.
2) A hackathon was held to further develop FITS and related tools like C3PO, focusing on risk identification in PDFs, file format identification, and integrating Apache Tika.
3) Funding was awarded to continue integrating Tika into FITS, update dependent tools, improve video metadata support, and address performance issues. Feedback on video metadata needs is requested.
Crowdsourcing Representation Information to Support Preservation: CRISP
Report
Share
1 of 8
More Related Content
Progress with FITS for analyzing video
1. Progress with FITS for analyzing video
Andrea Goethals
Paul Wheatley
SPRUCE Project Manager
University of Leeds
Twitter: @prwheatley
http://openplanetsfoundation.org/blogs/paul
2. Poster on user needs in digital preservation
“Practitioners need better characterisation
tools”, DCC Conference, Amsterdam,
January 2013
See:
http://bit.ly/spruce-results
3. Hackathon: Leeds UK, March 12th 2013
• Hackathon focusing on
characterisation of digital
objects
• Organised by SPRUCE
Project
• Funded by Jisc (UK)
• Included key people behind
JHOVE, JHOVE2, FITS,
DROID, C3PO, FIDO and
more...
4. Hackathon: Leeds UK, March 12th 2013
• Developments focused on 4 key areas:
–Long term risk identification in PDF files
–Consolidating file format identification magic
–Adding support for Apache Tika to FITS
–Adding support for Apache Tika to C3PO
5. FITS: File Information Tool Set
“The File Information Tool Set (FITS) identifies, validates, and extracts
technical metadata for various file formats. It wraps several third-party open
source tools, normalizes and consolidates their output, and reports any
errors. FITS was created by the Harvard University Library Office for
Information Systems for use in its Digital Repository Service (DRS).”
• JHOVE
• Exiftool
• NLNZ metadata extractor
• DROID
• FFIdent
• File
• Apache Tika
Combines/normalises the output
6. C3PO: FITS data output visualization tool
http://ifs.tuwien.ac.at/imp/c3po
7. SPRUCE funding awards
Funding awards of £5000 for C3PO and for FITS
FITS work is already half way through...
• Fully incorporate Tika into FITS
• Bring JHOVE and DROID tools up to latest versions
• Enhance video metadata model
• Improve documentation
• Identify and fix performance issues
8. Enhance video metadata model
Current support for video metadata in FITS is quite poor
Development effort available to:
• Assess current situation and collate user feedback
• Identify core metadata essential to support
• Implement, adding additional tools to FITS if necessary
• Test and evaluate
Please contribute feedback to:
http://fileformats.wordpress.com/2013/04/11/video-md/
Please try out the finished tool in ~3 weeks time and give us some feedback!
https://github.com/gmcgath/fits-mcgath