"zml" refers to Zetcom's "generic" xml format that comes out of MpRia's API.
- See: http://docs.zetcom.com/ws/
- For LIDO see http://www.lido-schema.org
- main xslt mapping is at zml2lido/data/xsl/zml2lido.xsl
- lxml
- Saxon. We're now using saxonche from pypi. Install with
pip install saxonche
The mapping is partly specific to the RIA installation of the SPK, but might inspire others.
Includes the following scripts
- lido
- lvalidate
- saxon
- -j picks a job e.g. smb or ohneLit
- -i provide the xml source (required)
- -c expect one or multiple zipped cache files as input
- -f force; overwrite most files created during previous runs (except relWorks cache)
lido -j smb -i path/to/generic_xml
In previous versions, the script had to be executed in one particular directory for it to work correctly. Now it can be executed anywhere and the results are always saved in sdata/{label}/{date}.
job 'smb' does the following steps
- convert to LIDO (lvl1)
- filter out lido records that are not published
- filter out relatedWorks tht are not pubished (except Literature)
- validate LIDO
- split
job 'dd' does the following (meant for debugging)
- convert to LIDO (lvl1)
- validate LIDO
todo