Perl module for working with DROID/PRONOM signatures: it converts them to Perl regular expressions, analyzes files and displays the matches via wxHexEditor tag files, and calculates statistics. For PRONOM see https://www.nationalarchives.gov.uk/PRONOM/

File-FormatIdentification-Pronom version 0.01
=============================================

This module handles DROID signatures. DROID is a utility that uses the
PRONOM database to identify file formats.

See https://www.nationalarchives.gov.uk/PRONOM/ for details.

With this module you can:

* convert DROID signatures to Perl regular expressions
* analyze files and display which pattern of a DROID signature matches, and where, via tag files for wxHexEditor
* calculate statistics about DROID signatures
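
To illustrate the first point, here is a minimal sketch of how a byte sequence
could be turned into a Perl regular expression. The helper droid_to_regex is
hypothetical and is not this module's API. It assumes a small subset of the
PRONOM byte-sequence syntax (hex pairs, ?? for any byte, {n} and {n-m} for
gaps, * for an unbounded gap) and rejects everything else.

```perl
use strict;
use warnings;

# Hypothetical helper, NOT part of File::FormatIdentification::Pronom.
# Translates a subset of the byte-sequence syntax into regex source text.
sub droid_to_regex {
    my ($sequence) = @_;
    my $regex = '';
    while ( length $sequence ) {
        if ( $sequence =~ s/^\?\?// ) {                  # ?? : any single byte
            $regex .= '.';
        }
        elsif ( $sequence =~ s/^\*// ) {                 # *  : gap of any length
            $regex .= '.*';
        }
        elsif ( $sequence =~ s/^\{(\d+)-(\d+)\}// ) {    # {n-m} : gap of n to m bytes
            $regex .= sprintf '.{%d,%d}', $1, $2;
        }
        elsif ( $sequence =~ s/^\{(\d+)\}// ) {          # {n} : gap of exactly n bytes
            $regex .= sprintf '.{%d}', $1;
        }
        elsif ( $sequence =~ s/^([0-9A-Fa-f]{2})// ) {   # hex pair : literal byte
            $regex .= '\x' . uc($1);
        }
        else {
            die "unsupported syntax near '$sequence'\n";
        }
    }
    return $regex;
}

# Example input chosen for illustration only.
my $source = droid_to_regex('25504446??{2-4}0A');
my $regex  = qr/\A$source/s;    # /s lets '.' match every byte, including \n
print "match\n" if "%PDF-1.4\n" =~ $regex;
```

For this input the helper returns the text \x25\x50\x44\x46..{2,4}\x0A, which
is anchored at the start of the data with \A before matching.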

The module is in early alpha state and should not be used in production.

INSTALLATION

To install this module type the following:

perl Makefile.PL
make
make test
make install

DEPENDENCIES

This module requires these other modules and libraries:

* XML::LibXML
* Carp
* List::Util
* Scalar::Util
* YAML::XS
* Moose
* String::LCSS
* Regexp::Assemble
* Regexp::Optimizer

The scripts have only been tested under Debian GNU/Linux (Stretch).

CALL

$> perl ./pronom2wxhexeditor.pl DROID-SIGNATURE.xml binaryfile

The first run produces a file "DROID-SIGNATURE.xml.yaml" which holds the PCREs,
so that DROID-SIGNATURE.xml does not have to be parsed again on later runs.
This file is human-readable.

Each run produces a file "binaryfile.tags" which holds the tags used by
wxHexEditor when you open the file "binaryfile".

Each run also produces a file "binaryfile.html" which can be viewed in a
web browser to get an overview of all matches.

$> perl ./pronom_statistics.pl DROID-SIGNATURE.xml

This prints some useful statistics about the signature file.

BUGS

* Some DROID signatures result in PCREs which are greedy, so the running
time can grow exponentially with the size of the binary file.
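
What "greedy" means here can be shown with a small sketch. The data and the
patterns are made up for illustration and are not taken from a real
signature. A greedy gap first runs to the end of the data and then
backtracks, while a non-greedy gap stops at the first possible match.

```perl
use strict;
use warnings;

# Made-up data: a start marker, filler, and two end markers.
my $data = "AB" . ( "x" x 10 ) . "CD" . ( "y" x 10 ) . "CD";

# Greedy gap: extends to the LAST "CD" in the data.
my ($greedy) = $data =~ /\AAB(.*)CD/s;

# Non-greedy gap: stops at the FIRST "CD".
my ($lazy) = $data =~ /\AAB(.*?)CD/s;

printf "greedy gap: %d bytes, non-greedy gap: %d bytes\n",
    length($greedy), length($lazy);
```

This prints "greedy gap: 22 bytes, non-greedy gap: 10 bytes".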


COPYRIGHT AND LICENCE

Copyright (C) 2018 by Andreas Romeyke

This library is free software; you can redistribute it and/or modify
it under the same terms as Perl itself, either Perl version 5.24.1 or,
at your option, any later version of Perl 5 you may have available.