File format detection and parser validation

19 views
Skip to first unread message

Marc Juul

unread,
Mar 22, 2017, 7:27:28 PM3/22/17
to DIYbio
Does anyone know of a tool for detecting file formats for common biology files, e.g. FASTA, FASTQ, GenBank, SBOL, AB1, etc.

The *nix file command / libmagic does a terrible job of this.

I'm also looking for a library of samples that showcase the diversity of formats and *ahem* variants of those formats for the purpose of ensuring that parsers don't fail on edge cases.

--
marc/juul
Reply all
Reply to author
Forward
0 new messages