Does anyone know of a tool for detecting file formats for common biology files, e.g. FASTA, FASTQ, GenBank, SBOL, AB1, etc.
The *nix file command / libmagic does a terrible job of this.
I'm also looking for a library of samples that showcase the diversity of formats and *ahem* variants of those formats for the purpose of ensuring that parsers don't fail on edge cases.
--