As you now, the quantity of different kind of file formats is
unlimited so my idea was to archive the files using one or more
annotations for the files (e.g. File -> Plain Text File -> Structured
File -> Column Delimited File -> Ped-in File) and one ore more
annotations for the columns ( Any-Type -> Biological Entity-> Genetic
Marker -> SNP -> RS-Id )
To achieve this I've started to build two ontologies in a new google project:
http://code.google.com/p/fileontology/source/browse/trunk/files/ont/columns.rdf
http://code.google.com/p/fileontology/source/browse/trunk/files/ont/files.rdf
Using this king of ontology it should be possible for a robot to find
all the " Structured File" containing a "SNP".
I'm exploring this solution.
At the same time Frank Gibson (Newcastle University, UK /
http://www.carmen.org.uk/ ) gave me the advice to join the
information-ontologgy group
(http://groups.google.com/group/information-ontology ) because they
might have some good advices about this problem.
Pierre