Hi Matús,
thanks a lot for your work here and the poster you presented at the ISMB in Dublin this year. I really like the route where bioXSD is going!
Lately, I was thinking in the same direction and with great input from Mike Amudsen and the ALPS community[1] we created bioALPS. Obviously, my literature research wasn't done very well, otherwise I would have stumbled on bioXSD much earlier. I'm not sure if you're aware of ALPS, but in a nutshell it exploits the profile link relation from RFC6906 by linking a document to a profile, which in turn can constrain content or advertise conventions, just like an XML namespace document. Besides the semantics of the data, ALPS can also define "transitions in an intuitive structure with human-readable descriptions"[2]. With this, its possible to additionally describe actions or transitions that make sense in scope of the data (e.g. how to issue a blast search with a sequence). The section "A Short Hike into the ALPS" of [2] should give a sufficient overview to grasp the essentials.
For me, ALPS was the way to go to semantically describe the interplay between sequences, high-throughput experiments, samples, genome assemblies, sequence features and experimental contrasts. What I like very much about ALPS is that it extends the contraints/conventions not only to the data format but also on the interaction with the underlying web service that serves the data. This can, in my view, lead to benefits in database integration and "smoother" workflows, as you put it in Figure 4 ;)
Please feel free to comment and let me know what you think!
Best,
Jens
[1]
https://groups.google.com/forum/#!forum/alps-io[2]
http://www.infoq.com/articles/programming-semantic-profiles [3]
https://github.com/jenzopr/bioalps/tree/master/examples