> Can you give some examples of what you mean? Are their metadata
> fields that are not represented in the API ATOM feeds, or is the ATOM
> just not in the right format? Please give as much detail as possible
> so we can help think of a solution.
... the citation info looks to come back as something like:
<arxiv:journal_ref xmlns:arxiv="
http://arxiv.org/schemas/
atom">Eur.Phys.J. C31 (2003) 17-29</arxiv:journal_ref>
It would be good to get the year, journal, pages etc in separate
fields to
save parsing the string... is this info stored separately within
arxiv?
Or would a simple parse of the string to get the components work
reliably? If not, it would be useful to get separate fields like:
<year>2003</year>
<journal>Eur.Phys.J.</journal>
<pages>17-29</pages>
Thanks,
Fred.
PS I've used a JSON encoding with bibtex field names, embedded in the
<content>
of Atom entries before (to get lossless exchange of bibtex
metadata) ... most
tools already have some bibtex import / export support, so there's
some
advantage in using its schema.
I appended a sample of JSON/Bibtex embedded within
Atom - it's a bit unusual to include json within xml, but you sidestep
all the
clunky XML namespaces by using json, and it's easy to process.
Not sure if JSON output could be an alternative for your arxiv api...
<feed>
<entry>
<content>
{
"type":"article",
"title":"...",
"year":"2006",
"author":"...",
"journal":"Concurrency Computat.: Pract. Exper.",
"volume":"19",
"number":"",
"pages":"207-221",
"month":"",
"doi":"10.1002\/cpe.1044",
"pdflink":"",
"urllink":"http:\/\/
www3.interscience.wiley.com\/...",
"abstract":"The integrative ambitions of systems biology - (...)",
"note":"",
"keywords":"XML, Semantic Web, e-Science"
}
</content>
</entry>
</feed>