Error with latest Reactome and Netpath biopax files

6 views
Skip to first unread message

Ruth

unread,
Jul 3, 2009, 3:10:55 PM7/3/09
to cpath-dev, gro...@cbio.mskcc.org
Hi,
I have been re-building my instance of cpath and the reactome and
Netpath biopax files keep coming up with the following error:

Reading in file: Mus musculus.owl
XML Type: BIO_PAX
Reading in meta-data from: db.info
Data source is: Reactome
Reading in file content...
Processing file...
Validating BioPAX File with Paxtools...
rethrew: org.xml.sax.SAXParseException: Invalid byte 2 of 3-byte UTF-8
sequence.

Is there any way of fixing the files?

It works fine for Cell map, Biocarta and NCI biopax files.

Thanks,
Ruth

Gary Bader

unread,
Jul 3, 2009, 3:19:54 PM7/3/09
to cpat...@googlegroups.com, gro...@cbio.mskcc.org
Hi Ruth - what do the top of the files look like in a text editor? Is
there an issue with them?

Gary
--
http://baderlab.org
Terrence Donnelly Centre for Cellular and Biomolecular Research
University of Toronto

Ruth Isserlin

unread,
Jul 3, 2009, 3:31:08 PM7/3/09
to cpat...@googlegroups.com, gro...@cbio.mskcc.org
Hi Gary,

The tops of the files don't look different to me. Reactome has a few more
things defined, not sure if that breaks them.

For Cellmap which works it looks like -
<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bp="http:
//www.biopax.org/release/biopax-level2.owl#"
xmlns:owl="http://www.w3.org/2002/0
7/owl#" xmlns="http://cbio.mskcc.org/cpath#"
xml:base="http://cbio.mskcc.org/cpa
th">
<owl:Ontology rdf:about="">
<owl:imports
rdf:resource="http://www.biopax.org/release/biopax-level2.owl"
/>
</owl:Ontology>

And for Reactome it looks like -
<?xml version="1.0" encoding="UTF-8"?>
<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
xmlns:bp="http:
//www.biopax.org/release/biopax-level2.owl#"
xmlns:rdfs="http://www.w3.org/2000/
01/rdf-schema#" xmlns:owl="http://www.w3.org/2002/07/owl#"
xmlns:xsd="http://www
.w3.org/2001/XMLSchema#" xmlns="http://www.reactome.org/biopax#"
xml:base="http:
//www.reactome.org/biopax">
<owl:Ontology rdf:about="">
<owl:imports
rdf:resource="http://www.biopax.org/release/biopax-level2.owl"
/>
</owl:Ontology>

Thanks,
Ruth

Gary Bader

unread,
Jul 3, 2009, 3:46:05 PM7/3/09
to cpat...@googlegroups.com, gro...@cbio.mskcc.org
I think you have to use a hex editor to see the if the actual bytes are
different.

Gary

Ethan Cerami

unread,
Jul 3, 2009, 3:49:07 PM7/3/09
to cpat...@googlegroups.com, gro...@cbio.mskcc.org
I think that Ben uses a character conversion utility to fix this issue, but he is on holiday today (it's the fourth of july vacation here today).

rex....@syngenta.com

unread,
Jul 8, 2009, 2:32:25 PM7/8/09
to cpat...@googlegroups.com, gro...@cbio.mskcc.org

I am having similar problems trying to read reactome files in BP level 3 programmatically with paxtools.

I don’t understand – is this a bug in paxtools, or do the files need to be rebuilt with the right character set or something?

Does anyone know of level 3 files in the correct format?

Rex


This message may contain confidential information. If you are not the designated recipient, please notify the sender immediately, and delete the original and any copies. Any use of the message by you is prohibited.

Reply all
Reply to author
Forward
0 new messages