importing a big rdf file

18 views
Skip to first unread message

George Lilly

unread,
Dec 3, 2011, 9:15:19 PM12/3/11
to fileman-tr...@googlegroups.com
Have you ever seen a 3 million line rdf file in a fileman word processing field? Well, here's one:


==[ WRAP ]==[ INSERT ]================< TEXT >===============[ <PF1>H=Help ]====
<?xml version="1.0" encoding="UTF-8"?>
         xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
         xmlns:xsd="http://www.w3.org/2001/XMLSchema#"
         xmlns:owl="http://www.w3.org/2002/07/owl#"
         xmlns="http://www.ihtsdo.org/"
         xml:base="http://www.ihtsdo.org/">

    <owl:Ontology rdf:about="">
        <rdfs:comment>
        Generated as OWL RDF/XML from SNOMED CT release files by Perl transform
        Input concepts file was             sct1_Concepts_Core_INT_20100731.txt
        Input stated relationships file was res1_StatedRelationships_Core_INT_20
        </rdfs:comment>
    </owl:Ontology>
<owl:ObjectProperty rdf:about="RoleGroup">
    <rdfs:label xml:lang="en">RoleGroup</rdfs:label>
</owl:ObjectProperty>
<owl:ObjectProperty rdf:about="SCT_116676008">
<=======T=======T=======T=======T=======T=======T=======T=======T=======T>======


I got pretty close to importing it.. MXML parsed the entire thing into its DOM...(pretty fast too) ..but then my code choked on something it shouldn't have (don't know what yet)... here's the log:

GTM>D FILEIN^C0XF2N

File Directory: /home/glilly/fmts/trunk/samples/  Replace fmts/trunk/samples/ Wi
th snomed/  Replace
   /home/glilly/snomed/
File Name: qds.rdf// res_StatedRDFXML_Core_INT_20100731.owl
STARTED: 3111204.013646
READING IN: res_StatedRDFXML_Core_INT_20100731.owl
3674881 LINES READ
ADDED: _:G086040803 _:S918005987 fmts:rdfSource _TXT_INCOMING_RDF_FILE_/home/gli
lly/snomed/res_StatedRDFXML_Core_INT_20100731.owl_381669951
2609883 XML NODES PARSED
PARSE COMPLETE AT 3111204.015206
 ELAPSED TIME: 680 SECONDS
 APPROXIMATELY 3838 NODES PER SECOND
INSERTING GRAPH: /home/glilly/snomed/res_StatedRDFXML_Core_INT_20100731.owl
Error. Not an RDF file. Cannot process.

Reply all
Reply to author
Forward
0 new messages