Improving search with linked data

19 views
Skip to first unread message

Alex Donnini

unread,
Aug 28, 2013, 12:27:34 PM8/28/13
to lmf-...@googlegroups.com
Hi all, 
This is a person in my dataset:
        <rdf:type rdf:resource="owl:NamedIndividual"/>
        <rdfs:label rdf:datatype="xsd:string">FRANCESCA DRAGOTTO</rdfs:label>
        <vcard:fn rdf:datatype="xsd:string">FRANCESCA DRAGOTTO</vcard:fn>
        <vcard:related rdf:resource="http://othersite/utente_data/getPubblicazioni/4077"/>
        <foaf:familyName rdf:datatype="xsd:string">DRAGOTTO</foaf:familyName>
        <foaf:givenName rdf:datatype="xsd:string">FRANCESCA</foaf:givenName>
        <vcard:family-name rdf:datatype="xsd:string">DRAGOTTO</vcard:family-name>
        <vcard:given-name rdf:datatype="xsd:string">FRANCESCA</vcard:given-name>
        <foaf:gender rdf:datatype="xsd:string">Female</foaf:gender>
        <foaf:mbox rdf:datatype="xsd:anyURI">mailto:xxx</foaf:mbox>
        <vcard:Email rdf:datatype="xsd:anyURI">mailto:xxx</vcard:Email>
        <foaf:phone rdf:datatype="xsd:string">1111</foaf:phone>
        <foaf:depiction>
                <foaf:Image rdf:about="http://othersite/assets/uploads/utenti/4077.jpg" />
        </foaf:depiction>
</foaf:Person>

In vcard:related I have an URI that returns temporary json output and contains person publications information:

My search core is:
@prefix foaf : <http://xmlns.com/foaf/0.1/> ;
@filter rdf:type is foaf:Person ;
  name = foaf:familyName :: xsd:string ;
  mbox = foaf:mbox :: xsd:anyURI ;
  pubblicazioni = vcard:related :: lmf:text_it ;

I would like to index publication data but the solr search result is in the following:

<?xml version="1.0" encoding="UTF-8"?> <response> <lst name="responseHeader"> <int name="status">0</int> <int name="QTime">2</int> <lst name="params"> <str name="indent">true</str> <str name="q">dragotto</str> <str name="_">1377706941217</str> <str name="wt">xml</str> </lst> </lst> <result name="response" numFound="1" start="0"> <doc> <str name="id">62526</str> <date name="lmf.indexed">2013-08-28T16:21:26.426Z</date> <date name="lmf.created">2013-08-25T16:33:15.54Z</date> <str name="lmf.uri">http://mysite/LMF/resource/personale_4077</str> <arr name="lmf.type"> <str>owl:NamedIndividual</str> <str>http://xmlns.com/foaf/0.1/Person</str> </arr> <arr name="name"> <str>DRAGOTTO</str> </arr> <arr name="mbox"> <str>mailto:xxx</str> </arr> <arr name="pubblicazioni"> <str>http://othersite/utente_data/getPubblicazioni/4077</str> </arr></doc> </result> </response>

Sergio Fernández

unread,
Aug 29, 2013, 3:30:43 AM8/29/13
to lmf-...@googlegroups.com, Alex Donnini
Hi Alex,

in you do not have that data locally, LDCache would try to retrieve it,
so then you could use it in other part of the system, like semantic
search. Further details about LDCache at:

http://marmotta.incubator.apache.org/ldcache/introduction.html

But LDCache can't make magic, since
http://othersite/utente_data/getPubblicazioni/4077 returns a 404.

So basically you have three options:

1) provide links that can be retrieved using the linked data principles

2) provide links that can be retrieved using a custom ldcache backend

3) manually import that data

Hope this would help you to understand the issue.

Cheers,
--
Sergio Fern�ndez
Salzburg Research
+43 662 2288 318
Jakob-Haringer Strasse 5/II
A-5020 Salzburg (Austria)
http://www.salzburgresearch.at

Alex Donnini

unread,
Aug 29, 2013, 7:33:54 AM8/29/13
to lmf-...@googlegroups.com, Alex Donnini
Hi Sergio 
othersite is not the true domain :) I would like to avoid to publish data before they are official. I wrote the true domain and I had someone using my site so I changed it.
returns a JSON structure like this:
{
    "files_data": {
        "titolo": "Elenco pubblicazioni",
        "relazione": "pubblicazioni",
        "numero": "35",
        "pubblicazioni": [
            {
                "citazione": "Dragotto, F. (2007). Deonomastica e processi della creazione lessicale [Altro].",
                "data": "2007",
                "id": "PUBBLICAZIONE_26581",
                "nome": "Francesca",
                "cognome": "Dragotto",
                "personale_id": "4077",
                "handle": "2108/43110",
                "metadata1": "Altro",
                "metadata2": "Letteratura",
                "metadata3": "Settore L-LIN/01 - Glottologia e Linguistica",
                "metadata4": null,
                "metadata5": "Dragotto, F",
                "metadata6": null,
                "metadata7": "Deonomastica e processi della creazione lessicale",
                "metadata8": null,
                "metadata9": null,
                "metadata10": null
            },
...
}
Sergio Fern�ndez
Reply all
Reply to author
Forward
0 new messages