Protein and accession

49 views
Skip to first unread message

Darèios89

unread,
Apr 18, 2013, 4:42:53 AM4/18/13
to bio4j...@googlegroups.com
If i have to search proteins by their synonyms, i think i have to use a code line such as:

List <ProteinNode> proteins = nodeRetriever.getProteinsByFullName(protein);

"protein" is a string with its name. I downloaded the folder uniref50.xml and i put as arg[0] it on Eclipse, but i have exceptions and i don't know why. They are parsing errors but what i have to do if the line code is right? Why there are these exceptions? Maybe it depends on the fact i don't use accessions? I' m desperate...

graphservice hashcode: -1836727
java.lang.RuntimeException: org.apache.lucene.queryParser.ParseException: Cannot parse 'Glycine dehydrogenase [decarboxylating], mitochondrial': Encountered " "]" "] "" at line 1, column 38.
Was expecting one of:
    "TO" ...
    <RANGEIN_QUOTED> ...
    <RANGEIN_GOOP> ...
    
at org.neo4j.index.impl.lucene.IndexType.query(IndexType.java:290)
at org.neo4j.index.impl.lucene.LuceneIndex.query(LuceneIndex.java:226)
at com.era7.bioinfo.bio4j.model.util.NodeRetriever.getProteinsByFullName(NodeRetriever.java:156)
at src.main.java.com.era7.bioinfo.bio4j.drug.Prova.main(Prova.java:136)
.
.
.
.
.



Darèios89

unread,
Apr 18, 2013, 4:54:20 AM4/18/13
to bio4j...@googlegroups.com
I forgot to say i' m on Windows 8.

Pablo Pareja Tobes

unread,
Apr 23, 2013, 4:59:41 AM4/23/13
to bio4j...@googlegroups.com
Hi Dareios,

You should first have a deeper look at the documentation and the project in general because it looks like you're trying to use the DB without even previously importing it...! :)
Please have a look here:
in order to see a step-by-step guide on how to import Bio4j into your own machine.

Cheers,

Pablo


--
Has recibido este mensaje porque estás suscrito al grupo "bio4j-user" de Grupos de Google.
Para anular la suscripción a este grupo y dejar de recibir sus correos electrónicos, envía un correo electrónico a bio4j-user+...@googlegroups.com.
Para obtener más opciones, visita https://groups.google.com/groups/opt_out.
 
 



--
Pablo Pareja Tobes

Darèios89

unread,
Apr 23, 2013, 7:42:56 AM4/23/13
to bio4j...@googlegroups.com
I Pablo! I have read the instructions, but i don't understand what the last instruction does, also because i' m on windows and it refers to linux. I have downloaded 
uniprot_sprot.xml and put it as folder of the DB with the code:

String folder="C:\\Users\\Dario\\workspace\\Dbio4j\\src\\src\\main\\java\\com\\era7\\bioinfo\\bio4j\\drug\\uniprot_sprot.xml";

manager = new Bio4jManager(folder);
NodeRetriever nodeRetriever = new NodeRetriever(manager);

Now i think it' s right because i see other files into the uniprot folder....such as...index....neostore....

Pablo Pareja Tobes

unread,
Apr 24, 2013, 2:49:49 AM4/24/13
to bio4j...@googlegroups.com
Hi Dareios,

The importing process for the whole DB takes at least ~ 20 hours in a personal computer. 
Besides, the DB folder should have a size of ~ 100GB if you imported all modules.

Cheers,

Pablo

Darèios89

unread,
Apr 24, 2013, 6:59:44 AM4/24/13
to bio4j...@googlegroups.com
Yes Pablo. ..but i need to do the last passage of importing bio4j? The 8th one. Because i downloaded part of the DB...and i want to know if putting as manager parameter the DB folder let me to search on the DB.

Pablo Pareja Tobes

unread,
Apr 25, 2013, 4:41:27 AM4/25/13
to bio4j...@googlegroups.com
Hi Dareios,

You have to follow all the steps specified here:
If you didn't execute step 9 you didn't import anything at all into the DB.

Cheers,

Pablo


On Wed, Apr 24, 2013 at 12:59 PM, Darèios89 <dariog...@gmail.com> wrote:
Yes Pablo. ..but i need to do the last passage of importing bio4j? The 8th one. Because i downloaded part of the DB...and i want to know if putting as manager parameter the DB folder let me to search on the DB.
--
Has recibido este mensaje porque estás suscrito al grupo "bio4j-user" de Grupos de Google.
Para anular la suscripción a este grupo y dejar de recibir sus correos electrónicos, envía un correo electrónico a bio4j-user+...@googlegroups.com.
Para obtener más opciones, visita https://groups.google.com/groups/opt_out.


Darèios89

unread,
Apr 25, 2013, 11:38:35 AM4/25/13
to bio4j...@googlegroups.com
Thanks...i read at step 9:

"Different log files will be created at jar folder level regarding the progress of the data importation"

Where is exactly this jar folder?

Pablo Pareja Tobes

unread,
Apr 26, 2013, 3:16:10 AM4/26/13
to bio4j...@googlegroups.com
Hi Dareios,

That refers to the folder where you're running the jar file.

Cheers,

Pablo

Darèios89

unread,
Apr 26, 2013, 9:58:43 AM4/26/13
to bio4j...@googlegroups.com
Good, thanks Pablo, i downloaded Bio4jManager and Bio4jTools and i can reach them from the file system, but when i run the instruction:

sudo jdk1.6.0_23/bin/java -d64 -Xmx30G -jar ExecuteBio4jTool.jar executionsBio4j.xml &

I don't indicate a folder, so does it work automatically?

Pablo Pareja Tobes

unread,
Apr 29, 2013, 4:06:43 AM4/29/13
to bio4j...@googlegroups.com
Hi Dareios,

Please have a look at the step: 8b. Tuning executions.xml file

Cheers,

Pablo

Darèios89

unread,
Apr 29, 2013, 6:24:24 AM4/29/13
to bio4j...@googlegroups.com
Hi Pablo, so if i want to inlcude only uniref i think the DownloadAndPrepareBio4jSources should be like this:





And the execution:

<scheduled_executions>
<execution>
<class_full_name>com.era7.bioinfo.bio4j.programs.InitBio4jDB</class_full_name>
<arguments>
<argument>bio4jdb</argument>
<argument>batchInserter.properties</argument>
</arguments>
</execution>

<execution>
<class_full_name>com.era7.bioinfo.bio4j.programs.ImportUniref</class_full_name>
<arguments>
<argument>uniref100.xml</argument>
<argument>uniref90.xml</argument>
<argument>uniref50.xml</argument>
<argument>bio4jdb</argument>
<argument>batchInserter.properties</argument>
</arguments>
</execution>
</scheduled_executions>

Maybe it is a problem i have a pc with only 1G RAM.
 

Pablo Pareja Tobes

unread,
Apr 30, 2013, 3:45:26 AM4/30/13
to bio4j...@googlegroups.com
Hi Dareios,

Please have a look at the schema shown in step 8b :https://github.com/bio4j/Bio4j/wiki/Importing-bio4j , you need to have Uniprot already imported in your DB prior to import Uniref module.

Yeah, having only 1G RAM could likely be an issue.

Cheers,

Pablo

Zaynab Mousavian

unread,
Oct 23, 2013, 2:14:10 AM10/23/13
to bio4j...@googlegroups.com, dariog...@gmail.com
Hi,

I have read this conversation, and i have found that you use bio4j DB in Windows.
Is it possible? How can i import this database on Windows OS. In the importing section of Wiki site, the instructions of importing bio4j on a machine with Linux OS only are explained.

Thanks

Pablo Pareja Tobes

unread,
Oct 24, 2013, 6:10:19 AM10/24/13
to bio4j...@googlegroups.com
Hi Zaynab,

The different steps from the importing process in both Windows and Linux are virtually the same. 
Since Java is a multi-platform language and the DB system is embedded in the jar file your only requirement in order to import the last version of Bio4j is having installed Java 7.
Apart from that, the only thing I can think about that would be different depending on your OS choice is that you'll have to use equivalent programs in windows to download some of the files through FTP or decompress/extract the .gz files.

Cheers,

Pablo


--
Has recibido este mensaje porque estás suscrito al grupo "bio4j-user" de Grupos de Google.
Para anular la suscripción a este grupo y dejar de recibir sus correos electrónicos, envía un correo electrónico a bio4j-user+...@googlegroups.com.
Para obtener más opciones, visita https://groups.google.com/groups/opt_out.
Reply all
Reply to author
Forward
0 new messages