Hey guys,
I am importing the data files into the DB folder using the source code.
But as you know, the total data size is too large to handle on my local PC (it is not a workstation).
When I tried to import the RefSeq data files (> 100 GB), I estimated it would take more than a month.
So I want to import only a part of the RefSeq data files (e.g. plant only).
Is that okay? And without importing the UniProt data files, can I still retrieve RefSeq information by accession ID?
I ask because, after importing part of the RefSeq data, I tried to import uniprot_sprot.xml and got several exceptions (SEVERE: Exception retrieving protein).
Is there any way to avoid these exceptions?
Any advice on building the DB from only a part of the total data would be appreciated.
Thank you. :)