dbprep tool

48 views
Skip to first unread message

CentreFrançoisBaclesse - Bioinfo

unread,
Sep 16, 2015, 9:08:24 AM9/16/15
to CAVA User Group
Hello,

I try to create transcript database, but I got this error message :

$ python dbprep.py -i transcripts.txt -e 75 -o outputGRCh37

-----------------------------------------------------------------
CAVA v1.1.1 dbprep (database preparation script) is now running.
Started:  2015-09-16 15:42:26.563913 

Ensembl version:  75
Reference genome: GRCh37

35 transcript IDs read from transcripts.txt.

Downloading Ensembl database... OK
Extracting transcript data... Traceback (most recent call last):
  File "dbprep.py", line 405, in <module>
    createFile(options)
  File "dbprep.py", line 28, in createFile
    n=generateTranscriptDB(options)
  File "dbprep.py", line 146, in generateTranscriptDB
    finalizeAndOutput(outfile,transcript)
UnboundLocalError: local variable 'transcript' referenced before assignment

Best, 

Márton Münz

unread,
Sep 17, 2015, 8:58:12 AM9/17/15
to CAVA User Group
Hi,

The dbprep command you used is correct and is working well on our computer. Could you please send me the transcripts.txt file so that I can try with that as well? I'm trying to reproduce the error you got.

Thanks!
Best, 
Marton

CentreFrançoisBaclesse - Bioinfo

unread,
Sep 17, 2015, 10:23:04 AM9/17/15
to CAVA User Group
Hi, 


I have found why the command not working. 
I filled transcript file with HGNC RefSeq ID. 

I would like to know if it's possible to annotate variants with RefSeq Id. 


Best, 

Márton Münz

unread,
Sep 17, 2015, 11:02:36 AM9/17/15
to CAVA User Group
Hi,
No, CAVA currently only supports Ensembl transcripts.
Best,
Marton

Elise Ruark

unread,
Sep 17, 2015, 11:08:52 AM9/17/15
to Márton Münz, CAVA User Group
To annotate variants with RefSeq IDs, you would need to find the corresponding Ensembl ENST ID using something like Ensembl's BioMart tool. You can then supply those to CAVA, and it will return the HGNC symbol (e.g. BRCA1) in the annotation.
The caveat here, of course, is that RefSeq transcript sequence is not necessarily the same as Ensembl transcript sequence (which is reference genome sequence), for instance when the RefSeq transcript contains a major allele which is not the reference genome allele.
Best,
Elise

--
You received this message because you are subscribed to the Google Groups "CAVA User Group" group.
To unsubscribe from this group and stop receiving emails from it, send an email to cava-user-gro...@googlegroups.com.
Visit this group at http://groups.google.com/group/cava-user-group.
To view this discussion on the web visit https://groups.google.com/d/msgid/cava-user-group/7191134c-db66-4729-b79f-a976df6d8460%40googlegroups.com.

For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages