Running DaPars with sample files without leading "chr" and Ensembl gene model

56 views
Skip to first unread message

Farica Zhuang

unread,
Aug 27, 2021, 7:15:27 PM8/27/21
to DaPars
Hi,

When running DaPars with Ensembl gene model which has no "chr" prefix and sample files with no leading "chr" to match, I found that I obtain an empty output file. I then looked into the code https://github.com/ZhengXia/dapars/blob/master/src/DaPars_main.py and saw that on line 495, the if condition requires that each line in the sample files to start with "chr" when it says "if....line[0:3] == 'chr'".

Is it true that the model requires leading "chr" in the input files? And if so, is it safe to add leading "chr" for the input sample files and Ensemble gene model file?

Farica

Shen Tian

unread,
Aug 27, 2021, 10:42:43 PM8/27/21
to Farica Zhuang, DaPars
Hi Farica,

Yes you can add chr in your input file. I added and it worked.

Hope it helps.

Shen

Sent from my iPhone

On 28 Aug 2021, at 07:15, Farica Zhuang <far...@seas.upenn.edu> wrote:

Hi,

When running DaPars with Ensembl gene model which has no "chr" prefix and sample files with no leading "chr" to match, I found that I obtain an empty output file. I then looked into the code https://github.com/ZhengXia/dapars/blob/master/src/DaPars_main.py and saw that on line 495, the if condition requires that each line in the sample files to start with "chr" when it says "if....line[0:3] == 'chr'".

Is it true that the model requires leading "chr" in the input files? And if so, is it safe to add leading "chr" for the input sample files and Ensemble gene model file?

Farica

--
You received this message because you are subscribed to the Google Groups "DaPars" group.
To unsubscribe from this group and stop receiving emails from it, send an email to dapars+un...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/dapars/a39047bd-9765-427b-ab2c-4602a63ee83en%40googlegroups.com.

Farica Zhuang

unread,
Aug 28, 2021, 12:03:58 AM8/28/21
to DaPars

Hi Shen,

Thanks for the reply! I tried adding leading chr in my input files too and it worked as you said, thank you! But do you happen to know if it's safe to simply add the "chr"? I'm wondering because some chromosomes are different, e.g. I read that chrM in hg19 is named MT in GRCh37, so making it chrMT doesn't make it equal to chrM in the hg19 file. My guess is it doesn't matter but just wanted to make sure.

Farica

Shen Tian

unread,
Aug 28, 2021, 12:17:11 AM8/28/21
to Farica Zhuang, DaPars
Hey Farica,

Sorry I’m working with a genome of a non-model organism that I annotated myself, so I’m not quite familiar with human genome.

But I guess it won’t be a problem to match the two systems if you get rid of the three letters later when u finish your analysis? 

Thanks,
Shen

Sent from my iPhone

On 28 Aug 2021, at 12:04, Farica Zhuang <far...@seas.upenn.edu> wrote:



Farica Zhuang

unread,
Sep 22, 2021, 8:58:15 PM9/22/21
to DaPars
Yes good idea, thank you!
Reply all
Reply to author
Forward
0 new messages