RefSeq or GENCODE

40 views
Skip to first unread message

Gmail Marcin

unread,
Jun 8, 2017, 9:39:31 AM6/8/17
to methylkit_...@googlegroups.com
Hi,
I have RRBS question related specifically to the annotation strategy.  So I am contemplating to switch from RefSeq to GENCODE  (humans) as a source of annotation and I would like to hear people thoughts about it. For RRBS I think Gencode should be better, as what we are looking for are also regions between genes (intragenic regions).

(Further reading about RefSeq vs GENCODE here http://iobio.io/2016/03/21/gene-sets/ )

http://www.gencodegenes.org/releases/current.html
I was adviced to use  Basic gene annotation but I do not see the info about introns. What do I miss compared to RefSeq protocol taken from here:

http://zvfak.blogspot.no/2012/09/downloading-annotation-files-for.html

Many thanks,

Marcin


Altuna Akalin

unread,
Jun 8, 2017, 12:15:16 PM6/8/17
to methylkit_...@googlegroups.com
I think gencode is better, used to be more complete but refseq might have caught up. You can download gencode bed files from UCSC, GFF/GTF to bed conversion might be problematic. 

Best,
Altuna

--
You received this message because you are subscribed to the Google Groups "methylkit_discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to methylkit_discussion+unsub...@googlegroups.com.
To post to this group, send email to methylkit_discussion@googlegroups.com.
Visit this group at https://groups.google.com/group/methylkit_discussion.
For more options, visit https://groups.google.com/d/optout.

Gmail Marcin

unread,
Jun 9, 2017, 3:59:43 AM6/9/17
to methylkit_...@googlegroups.com
Thank you Altuna. If someone else has some experience in comparing both, it would be great to hear. Especially for the regions between genes.

Thinking about it the intragenic regions, is there any way to twist the readTranscriptFeatures function so it also gives this information and not only introns, exons, promotor regions and TSS?

Many thanks,
Marcin



On 8 June 2017 at 18:14, Altuna Akalin <aak...@gmail.com> wrote:
I think gencode is better, used to be more complete but refseq might have caught up. You can download gencode bed files from UCSC, GFF/GTF to bed conversion might be problematic. 

Best,
Altuna
On Thu, Jun 8, 2017 at 3:39 PM, Gmail Marcin <marcin.b...@gmail.com> wrote:
Hi,
I have RRBS question related specifically to the annotation strategy.  So I am contemplating to switch from RefSeq to GENCODE  (humans) as a source of annotation and I would like to hear people thoughts about it. For RRBS I think Gencode should be better, as what we are looking for are also regions between genes (intragenic regions).

(Further reading about RefSeq vs GENCODE here http://iobio.io/2016/03/21/gene-sets/ )

http://www.gencodegenes.org/releases/current.html
I was adviced to use  Basic gene annotation but I do not see the info about introns. What do I miss compared to RefSeq protocol taken from here:

http://zvfak.blogspot.no/2012/09/downloading-annotation-files-for.html

Many thanks,

Marcin


--
You received this message because you are subscribed to the Google Groups "methylkit_discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to methylkit_discussion+unsubscrib...@googlegroups.com.

To post to this group, send email to methylkit_discussion@googlegroups.com.
Visit this group at https://groups.google.com/group/methylkit_discussion.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "methylkit_discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to methylkit_discussion+unsub...@googlegroups.com.
To post to this group, send email to methylkit_discussion@googlegroups.com.
Visit this group at https://groups.google.com/group/methylkit_discussion.
For more options, visit https://groups.google.com/d/optout.



--
smoking cigarettes causes cancer

Altuna Akalin

unread,
Jun 9, 2017, 4:03:51 AM6/9/17
to methylkit_...@googlegroups.com
you can read the bed12 file as a regular bed file (genomation::readBed), just containing start-end positions of the genes. Then, you have to use GenomicRanges functions to I thinks it is called "gaps" or "gap" or something like that to get the regions between genes. But some genes are very long, maybe you want to filter genes based on evidence or protein coding capacity. 

Best,
Altuna

--
smoking cigarettes causes cancer

Gmail Marcin

unread,
Jun 9, 2017, 10:25:50 AM6/9/17
to methylkit_...@googlegroups.com
I see, I found also this: https://stackoverflow.com/questions/29253412/finding-intergenic-regions   if someone would need a detailed recipe so similar can be done on UCMC Gencode table.  They reduce also + - to *.

Many thanks,
Marcin

Kristina Santucci

unread,
Apr 7, 2026, 4:05:53 AMApr 7
to methylkit_discussion
Hi Altuna,

Could you point me into the right direction to get the bed12 file for Gencode v49 GRCh38? I can't seem to find this and I'm hoping to avoid any manual conversions.

Thank you in advanced!

Cheers,
Kristina.

On Friday, June 9, 2017 at 2:15:16 AM UTC+10 Altuna Akalin wrote:
I think gencode is better, used to be more complete but refseq might have caught up. You can download gencode bed files from UCSC, GFF/GTF to bed conversion might be problematic. 

Best,
Altuna

On Thu, Jun 8, 2017 at 3:39 PM, Gmail Marcin <marcin.b...@gmail.com> wrote:
Hi,
I have RRBS question related specifically to the annotation strategy.  So I am contemplating to switch from RefSeq to GENCODE  (humans) as a source of annotation and I would like to hear people thoughts about it. For RRBS I think Gencode should be better, as what we are looking for are also regions between genes (intragenic regions).

(Further reading about RefSeq vs GENCODE here http://iobio.io/2016/03/21/gene-sets/ )

http://www.gencodegenes.org/releases/current.html
I was adviced to use  Basic gene annotation but I do not see the info about introns. What do I miss compared to RefSeq protocol taken from here:

http://zvfak.blogspot.no/2012/09/downloading-annotation-files-for.html

Many thanks,

Marcin


--
You received this message because you are subscribed to the Google Groups "methylkit_discussion" group.
To unsubscribe from this group and stop receiving emails from it, send an email to methylkit_discussion+unsub...@googlegroups.com.
To post to this group, send email to methylkit_...@googlegroups.com.

alex....@gmail.com

unread,
Apr 7, 2026, 9:58:32 AMApr 7
to methylkit_discussion
Hi Kristina,

You can follow the original instructions from Altuna: https://zvfak.blogspot.com/2012/09/downloading-annotation-files-for.html 
Conversions are usually necessary if you have specific requirements or use a less well-annotated organism. 

Best,
Alex
Reply all
Reply to author
Forward
0 new messages