GC Skew

11 views
Skip to first unread message

Zeynep Çalıcı

unread,
Jul 5, 2021, 5:38:15 AM7/5/21
to iu-...@googlegroups.com
Hello  everyone

I hope your vacation is going well. I know our lessons are over, but I want to consult you about something. I hope it's okay with that.
How do we decide the window size when drawing a GC skew graph? In a genome where I've tried many window sizes, I still can't see any specific and sharp transitions. Do you have any ideas?

Sincerely 

Dr. Andrés Aravena

unread,
Jul 5, 2021, 5:50:25 AM7/5/21
to IU Computing in Molecular Biology
What I do is this:

Let's say that you have the genome as a vector of char, as we did in class. Then I create a new_vector with one number for each nucleotide. I use 1 for G, -1 for C, and 0 for A and T. I do like this

nucl_to_number <- c(A=0, C=-1, G=1, T=0)
new_vector <- nucl_to_number[genome]

Here new_vector and genome have the same length. This is in practice a window of size 1. We will not see anything here. Instead, I plot

cumsum(new_vector)

and I look for the minimum and maximum.

What do you think?

Zeynep Çalıcı

unread,
Jul 5, 2021, 6:21:36 AM7/5/21
to iu-...@googlegroups.com
I think it's a great idea.  I will definitely apply this. 

However in some archaea there may be more than one replication origin.  Can this also be used to find the replication origin number?  Is finding only one minimum and one maximum point sufficient to find the number of replication origin? 


5 Tem 2021 Pzt 12:50 tarihinde Dr. Andrés Aravena <andres....@istanbul.edu.tr> şunu yazdı:

Yasal Uyarı: Bu elektronik posta, buradan  ulaşabileceğiniz Koşul ve Şartlara tabidir.
Disclaimer: This email is subject to the Terms and Conditions available here.
------------------------------------------------------------------
Lütfen bu e-postayı yazdırmadan önce çevreye olan etkisini dikkate alınız. Unutmayınız ki; dünyadaki kağıt tüketiminin yarısı kazanılırsa, her yıl 8 milyon hektar orman alanı (Ege Bölgesi büyüklüğünde) yok olmaktan kurtulacaktır.
------------------------------------------------------------------
Please take into account the impact on the environment before printing this e-mail. Do not forget that if we reduce our paper consumption by half, every year 8 million hectares of forest (an area the size of Aegean Region in Turkey) will be saved from vanishing.
------------------------------------------------------------------

--
You received this message because you are subscribed to the Google Groups "IU Computing in Molecular Biology" group.
To unsubscribe from this group and stop receiving emails from it, send an email to iu-cmb+un...@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/iu-cmb/1f4a0e1d-5527-43b2-873a-dd628f21d326n%40googlegroups.com.

Andres Aravena

unread,
Jul 5, 2021, 6:50:30 AM7/5/21
to IU Computing in Molecular Biology


On Mon, 5 Jul 2021 at 13:21, Zeynep Çalıcı <zeynepca...@gmail.com> wrote:
I think it's a great idea.  I will definitely apply this. 

However in some archaea there may be more than one replication origin.  Can this also be used to find the replication origin number?  Is finding only one minimum and one maximum point sufficient to find the number of replication origin? 

Good question. I know this technique for bacteria, and I know archaea have many replication origins. I'm not sure if this works on archaea, but it may be worth trying.

Notice that just min or max GC-skew is not enough. Only one is the origin, and this plot does not tell you which one. For that you can check, for example:
  • if the AT content is high in that region. Low GC content helps splitting the double helix.
  • which genes are close to the putative replication point. Sometimes DnaA is close
hope it helps

A

Zeynep Çalıcı

unread,
Jul 5, 2021, 6:56:50 AM7/5/21
to iu-...@googlegroups.com
I took notes of what you said. Thank you so much!

Sincerely

Andres Aravena <andres....@istanbul.edu.tr>, 5 Tem 2021 Pzt, 13:50 tarihinde şunu yazdı:
--
You received this message because you are subscribed to the Google Groups "IU Computing in Molecular Biology" group.
To unsubscribe from this group and stop receiving emails from it, send an email to iu-cmb+un...@googlegroups.com.

Buse Özden

unread,
Jul 5, 2021, 8:04:09 AM7/5/21
to IU Computing in Molecular Biology
I believe specifying local maxima and minima could be the solution in the case of archaea. The plot of the cumulative sum would display multiple local extrema along with absolute extrema since archaea have many replication origins. I found a function called extrema that returns local extrema. Maybe that would be useful for your case. 

Hope this helps. Best

5 Temmuz 2021 Pazartesi tarihinde saat 13:56:50 UTC+3 itibarıyla S.Zeynep Çalıcı şunları yazdı:

Zeynep Çalıcı

unread,
Jul 5, 2021, 10:36:27 AM7/5/21
to iu-...@googlegroups.com
When I plot a graph of the sum, I see too many maximum and minimum points.  I doubt they are all have replication origins. I don't think they are all replication origins. Moreover, it goes from negative to positive or from positive to negative at many points in the skew graphic.  

I thought there some correlation with the window size.  I tried many different sizes.  However again I couldn't see any specific transition.  
I'll look into the function you mentioned. 

Thank you very much!

5 Tem 2021 Pzt 15:04 tarihinde Buse Özden <buse....@ogr.iu.edu.tr> şunu yazdı:
Reply all
Reply to author
Forward
0 new messages