If the d is not small ~ < 2*tag size (for those tag size < 50bp), and the model image in PDF shows clean bimodal shape, d may be good. And several bp differences on d shouldn't affect the peak detection on general transcription factor ChIP-seq much.
However, for Pol2 or histone marks, things may be different. Pol2 is moving so it's not appropriate to say there is a fixed fragment size. I don't know the correct answer. For histone mark ChIP-seq, since they would have a underlying characteristic 147bp resolution for a nucleosome size, you can simply skip model building and use "--shiftsize 74 --nomodel" instead. Also if you want, you can try other software like SICER and NPS.
Best,
Tao
> --
> You received this message because you are subscribed to the Google Groups "MACS announcement" group.
> To post to this group, send email to macs-ann...@googlegroups.com.
> To unsubscribe from this group, send email to macs-announcem...@googlegroups.com.
> For more options, visit this group at http://groups.google.com/group/macs-announcement?hl=en.
>
Tao Liu
Research Fellow
Dept of Biostats and Comp Bio, DFCI / HSPH
450 Brookline Ave., Boston, MA 02215
--
You received this message because you are subscribed to the Google Groups "MACS announcement" group.
To post to this group, send email to macs-ann...@googlegroups.com.
To unsubscribe from this group, send email to macs-announcem...@googlegroups.com.
For more options, visit this group at http://groups.google.com/group/macs-announcement?hl=en.
Tao,
I interpret 1.0 FDR as not a good peak at all and 0.4 FDR as perhaps.
Is this plausable?
Lana
--
You received this message because you are subscribed to the Google Groups "MACS announcement" group.
To view this discussion on the web visit https://groups.google.com/d/msg/macs-announcement/-/eLoegsiFiCkJ.
Hi Tao,
I am having a similar issue: tag size is 75 bp (as expected) but d=76 which I assume is somewhat lower than expected. The distribution is binomial but with a smaller summit (see attach). Using these settings:
macs14 -tR1_trimmed.map -c input_trimmed.map -n R1_macs -f BOWTIE --llocal 10000 --verbose 3 --diag
I got the following results:
# effective genome size = 2.70e+09 # band width = 300 # model fold = 10,30 # pvalue cutoff = 1.00e-05 # Large dataset will be scaled towards smaller dataset. # Range for calculating regional lambda is: 1000 bps and 10000 bps # tag size is determined as 74 bps # total tags in treatment: 38801194 # tags after filtering in treatment: 26535992 # maximum duplicate tags at the same position in treatment = 1 # Redundant rate in treatment: 0.32 # total tags in control: 32842482 # tags after filtering in control: 31898554 # maximum duplicate tags at the same position in control = 1 # Redundant rate in control: 0.03 # d = 76 chr start end length summit tags -10*log10(pvalue) fold_enrichment FDR(%) chr1 705945 706381 437 73 20 64.85 8.44 65.95 chr1 947036 947283 248 100 12 57.41 9.37 58.3
Changing the model fold to 10,50 only increased the number of FDR=100%, and increasing bandwidth to 500 (expected fragment size) made it worst and reduced d.
is that d=76 an acceptable model? What can I do to improve it?
Thanks.
To view this discussion on the web visit https://groups.google.com/d/msg/macs-announcement/-/7S98xOVGtigJ.
# This file is generated by MACS version 2.0.9 20111102 (tag:alpha) |
| # ARGUMENTS LIST: |
| # name = ../MACS_results/test1/ |
| # format = BAM |
| # ChIP-seq file = R1_trimmed_sorted.bam |
| # control file = Hela_input_trimmed_sorted.bam |
| # effective genome size = 2.70e+09 |
| # band width = 300 |
| # model fold = 10,30 |
| # qvalue cutoff = 1.00e-02 |
| # Larger dataset will be scaled towards smaller dataset. |
| # Range for calculating regional lambda is: 1000 bps and 10000 bps |
| # Broad region calling is off |
| # tag size is determined as 74 bps |
| # total tags in treatment: 38801195 |
| # tags after filtering in treatment: 35268985 |
| # maximum duplicate tags at the same position in treatment = 2 |
| # Redundant rate in treatment: 0.09 |
| # total tags in control: 32842496 |
| # tags after filtering in control: 32275086 |
| # maximum duplicate tags at the same position in control = 2 |
| # Redundant rate in control: 0.02 |
| # d = 74 |
António
--
You received this message because you are subscribed to the Google Groups "MACS announcement" group.
To view this discussion on the web visit https://groups.google.com/d/msg/macs-announcement/-/rvS1w0m1RqUJ.