Question about AQUAS pipeline ( the version of AQUAS pipeline is download at 2017/02/17)

150 views
Skip to first unread message

洪彧丞

unread,
Sep 13, 2017, 1:48:32 AM9/13/17
to idr-discuss
Hello:
       
       I had tried to use AQUAS pipeline. However, I suffered a trouble:
       
       My data had 3 replicates and each of them had a chip and an input, so, there are "marked_SOC1_chip_rep1.bam", "marked_SOC1_input_rep1.bam", "marked_SOC1_chip_rep2.bam", "marked_SOC1_input_rep2.bam",
       "marked_SOC1_chip_rep3.bam" and "marked_SOC1_input_rep3.bam".
     
       But, when I put my data into AQUAS pipeline, it seems that the AQUAS pipeline couldn't get the "marked_SOC1_input_rep3.bam"(the input of replicate 3).
       
       This is my command:
       bds chipseq.bds -species ath10 -se -bam1 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_chip_rep1.bam -bam2 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_chip_rep2.bam -bam3  /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_chip_rep3.bam -ctl_bam1 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep1.bam -ctl_bam2 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep2.bam -ctl_bam3 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep3.bam -out_dir ../GSE45846_SOC1 -idr_thresh 0.05 

       I couldn't upload my final log file because Google will stop it, thank you for your help. 

洪彧丞

unread,
Sep 13, 2017, 2:05:31 AM9/13/17
to idr-discuss
Attach is my final log file, I had changed the Filename Extension to .txt , please modify .txt to .js and .html when you want to check it.

洪彧丞於 2017年9月13日星期三 UTC+8下午1時48分32秒寫道:
chipseq.bds.20170912_225824_091.dag.txt
chipseq.bds.20170912_225824_091.report.txt

Jin

unread,
Sep 13, 2017, 4:00:03 PM9/13/17
to idr-d...@googlegroups.com
Hello,

Number of controls exceeded number of chip replicates. Please reduce number of controls by merging some of them.

For example, you can merge 2nd and 3rd controls with:

-ctl_bam1 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep1.bam -ctl_bam2:1 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep2.bam -ctl_bam2:2 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep3.bam

Or merge all controls with:

-ctl_bam1:1 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep1.bam -ctl_bam1:2 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep2.bam -ctl_bam1:3 /home/rommelhong/Chip-seq/picard/GSE45846_SOC1/marked_SOC1_input_rep3.bam

Thanks,

Jin

--
You received this message because you are subscribed to the Google Groups "idr-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to idr-discuss+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Jin

unread,
Sep 13, 2017, 5:29:46 PM9/13/17
to idr-d...@googlegroups.com
Oops sorry,

Merging only works for FASTQs in the pipeline. You need to manually merge BAMs before pushing them into the pipeline.

Please forget about the example command line parameters I wrote previously. Just match number of chip replicates and controls or use a single control.

-bam1 BAM1 -bam2 BAM2 -ctl_bam1 CTL_BAM1 -ctl_bam2 CTL_BAM2

or

-bam1 BAM1 -bam2 BAM2 -ctl_bam1 CTL_BAM1


Thanks,

Jin


洪彧丞

unread,
Sep 14, 2017, 4:48:05 AM9/14/17
to idr-discuss
Hello, thanks for your reply,

But I still have some questions:

According to your advice, it seems that AQUAS pipeline will limit the numbers of controls (inputs) and experiment (chips). What is the maximum of the input and chip number?

In addition, I have 3 replicated data, and all data have own chip data and input data.

Therefore, I should type the command like this?

-bam1 BAM1 -bam2 BAM2 -bam3 BAM3 -ctl_bam1 CTL_BAM1 -ctl_bam2 CTL_BAM2 -ctl_bam3 CTL_BAM3

 If this command is correct, why aquas pipeline can not read the parameter of rep3 input?

Thank you for your help.


Jin於 2017年9月14日星期四 UTC+8上午5時29分46秒寫道:
To unsubscribe from this group and stop receiving emails from it, send an email to idr-discuss...@googlegroups.com.

Jin

unread,
Sep 18, 2017, 4:46:29 PM9/18/17
to idr-d...@googlegroups.com
You were right. I made a hotfix for this problem. It's in the branch named "can_use_more_than_two_ctrls".

$ git checkout can_use_more_than_two_ctrls

Please let me know if it works so that I can merge that branch to master.

Thanks,

Jin

To unsubscribe from this group and stop receiving emails from it, send an email to idr-discuss+unsubscribe@googlegroups.com.

洪彧丞

unread,
Sep 19, 2017, 2:31:14 AM9/19/17
to idr-discuss
OK, I'll try it and tell you the result.

Thank you very much.

Yu-Cheng Hung

Jin於 2017年9月19日星期二 UTC+8上午4時46分29秒寫道:

洪彧丞

unread,
Sep 27, 2017, 12:22:25 AM9/27/17
to idr-discuss
Hello, sorry for late reply.

I have tried the branch named "can_use_more_than_two_ctrls" and it could work normally.

But, I have two questions:

First, two warnings appeared like picture below when I install my genome data (ath10, Arabidopsis thaliana):


Second, the final report html (ex. GSE51537_JAG_report.html) doesn't have the information of ctl1, ctl2, ctl3 like picture below:


Thank you,


Yu-Cheng Hung


Jin於 2017年9月19日星期二 UTC+8上午4時46分29秒寫道:
You were right. I made a hotfix for this problem. It's in the branch named "can_use_more_than_two_ctrls".

Jin

unread,
Sep 30, 2017, 9:15:46 PM9/30/17
to idr-d...@googlegroups.com
1) You can use your own blacklist BED (with `-blacklist  [BED_GZ]`) if you want your final IDR peaks to be filtered through it. You can ignore the second warning. The latest pipeline no longer uses unique mappability tracks.

2) Please pull fix commit d23864655b16aaa3e0fc5385963f3c707380e135 from the "branch can_use_more_than_two_ctrls" and resume pipelines. Please let me know if it works.

Thanks,

Jin

Virus-free. www.avast.com

To unsubscribe from this group and stop receiving emails from it, send an email to idr-discuss+unsubscribe@googlegroups.com.

洪彧丞

unread,
Oct 4, 2017, 1:44:38 AM10/4/17
to idr-discuss
It could work normally and the result like picture below:


 I'll try the pair-end data to test if it can process successfully.

Thank you.

Yu-Cheng Hung 

Jin於 2017年10月1日星期日 UTC+8上午9時15分46秒寫道:

洪彧丞

unread,
Oct 18, 2017, 4:10:59 AM10/18/17
to idr-discuss
Hello, I have tried to process pair-end data (by format of BAM), but an error appeared and the pipeline discontinued immediately like picture below:


Thank you,


Yu-Cheng Hung
  

Jin於 2017年10月1日星期日 UTC+8上午9時15分46秒寫道:

Jin

unread,
Oct 18, 2017, 4:35:54 AM10/18/17
to idr-d...@googlegroups.com
It looks like cross-correlation analysis failed. Can you post a full log? Also, run the following command for debugging info.

$ tail [YOUR_OUT_DIR]/qc/rep*/*.cc.qc

Thanks,

Jin

To unsubscribe from this group and stop receiving emails from it, send an email to idr-discuss+unsubscribe@googlegroups.com.
Message has been deleted

洪彧丞

unread,
Oct 18, 2017, 5:08:53 AM10/18/17
to idr-discuss
Attach is the full log, I had changed the Filename Extension to .txt , please modify .txt to .js and .html when you want to check it.

I run the command and get the information like picture below:


Thank you,


Yu-Cheng Hung 

Jin於 2017年10月18日星期三 UTC+8下午4時35分54秒寫道:
chipseq.bds.20171017_100040_637.dag.txt
chipseq.bds.20171017_100040_637.report.txt

Jin

unread,
Oct 20, 2017, 7:41:22 PM10/20/17
to idr-d...@googlegroups.com
The 3rd column in a cc.qc log file must be positive, but your is -5.

We need more information about the quality of your samples for debugging.

Please run pipelines with `-final_stage xcor` then pipeline will stop before peak calling so no error will occurs.

And then send us (lee...@gmail.com) HTML report (*_report.html) found in your [YOUR_OUT_DIR].

Thanks,

Jin

To unsubscribe from this group and stop receiving emails from it, send an email to idr-discuss+unsubscribe@googlegroups.com.

洪彧丞

unread,
Oct 21, 2017, 3:50:42 AM10/21/17
to idr-discuss
Hello, I had sent my HTML report to you, and my address is romme...@gmail.com.

Thank you,


Yu-Cheng Hung

Jin於 2017年10月21日星期六 UTC+8上午7時41分22秒寫道:

洪彧丞

unread,
Dec 29, 2017, 4:48:09 AM12/29/17
to idr-discuss
Hi Jin,

Recently, I tried to analyze ChIP-seq data about histone modification, Do I just need add "-type histone" to do this?

Thank you,

Yu-Cheng Hung


Jin於 2017年10月21日星期六 UTC+8上午7時41分22秒寫道:

Jin

unread,
Dec 30, 2017, 7:24:10 PM12/30/17
to idr-d...@googlegroups.com
Yes `-type histone` or just `-histone` should work.

Jin

To unsubscribe from this group and stop receiving emails from it, send an email to idr-discuss+unsubscribe@googlegroups.com.
Reply all
Reply to author
Forward
0 new messages