With the software, I can successfully reproduce the NA12878 assembly. However, when I tried to use 3d-dna to scaffold my de novo assembly, it failed(I successfully got results with LACHESIS). The species is Arabidopsis thaliana. The input files are:
Draft de novo assembly (121M) containing 720 contigs
32G hi-c reads
I ran the juicer pipeline and got a 800M(bwa aln) or a 12G(bwa mem) merged_nodups.txt file. Then I used these files as the input of the 3d-dna pipeline.
When I ran in the haploid mode, it ended up with a FINAL fasta file with one big unsplit scaffold and many small scaffolds.(the stderr shows "Chromosome boundary position not positive! Chromosome splitter failed. Refer to the hic map: continuing without splitting")
While in the diploid mode, it ended up with a FINAL fasta file containing many small scaffolds similar to the input contigs. (no error info)
juicer command:
```
juicer.sh -d /mydirpath/ -D /mydirpath/ \
-S early -r -R 2 -z references/p_ctg.fa\
-s HindIII -y restriction_sites/p_ctg_HindIII.txt\
-p restriction_sites/p_ctg.chrom.sizes
```
3d-dna command:
```
sh 3d-dna/run-pipeline.sh -m haploid -t 10000 -s 2 -c 5 p_ctg.fa at.mnd.txt
sh 3d-dna/run-pipeline.sh -m diploid -t 10000 -s 2 -c 5 p_ctg.fa at.mnd.txt
```
I have noticed that the stderr of juicer pipeline shows:
'Error! The number of reads in the fastqs (46859770) is not the same as the number of reads reported in the stats (46913927), likely due to a failure during alignment.Reads don't add up.'
I checked this forum and found someone had a similar problem with juicer:
https://groups.google.com/forum/#!topic/3d-genomics/aalqyooC9u8
But my readname looks like this:
@ST-E00494:70:H35YGALXX:5:1101:9780:1309 2
I am not sure what causes this failure in juicer.
If any intermediate file is needed to diagnose this issue, please let me know.
Thank you.
To view this discussion on the web visit https://groups.google.com/d/msgid/3d-genomics/46e07865-10ef-4fd9-b2ee-5bd53b2efdc8%40googlegroups.com.--
You received this message because you are subscribed to the Google Groups "3D Genomics" group.
To unsubscribe from this group and stop receiving emails from it, send an email to 3d-genomics+unsubscribe@googlegroups.com.