There is no merging functionality for BEDPE files. I think your best option would be to use pairToPair to find junctions that overlap on both ends (the default), and then write a script around the output to merge the coordinates. I recognize that this is a non-trivial problem, as I have faced it myself. Essentially, what you are try to do is equivalent to structural variant clustering algorithms such as VariationHunter or my own Hydra. If the former solution is inadequate, you could consider using Hydra on the BEDPE input (you'd have to add a few dummy columns) to cluster the junctions.
Best,
Aaron
cat merged.bed1 | bedmap --echo --echo-ref-name --echo-map-id-uniq --delim "\t" -
cat merged.bed2 | bedmap --echo --echo-ref-name --echo-map-id-uniq --delim "\t" -
3. sort the 2 bed files according to the id column and then check
any common ids were overlapped in the last column (added by bedmap)
Any simpler way to do it?
Also, in this approach I have to make sure that the same row in the bedpe format always have the
left most breakpoint in the first block and the right most breakpoint in the second block.
Moreover, bedmap does not take strand into consideration.
I actually have to split one bedpe to 4 bed files based on strandness (++,+-,-+,--)
I realized that it is a common task (find recurrent svs in all samples) to do when one has the sv calls.
I am using speedseq to get the sv calls and convert them to bedpe.
what's your suggestion to do this?
Thanks very much!
Ming
--
You received this message because you are subscribed to the Google Groups "bedtools-discuss" group.
To unsubscribe from this group and stop receiving emails from it, send an email to bedtools-discu...@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.