Hi pysam developers and the team!
I am a developer of Hi-C library pairtools for parsing SAM entries of Hi-C mapped data into pairs of 3D genome interactions.
Some time ago we switched to the pysam backend, and found it very fast and simple - it's amazing!
While it serves our needs perfectly, I have recently faced a need in merging two SAM entries into one, which is one of the logistic bottlenecks in the pairtools upgrade that I'm working on.
In other words, there are alignments in the genome originating from the same chromosome, and I want to convert that into a single gapped alignment; or as a single entry with additional information about the "mate".
Do you think this is something that can be done within the pysam framework?
I understand there can be complications due to mapper-specific formatting choices (we typically parse the output of bwa mem), but any case of alignment merging will be a good starting point for me.
I would appreciate any thoughts or suggestions!
Thanks!
Kind regards,
Aleksandra
----------------
Dr. Aleksandra Galitsyna
IMES MIT
77 Mass Ave, Cambridge, MA