Hi Aaron,
Sorry about the delay.
Your first guess was right.
I mean extracting sequence from a FASTA file based on BED coordinates
and then using that sequence as the SEQ in the resulting BAM.
The bedToBam converter is great but the problem is that some programs
have trouble dealing with the blank seq field. For example, samtools
tview is great for quickly going over a bam file and bedToBam can
allow you to input gene annotations into it which it otherwise has no
support for (and merge the annotations to your data BAM as a separate
read group). But the program gets confused by the blank seq field and
it looks horrible (although I think the alignments are right).
For some uses putting in the seq info might defeat the purpose of
keeping the file more compact than a GFF (I'm not sure).
Have you tried the BAM annotations with Gbrowse?
In general, I think their needs to be an alternative to the GFF format
for holding annotations. A modified BAM format might do the trick?
Maybe it would need some additional fields though?
Best Regards,
Sean