Sharing the Software / Models Used in Predicting 16S rRNA Copy Number

33 views

Skip to first unread message

Roland Wilhelm

unread,

Jan 25, 2019, 1:00:15 PM1/25/19

to IMG User Forum

Hello there!

I'm building a pipeline to predict the 16S rRNA copy number from draft genome assemblies. The copy number posted to IMG/ER for my test set of genomes is in agreement with many of my predictions, and seems to be more accurate than my own (since many of my predictions are 0 or 1 for organisms we'd expect to have high copy number). So, I tried recapitulating your results using RNAmmer as described in Markowitz et al. (2014). However, these predictions are also discrepant (typically underestimate) with what is posted on IMG/ER in many cases. Would it be possible to provide more detailed information about what parameters / software is being used in the IMG/ER pipeline. I used the vanilla RNAmer HMMs in what could be in accordance with this text from Markowitz: "Ribosomal RNA genes (5S, 16S and 23S) are predicted using hmmsearch against the custom models generated for each type of rRNA in bacteria and archaea (7,8)."

Thanks in advance for any assistance you can provide,

Roli