#count tRNA transcripts
cat annotations.gff | grep -c -P “\ttRNA\t"
#count snoRNA transcripts
cat annotations.gff | grep -c -P “\tsnoRNA\t"
You can also pull out the Parent feature and count uniq entries to look at genes instead of transcripts. Example:
#count protein coding genes
cat annotations.gff | grep -c -P “\tmRNA\t” | perl -ane ‘/Parent=([^\;\n]+)/; print "$1\n”’ | sort | uniq | grep -c “"
—Carson
Hello Carson, Greetings from Nigeria.
Please how can I extract these matrix from my annotations?
Number of protein-coding genes in the assembled tea plant genome Those with known
proteins and/or domains . Annotation of noncoding RNA
genes ribosomal RNA genes Number of transfer RNA genes, Number transcription factor
genes and simple sequence
Thanks
Nnaemeka Emmanuel Nnadi,Ph.D
Department of Microbiology,
Faculty of Natural and Applied Science,
Plateau State University, Bokkos, Plateau State, Nigeria.
Publications: