I am using tr2aacds.pl (v22may07) for collapsing long-read transcriptome assemblies from different samples in a non-model organism.
$ tr2aacds.pl -NCPU=${task.cpus} -MAXMEM=${mem_MB} -MINAA=${params.minaaLength} -logfile -debug -species=Amex -noutrorf -cdnaseq AmexT_${params.transcriptomeVersion}_stringtie-merge.fa
At earlier steps from my assembly pipeline I determine the strandness of each read, looking to get a stranded transcriptome. By analyzing the output of tr2aacds.pl, I noticed that each FASTA header of the file "inputset/AmexT_v50_stringtie-merge.cds" indicates the strand in which the CDS was found
>AMEX70DD.1.1 type=CDS; aalen=314,28%,partial5-utrbad; clen=3268; strand=+; offs=1-945; codepot=Code/0.0059;
>AMEX70DD.10.1 type=CDS; aalen=116,52%,complete-utrpoor; clen=670; strand=-; offs=611-261; codepot=Code/0.0047;
While this option could allow me to find ORFs in transcripts whose orientation was incorrectly determined, I would like to ask if there is a way to inhibit the search for ORFs in the negative strand of transcripts.
Kind regards,
Salvador