Thank you.
Additionally,
I was considering the maximum lengths. The AMPSphere paper mentioned that the average lengths of AMPs are around 37 residues. However, I have some AMPs with longer sequences. I've used the nfcore/funscan pipeline to predict the AMPs with Macrel
and several other tools. Since the maximum length of AMPs is around 50 residues, would it be best to exclude larger peptides, even if they're predicted as AMPS?
Also, I compared my peptide sequences with AMPSphere peptides using mmseqs, which produced a table with a number of columns. Apart from the e-value column, is there any column I should consider when filtering? I'm having trouble locating the column descriptions.
Finally, are there any additional steps I can take to ensure I have a high-quality set of AMP candidates?
Thank you,
Kind regards,
Sachintha