Reduction in size of 25_04

25 views
Skip to first unread message

Daniel Esteban Palma Igor

unread,
May 15, 2025, 3:43:06 PMMay 15
to MIMt database questions
Hi, I have noted that there is a considerable reduction in the number of sequences in the version 25_04 but an increase in the number of species compared to previous releases, have you change your filtering process? It may be a good idea to document this type of changes somewhere.
Regards,
Daniel Palma

‍이재윤((서울)교육지원팀)

unread,
Jun 16, 2025, 11:24:37 AMJun 16
to MIMt database questions
I have the same question.

2025년 5월 16일 금요일 오전 4시 43분 6초 UTC+9에 Daniel Esteban Palma Igor님이 작성:

Antonio Muñoz Mérida

unread,
Jul 10, 2025, 6:44:49 AMJul 10
to MIMt database questions

In the last versions of MIMt we applied an extra filter to remove all sequences identical (100% of identity), total or partially, that means that if a sequence is shorter but identical to another one that is longer, we keep the longer one and in the side file _redundancy.txt we put the ID of the sequence that remains, and the species that it represents, in this case both of them, the longer (that remains in the database) and the shorter belonging to a different species.

Best regards
Reply all
Reply to author
Forward
0 new messages