CD-HIT identity %

51 views
Skip to first unread message

YJ Kim

unread,
Dec 9, 2017, 7:56:10 PM12/9/17
to shortbred-users
Hi Jim, 

May I ask you how to specify the percentage of identity using CD-HIT when generating markers using 'shortbred_identify command'? 

I would like to use 100% as used in this paper (https://www.nature.com/articles/nature17672).

Best wishes,

Younjung

Jim Kaminski

unread,
Dec 10, 2017, 11:02:42 PM12/10/17
to YJ Kim, shortbred-users
Hi Younjung,

You can obtain 100% id with cd-hit by using these parameter settings: --clustid 1.0 --qclustid 1.0 

"--clustid" sets the clustering identity used by cd-hit in the initial clustering of your input sequences. "--qclustid" sets the clustering identity for quasi markers (if they exist).

Best,

Jim


--
You received this message because you are subscribed to the Google Groups "shortbred-users" group.
To unsubscribe from this group and stop receiving emails from it, send an email to shortbred-users+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Reply all
Reply to author
Forward
0 new messages