Change in LDA score

396 views
Skip to first unread message

Duy Dinh

unread,
Aug 28, 2013, 3:14:55 PM8/28/13
to lefse...@googlegroups.com
Hi Nicola,

I have another question I want to ask.  Have you change how the LDA score is calculated recently?

I'm asking because I have used the same data to look for significant features.  I noticed that on my old LDA score, all the features have about  .5 more in LDA score.  I took notice because initially our Barnesiella had an LDA score of over 2 but no Barnesiella was reported the second time.  
I lowered the LDA threshold to 1.5 and Barnesiella again showed as significant.  Can there be any other reason for this discrepancy? 

I have attached the two files here.  The first one is the "HIV- and HIV+ no subclass...".  The second is the older one which is "12_12_13 LefSe ..".

Thank you,
Duy

PS sorry about the multiple questions.
HIV- and HIV+ no subclass 14 size font 150dpi LDA.png
12_12_13 LefSe HIV-+only with no subgroup.png

Nicola Segata

unread,
Aug 29, 2013, 12:17:43 PM8/29/13
to lefse...@googlegroups.com
Hi Duy,
 thanks for getting in touch. The LEfSe algorithm has not been changed recently. The last change was a small bug fix for rare cases in which very few samples were present. Some updates might have occurred to some of the libraries LEfSe internally use, but this should not affect the values considerably. Notice also that the LDA computation is based on boostrapping which has a randomization component, but the differences between runs are minimal. When very few samples are present the random variability can be a little higher. So unless you even slightly changed the input it's hard for me to find an explanation.

Notice, however, that the LDA score and the threshold at 2.0 are not an absolute "significance" estimate. The most important aspect of the LDA score is the ranking it induces and the relative differences among biomarkers and both these aspects seem identical in the two plots.

I hope this helps
thanks
Nicola

Duy Dinh

unread,
Sep 3, 2013, 8:30:52 PM9/3/13
to lefse...@googlegroups.com
Thanks Nicola!

lina....@gmail.com

unread,
Aug 1, 2016, 3:41:47 PM8/1/16
to LEfSe-users
Hello Nicola,

I ran into the same issue that Duy had. My collaborator wanted me to change the colors and orders of the two groups, and I figured I could just change the labels and rerun the Lefse analysis, and now I am getting different orders of the organisms within the groups.

I attached below two figures. 

Can this still be attributed to the random effect in the bootstrapping procedure?

Thanks for any insight you might have!

Best,
~Lina
old_h_vs_cd.png
new_h_vs_cd.png

Nicola Segata

unread,
Aug 2, 2016, 11:58:34 AM8/2/16
to lina....@gmail.com, LEfSe-users
Hi Lina,
 it's a bit unusual but yes, I think that is due to the random effect in the bootstrapping procedure.
cheers
Nicola

lina....@gmail.com

unread,
Aug 2, 2016, 1:44:07 PM8/2/16
to LEfSe-users, lina....@gmail.com, nicola...@unitn.it
Thanks for the clarification, I appreciate it!
~Lina
Message has been deleted

capricy gao

unread,
Aug 9, 2016, 7:28:11 AM8/9/16
to LEfSe-users, lina....@gmail.com, nicola...@unitn.it
Hello Nicola,

Could you please explain how the lda scores are calculated. The paper only mentioned the bootstrap steps, but was not clear how it is really calculated. I recently start to use it and wonder how those scores are related to the coefficients of linear discriminants?

Thank you very much for your attention!

C.

capricy gao

unread,
Aug 9, 2016, 7:29:49 AM8/9/16
to LEfSe-users

Hello Nicola,

Could you please explain how the lda scores are calculated. The paper only mentioned the bootstrap steps, but was not clear how it is really calculated. I recently start to use it and wonder how those scores are related to the coefficients of linear discriminants?

Thank you very much for your attention!

C.

Reply all
Reply to author
Forward
0 new messages