Dear Laurence,
Thank you for your incredibly swift reply.
I've now tried reworking the calculation in various ways to include the (c-a) and (d-b) terms, and the following method gets me really close to the results AntConc gives me, but I'm still slightly off.
That is:
E1 = c*(a+b) / (c+d)
E2 = d*(a+b) / (c+d)
LL1 = 2*((a*log(a/E1)) + (b*log(b/E2)))
E3 = (c-a)*(a+b) / (c+d-a-b)
E4 = (d-b)*(a+b) / (c+d-a-b)
LL2 = 2*((a*log(a/E3)) + (b*log(b/E4)))
LL = (LL1+LL2)/2
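For reference, the steps above can be written as a short Python sketch. It assumes that a and b are the keyword's frequencies in the two corpora and that c and d are the corpus sizes (the usual keyness notation, though not spelled out above), that log is the natural logarithm, and that a and b are both greater than zero. The function name is made up for illustration, and the code only transcribes this attempt; it is not AntConc's actual calculation.

```python
import math


def averaged_ll_attempt(a, b, c, d):
    """Transcribe the attempted calculation; returns (LL1, LL2, LL).

    Assumed meaning (not stated in the post): a, b = keyword frequency in
    corpus 1 and corpus 2; c, d = total size of corpus 1 and corpus 2.
    Requires a > 0 and b > 0, because log(0) is undefined.
    """
    # Expected frequencies based on the full corpus sizes.
    e1 = c * (a + b) / (c + d)
    e2 = d * (a + b) / (c + d)
    ll1 = 2 * (a * math.log(a / e1) + b * math.log(b / e2))

    # Expected frequencies based on the corpus sizes minus the keyword counts.
    e3 = (c - a) * (a + b) / (c + d - a - b)
    e4 = (d - b) * (a + b) / (c + d - a - b)
    ll2 = 2 * (a * math.log(a / e3) + b * math.log(b / e4))

    # The attempt averages the two scores.
    return ll1, ll2, (ll1 + ll2) / 2
```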
To be perfectly honest with you, this (intuitively) does not seem right to me at all, but the results I'm getting with it are the closest of all my attempts: depending on the keyword I double-check against the AntConc results, it's either correct or off by at most 0.02.
I've tried so many different ways, but I have no idea where I'm going wrong.
I would love to include this calculation in my MA thesis - that's why I'm trying so hard to figure it out. I'd appreciate any and all further help.
Thank you.
Best,
Sebastian