there are variety of corpus found and each of them give different sets
of words. then what's the significance of using corpus for statistical
NLP?
Ben
The same corpus can be used consequently by many researchers, for example, in word sense
disambiguation. By comparing these disambiguation statistics one can get an idea about any
improvements in the disambiguation application(s), method(s) being used. Naturally, if
different corpora were used it would not give an accurate picture, but some corpora are
better suited for certain purposes than some others.
PsykoPat