MultEval V0.4.2 Now Available

Jonathan Clark

Jan 3, 2012, 10:02:24 AM

to multeval...@googlegroups.com

V0.4.2 - 1/3/2012

* Multi-threaded bootstrap resampling and approximate randomization significance tests (large time savings when evaluating many systems with many optimizer runs)

* Fixed a bug in n-best scoring that caused oracle *submetrics* such as precision and recall to be reported incorrectly for oracle hypotheses (the overall scores for BLEU, TER, METEOR, etc. were *not* affected)

* Updated to Guava V11
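To illustrate the idea behind multi-threaded bootstrap resampling, here is a minimal Python sketch. It is not MultEval's code: the function names, the use of a plain mean over per-segment scores, and the percentile interval are all assumptions made for the example. The point it shows is that bootstrap replicates are independent, so they can be split into chunks, each drawn by its own worker with its own random number generator.

```python
import random
from concurrent.futures import ThreadPoolExecutor


def resample_means(scores, n_samples, seed):
    """Draw n_samples bootstrap replicates and return the mean of each one."""
    rng = random.Random(seed)  # private RNG per task, so workers share no state
    n = len(scores)
    means = []
    for _ in range(n_samples):
        total = 0.0
        for _ in range(n):
            total += scores[rng.randrange(n)]  # sample with replacement
        means.append(total / n)
    return means


def bootstrap_ci(scores, n_samples=1000, n_threads=4, alpha=0.05, seed=0):
    """Percentile confidence interval, with replicates split across workers."""
    base, extra = divmod(n_samples, n_threads)
    sizes = [base + (1 if i < extra else 0) for i in range(n_threads)]
    with ThreadPoolExecutor(max_workers=n_threads) as pool:
        futures = [
            pool.submit(resample_means, scores, size, seed + i)
            for i, size in enumerate(sizes)
        ]
        means = sorted(m for f in futures for m in f.result())
    lower = means[int((alpha / 2) * len(means))]
    upper = means[min(len(means) - 1, int((1 - alpha / 2) * len(means)))]
    return lower, upper


if __name__ == "__main__":
    # Hypothetical per-segment scores, for illustration only.
    segment_scores = [i / 100 for i in range(100)]
    print(bootstrap_ci(segment_scores))
```

In CPython the global interpreter lock keeps pure-Python threads from running in parallel, so this sketch shows the structure rather than the speedup; a process pool would be needed to see real gains there. A real MT metric would also resample sufficient statistics and recompute the corpus-level score rather than average per-segment scores.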

Some timing results with 4 threads vs. 1 thread, using the recent multi-threading improvements on the 3 example systems with 3 optimizer runs each:

Stage                       4 threads   1 thread
Load METEOR                 25.5s       25.5s
Collect Sufficient Stats    32.7s       86.7s
Bootstrap Resampling        70.9s       189s
Approximate Randomization   122s        336s
TOTAL                       4m 13s      10m 39s
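For anyone who wants the speedup factors implied by the timings above, the short Python sketch below works them out. It assumes the first number in each pair is the 4-thread time and the second is the 1-thread time, following the "4 threads vs 1" wording; the dictionary and function names are just for illustration.

```python
# (4-thread seconds, 1-thread seconds) per stage, copied from the timings above.
timings = {
    "Load METEOR": (25.5, 25.5),
    "Collect Sufficient Stats": (32.7, 86.7),
    "Bootstrap Resampling": (70.9, 189.0),
    "Approximate Randomization": (122.0, 336.0),
    "TOTAL": (4 * 60 + 13, 10 * 60 + 39),
}


def speedup(four_threads, one_thread):
    """How many times faster the 4-thread run is than the 1-thread run."""
    return one_thread / four_threads


speedups = {stage: speedup(*pair) for stage, pair in timings.items()}

for stage, factor in speedups.items():
    print(f"{stage:<26} {factor:.2f}x")
```

This gives roughly 2.65x for collecting sufficient statistics, 2.67x for bootstrap resampling, 2.75x for approximate randomization, and 2.53x overall. Loading METEOR stays at 1.00x, which pulls the overall figure below the per-stage ones.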

V0.4.1 - 12/30/2011

* Fixed a sizing bug reported by John DeNero, which caused MultEval to crash

* Removed "static" keyword from several places within the TER library to make it more amenable to multi-threading
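As a rough illustration of why static state gets in the way of multi-threading, here is a small Python sketch. The two classes are hypothetical and are not taken from the TER library. A class-level attribute plays the role of a Java static field: every instance sees the same buffer, so two scorers running at once would overwrite each other's working data. Moving the buffer into the instance removes the sharing.

```python
class SharedScratchScorer:
    # Class-level attribute: one list shared by all instances,
    # analogous to a Java static field.
    scratch = []

    def score(self, tokens):
        self.scratch.clear()
        self.scratch.extend(tokens)
        return len(self.scratch)


class PerInstanceScorer:
    def __init__(self):
        # Each instance owns its own buffer, so separate threads can each
        # use their own scorer without touching shared state.
        self.scratch = []

    def score(self, tokens):
        self.scratch.clear()
        self.scratch.extend(tokens)
        return len(self.scratch)


shared_a, shared_b = SharedScratchScorer(), SharedScratchScorer()
shared_a.score(["the", "cat"])
shared_b.score(["dog"])
# shared_a's buffer was overwritten by shared_b's call.
shared_buffers_are_same = shared_a.scratch is shared_b.scratch

own_a, own_b = PerInstanceScorer(), PerInstanceScorer()
own_a.score(["the", "cat"])
own_b.score(["dog"])
own_buffers_are_same = own_a.scratch is own_b.scratch
```

The example runs on a single thread on purpose: the sharing is visible even without concurrency, and with threads it turns into a race.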

Jonathan Clark

Aug 27, 2012, 4:27:50 PM

to multeval...@googlegroups.com

V0.4.3 - 8/27/2012

* Upgraded to Meteor 1.4 (also released 8/27/2012)

- Note: This change affects only previously unsupported languages, for which it enables new stemmers
