The "default" sound implementation has changed quite a bit over the years. It has been Flash, HTML5 and Web Audio.
The only API which supports low latency, reliable, playback is the Web Audio API, which should be the default on Chrome and Webkit nightlies.
I'm not sure what would introduce an additional delay when comparing older gwt-voices / Flash with newer ones. I did at some point switch from the Motion Twin compiler to the Flex compiler (MXMLC), at the same time bumping the Flash language version. It possible that the newer .swf ActionScript version comes with some additional call overhead.