There are dozens of audio file formats out there. I guess the most known ones are WAV, MP3, AIFF
Wav is usually not compressed and (usually) created with PCM encoding. I say "usually" because it's not required by the Wav specification:
WAV is a so called container format which doesn't enforce the actual encoding of the data, it just structures the data chunks.
Given the myriads of possibilities, I was suggesting that you'd only support e.g.
as this is probably the most adequate precision for speaker recognition.
Once you're able to get to the actual sound data, you can reuse the code from FileHelper.java to transform those 16 bits unsigned int values into doubles
HTH
Amaury