Hi Jeremy,
Sorry for the late response; I missed your post when it was first submitted.
The most likely reason you don't see a speedup is that you are compiling the program with a 32-bit (x86, not x86-64) compiler and linking it against the 32-bit Yeppp! library.
The Yeppp! library does not include optimized implementations for 32-bit x86; you currently need to target x86-64 to get the full performance benefit.
Regards,
Marat