I have just uploaded HDRVDP-2.2.2, which includes a few examples showing how to run the metric on HDR and SDR images. Perhaps these examples can answer your question. If the predictions still do not look right, the best is to post an example of test and reference imges that cause the problem.
When reporting results in the paper, I strongly recommend mentioning which predictor was used. Unfortunately not every paper does it.
Each predictor has a different purpose. They are all documented in hdrvdp.m.
The text from the home page:
"
If you find the metric useful, please cite the paper below and include the version number, for example, "HDR-VDP-2.2.2 [Mantiuk et al., 2013]". Mention also which predictor (Q, Q_MOS, P_det, etc.) is reported in your paper. The version number should be included in order to make sure that your results can be reproduced. As new data sets become available, we will be updating the HDR-VDP-2 code and its calibration parameters and releasing new versions, but the older version will still be available for download. The HDR-VDP-2 version can be queried by calling the function hdrvdp_version. It will return a fractional number, such as 2.21, which should be interpreted as release 2.2.1.
"