There are a few more questions I need some clarification on.
a. In the Quantization Scheme, every real number is represented as r=s(q-z)
so for softmax=> {exp(Beta*x)} /{Sum(exp(Beta*x))}
would mean that it should be viewd as
{exp(Beta*ScalF_ip(Q-ZP_ip))}/{Sum(exp(Beta*ScalF_ip(Q-ZP_ip)))}
But
here, there is no input argument for zero point and scaling factor to be seen. why is that so? Can we neglect these Quantization parameters? In the case of Logistics, ZP is seen
b.Does the 8 in
#L2682 mean that op_scaling_factor is taken care of ( 256=2^8 -> should be multiplied to the result, right? followed by an addition of -128[op_zp])?
c.And how is 'beta' value converted to beta_multiplier and left_shift value? Only the float value is obtained from the .tflite file, is there any preprocessing of the input parameters taken place?
d.Shouldn't I convert the ip_ScalF into the same multiplier & shift value? If so how is it done?
e.the output from the
source code seems to be in the range(0,255), but according to the
spec, it should be in range (-128,127). Am I missing something?