Hello,
I am trying to better understand your observation. Please correct me if I have misunderstood it.
From the information provided, I understand your observation to be that the Vision API detects only the selectable text in your PDF when a page contains only text, but fails to detect and extract text from images on pages that include both images and text. Is that correct? If so, I compared Page 1 of the two attached documents and could not corroborate this, because Page 1 does not appear to contain any images. Could you please clarify the details you shared?
I can attempt to reproduce the issue, but I first need to understand the behavior being described.