Hello Alex,
Starting with the general question: For a general pano, the condition
is that all images are taken from exactly the same spot. If the
position changed between two images, you might see an object from one
side in one image and from a different side in another. Objects may
also swap their relative positions (think of trees in a forest). There
is no reasonable or consistent way to merge such images. The corner
case is a purely 2D scene, where such perspective issues don't occur;
in that case the camera may move.
Then there are cases like yours, where the scene is close to 2D, so a
moving camera can still give a reasonably acceptable result. But a
strictly distortion-free pano is impossible.
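To put a rough number on "not too far from 2D", here is a minimal
sketch, assuming an ideal pinhole camera that moves sideways, parallel
to the image plane. A point at a given depth then shifts in the image
by focal length times camera movement divided by depth. The function
names and the numbers (focal length in pixels, movement, depths) are
made up for illustration and are not taken from your images.

```python
def parallax_shift_px(focal_px, baseline_m, depth_m):
    """Image shift in pixels of a point at depth_m when an ideal pinhole
    camera moves sideways by baseline_m (motion parallel to the image plane)."""
    return focal_px * baseline_m / depth_m


def relative_parallax_px(focal_px, baseline_m, near_m, far_m):
    """Difference in image shift between a near and a far point.

    This is the misalignment that remains after the far point has been
    lined up between the two images.
    """
    near_shift = parallax_shift_px(focal_px, baseline_m, near_m)
    far_shift = parallax_shift_px(focal_px, baseline_m, far_m)
    return near_shift - far_shift


if __name__ == "__main__":
    focal_px = 3000.0   # hypothetical focal length in pixels
    baseline_m = 0.5    # hypothetical camera movement between two shots

    # Flat scene: every point at the same depth, so no relative parallax.
    print(relative_parallax_px(focal_px, baseline_m, 10.0, 10.0))

    # Nearly flat scene: some structure sticks out 0.5 m towards the camera.
    print(relative_parallax_px(focal_px, baseline_m, 9.5, 10.0))
```

For the flat scene the relative shift is zero, so the images can be
aligned exactly. As soon as the depths differ the two points shift by
different amounts, and no single alignment fits both.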
I gave this a try, see the attached pto. The result may be acceptable
depending on what you want to do with it. It's certainly much better
than your reference.
The steps were to place all control points manually, because the
control point finder fails on grid-like structures (I think that is
what went wrong in your attempt: there were many wrong control points,
and all images were optimised to the same position). The project also
contains a few vertical and horizontal line control points. I placed
all points on the left panel array and optimised translation. One could
improve it further with more control points and a few more horizontal
lines. One could also repeat the same for the right half to get a good
version of that.
best regards, lukas wirz