Per the FITS standard, the first pixel is coordinate (1,1). FITS does not say whether pixel (1,1) should be displayed as the *bottom*-left or *top*-left. If you read a FITS image into a python array, array element [0,0] is pixel (1,1).
cheers,
--dustin