In deep water, GOFS 3.1 and ESPC-D are broadly similar because they use virtually identical ocean data assimilation. The US Navy does not measure the skill of its global prediction products in coastal regions, because it has higher resolution models for such regions that only use the global products for boundary conditions.
For coastal regions, I would expect ESPC-D to the different, and likely better, that GOFS 3.1, because:
a) Native horizontal resolution is 2x
b) ESPC-D includes tides
c) ESPC-D includes surface pressure forcing
d) GOFS is forced by stand-alone NAVGEM (i.e. with a data atmosphere), ESPC-D is coupled to NAVGEM
NCEP's RTOFS is similar if GOFS 3.1 with different atmospheric forcing and slightly different data assimilation and is still running today.
Alan.