I put the training data set and testing data set into RMSE calculation at the same time. The answer is around 0.5 eV/atom, instead of 2 meV/atom
I checked your database and the output file of prediction and found that the structure 6909 (structure6909.xsf) have an extremely large error between database and prediction value.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Cartesian atomic coordinates (input) and corresponding atomic forces:
x y z Fx Fy Fz
(Ang) (Ang) (Ang) (eV/Ang) (eV/Ang) (eV/Ang)
--------------------------------------------------------------------------------------
Ti -0.000220 1.283603 1.456099 -0.000336 0.096083 -0.470577
Ti 2.567306 -0.428006 4.368296 0.000336 -0.096083 0.470576
O 0.000054 0.427836 0.948746 0.000283 -0.007268 0.038711
O 2.567032 0.427761 4.875649 -0.000283 0.007268 -0.038710
O -0.000024 -0.427908 3.860944 0.000643 -0.032689 2.321371
O 2.567109 1.283504 1.963451 -0.000643 0.032690 -2.321371
Cohesive energy : 17.15281892 eV
Total energy : -4969.12457650 eV
Mean force (must be zero) : -0.000000 -0.000000 -0.000000
Mean absolute force : 0.000421 0.045347 0.943553
Maximum force : 0.000643 -0.032689 2.321371
RMS force : 1.368948
The maximum force is acting on atom 5.
All forces are given in eV/Angstrom.
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
The error is 161.237 eV or 26.873 eV/atom. If I use this value to calculate the RMSE, the result will never be smaller than 0.3 eV/atom.
Do you know why this structure has that kind of large error? The structure isn't on the testing data set, hence, it should be included inside the training data set, right? (Actually, not only structure 6909, but also some other structures have the same situation.)
Best Regards,
Shih-Han Wang