my mistake.
It was matched as TRUE, since the 154 exists in DEWEY:
but in this field the librarian has put a value which i should not have taken into account in the first place, because it is totally wrong.
So the TRUE here, is a false positive, which i need to avoid.
Same goes with the
20.949where the DEWEY lies in the digits before the dot character, (probably the librarian meant 020) while the regex selects the 949,
so if i go with this approach, i will not correct all values, but some i will also alter them in a very bad way. So i try to use OpenRefine as a way to visualize with facets,
how the DEWEY data are. Then accordingly, i will create the appropriate regex, so as to capture only for instance the value stored in $1 of the following: (\d\d\d)(\s), or
(\d\d\d)(\.\d\d\)
This way i will be pretty sure that i get real DEWEY numbers extracted, and not garbage put in the DEWEY field by librarians.
Thank you again for your patience to read all this, hope it is clear now what i try to achieve