I have been working on an Microsoft Access Validator tool – at least something that could potentially turn into one: https://github.com/exponential-decay/MDBValidator
I have also been keeping notes and updating the Archive Team wiki as I go: http://fileformats.archiveteam.org/wiki/Access
In my work I've come to the conclusion that PRONOM lists two signatures for Access 95 and 97 and two for Access 2000 and 2002 that might want to be merged, respectively. This is because the actual format used between the two versions, 95 and 97, and then 2000/2002 respectively is the same
The difference between the two is essentially how big the database can be and the size of variables that can be held within that newly sized data structure.
The current signatures PRONOM adopts read the Jet version at the beginning of file and then a variably positioned field later on in the database that relates to the version of access that created it. The variable part of the signature for 95/95 for example, begins ASCII: 'AccessVersion' and then the version number within {0-1024} bytes.
The information being read is simply stored by Access in what is termed a ‘Data Page’ when it is created. This data page can be located anywhere within the data stream and is one of many throughout the stream.
What I discovered through the research is that the version of the database doesn’t change between Microsoft Access versions. Jet 3 and 4 are coupled closely with their respective access versions. What changes between access versions is the DLL that reads and writes to the database. I’ve documented more on the archive team wiki. While this is important information it is distinct from the ‘file format’ alone which is what PRONOM should be recording.
The change might also result in modifying the signature to remove the search for the Access strings: 'AccessVersion' and 'A.c.c.e.s.s.V.e.r.s.i.o.n'. DROID might simply look for the Jet DB version.
The advantage of this (but by no means a sole reason for change) is that the current search for the Access version and build number is so variable it could potentially be found up to the very end of a maximum 2GB file (If I understand the max size of a 2kl DB. The Access signature is currently one of the slowest – if not the slowest signature search in the whole of the PRONOM database.
A concern about this is that we might then want to change the records to reflect Jet DB instead of Access. This is probably a more extreme change but perhaps more accurate. What would happen at this point is the two signatures would begin to pick up instances of Microsoft Money and Visual Basic databases – users would need to be informed about this – probably via the PRONOM record. My feeling is that it is more accurate but understand some users would be uncomfortable with such a radical change.
So, two questions:
File format | Compatible with Access 97? | Compatible with Access 2000? | Compatible with Access 2002? |
Access 97 format (created with Access 97) | Yes | No | No |
Access 2000 file format (created with Access 2000) | No | Yes | Yes |
Access 2000 file format (created with Access 2002) | No | Yes (Access 2000 features only) | Yes |
Access 2002 file format (created with Access 2002) | No | No | Yes |
Q. Can I have permission to use the images you've provided on the Wiki? - they will become Creative Commons 0