Hello !
I am encountering a weird behavior when importing XML (EAD 2002) files into AtoM 2.4.
I'd like to note that I am setting up AtoM for a colleague, and that I don't really know the software in depth. So the "problem" may very well be related to a configuration parameter.
We want to import some XML files into AtoM. This is the first time these files are imported, and unfortunately, some entries are incorrect. We can easily fix them by reading the job log.
However, the behavior of these jobs is not very user-friendly so far due to the following issue:
On the first try, I import my XML file, and the job returns the following log :
Aug 08 07:05:07 atomefa2 php[24699]: 2018-08-08 00:05:07 > Job started.
Aug 08 07:05:07 atomefa2 php[24699]: 2018-08-08 00:05:07 > Importing XML file: AMATH5.xml.
Aug 08 07:05:07 atomefa2 php[24699]: 2018-08-08 00:05:07 > Indexing imported records.
Aug 08 07:05:07 atomefa2 php[24699]: 2018-08-08 00:05:07 > Update type: import-as-new
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > libxml error 504 on line 5 in input file: Element eadheader content does not follow the DTD, expecting (eadid , filedesc , profiledesc? , revisiondesc?), got (eadid itemdesc proitemdesc )
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > libxml error 534 on line 7 in input file: No declaration for element itemdesc
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > libxml error 534 on line 23 in input file: No declaration for element proitemdesc
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > Creating a new record: Amathonte
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > Creating a new record: Rapports et correspondance
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > Creating a new record: Fouilles Amathonte 1969-1996.
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > Import complete.
Aug 08 07:05:08 atomefa2 php[24699]: 2018-08-08 00:05:08 > Job finished.
On the 2nd try, I tried to import the exact same file for this example. Note the log:
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Job started.
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Importing XML file: AMATH5.xml.
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Indexing imported records.
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Update type: import-as-new
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 504 on line 5 in input file: Element eadheader content does not follow the DTD, expecting (eadid , filedesc , profiledesc? , revisiondesc?), got (eadid itemdesc proitemdesc )
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 534 on line 7 in input file: No declaration for element itemdesc
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 534 on line 23 in input file: No declaration for element proitemdesc
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 504 on line 5 in input file: Element eadheader content does not follow the DTD, expecting (eadid , filedesc , profiledesc? , revisiondesc?), got (eadid itemdesc proitemdesc )
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 534 on line 7 in input file: No declaration for element itemdesc
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 534 on line 23 in input file: No declaration for element proitemdesc
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 504 on line 5 in input file: Element eadheader content does not follow the DTD, expecting (eadid , filedesc , profiledesc? , revisiondesc?), got (eadid itemdesc proitemdesc )
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 534 on line 7 in input file: No declaration for element itemdesc
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > libxml error 534 on line 23 in input file: No declaration for element proitemdesc
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Creating a new record: Amathonte
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Creating a new record: Rapports et correspondance
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Creating a new record: Fouilles Amathonte 1969-1996.
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Import complete.
Aug 08 07:05:51 atomefa2 php[24699]: 2018-08-08 00:05:51 > Job finished.
Seems to me that the log is not cleared and previous one is carried into the new one. The only way I found so far to get only the latest errors is to restart the atom-worker between each import.
When it comes to our case, our first XML file logged a few hundred of errors due to a misspelled attribute that was almost everywhere.
We fixed them, and were unable to understand why the job kept outputting the same errors.
So, is that normal ? If it's the case, is there a way to ask Atom to "clear" the log of a given file before a new import ?
Thanks a lot,
Quentin