Hi Sarah,
AM can not create METS file through create_transfer_mets.py from manually normalized files with diachritics characters in filename. So SIP fails.
context:
AM
1.12.1
Unzipped bag with following structure :
K:.
│ bag-info.txt
│ manifest-md5.txt
│ processingMCP.xml
│ bagit.txt
│ tagmanifest-md5.txt
│
├───data
│ └───skip-transfer-directory
│ └───1er Communion CHATEL 2008
│ │ 1ère Communion annonce.doc
│ │
│ ├───Reportage
│ │ DSC_0001.jpg
│ │ DSC_0002.jpg
│ │
│ └───Groupe
│ 30x45.jpg
│ G6SS8038.DCR
│ 20x30.jpg
│ G6SS8038.psd
│ G6SS8038.JPG
│
├───metadata
│ metadata.csv
│
└───manualNormalization
├───access
│ └───skip-transfer-directory
│ └───1er_Communion_CHATEL_2008
│ 1ère Communion annonce.pdf
│
└───preservation
└───skip-transfer-directory
└───1er_Communion_CHATEL_2008
1ère Communion annonce.odt
metadata.csv
parts,dc.title,dc.identifier,bcu.process
objects/,FD-KEHREN-OBERSON-ARCHNUMFR_6932-0128,ARCHNUMFR 6932-0128,hierarchical
stdout
Module createTransferMETS_v1.0
--sipUUID "ff538b91-e7b4-4b9b-9965-7ee4ec28e35e" --basePath "/var/archivematica/sharedDirectory/watchedDirectories/workFlowDecisions/extractPackagesChoice/FD-KEHREN-OBERSON-ARCHNUMFR_6932-0128-ff538b91-e7b4-4b9b-9965-7ee4ec28e35e/" --xmlFile "/var/archivematica/sharedDirectory/watchedDirectories/workFlowDecisions/extractPackagesChoice/FD-KEHREN-OBERSON-ARCHNUMFR_6932-0128-ff538b91-e7b4-4b9b-9965-7ee4ec28e35e/"metadata/submissionDocumentation/METS.xml --basePathString "transferDirectory"
Standard streams
Errors and diagnostics (stderr)
'ascii' codec can't decode byte 0xc3 in position 78: ordinal not in range(128)Traceback (most recent call last):
File "/usr/lib/archivematica/MCPClient/job.py", line 111, in JobContext
yield
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 800, in call
args.xml_file, args.base_path, args.base_path_string, args.sip_uuid
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 102, in write_mets
fsentry_tree.scan()
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 188, in scan
self.build_tree(self.root_path, parent=self.root_node)
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 207, in build_tree
self.build_tree(dir_entry.path, parent=fsentry)
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 207, in build_tree
self.build_tree(dir_entry.path, parent=fsentry)
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 207, in build_tree
self.build_tree(dir_entry.path, parent=fsentry)
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 207, in build_tree
self.build_tree(dir_entry.path, parent=fsentry)
File "/usr/lib/archivematica/MCPClient/clientScripts/create_transfer_mets.py", line 212, in build_tree
db_path = "".join([self.db_base_path, entry_relative_path])
UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 78: ordinal not in range(128)
WorkaroundIf I remove diachritic characters from file name, the SIP passes.
bag structure without diachritic characters below the manualNormalization directory
J:.
│ bag-info.txt
│ manifest-md5.txt
│ processingMCP.xml
│ bagit.txt
│ tagmanifest-md5.txt
│
├───data
│ └───skip-transfer-directory
│ └───1er Communion CHATEL 2008
│ │ 1ère Communion annonce.doc
│ │
│ ├───Reportage
│ │ DSC_0001.jpg
│ │ DSC_0002.jpg
│ │
│ └───Groupe
│ 30x45.jpg
│ G6SS8038.DCR
│ 20x30.jpg
│ G6SS8038.psd
│ G6SS8038.JPG
│
├───metadata
│ metadata.csv
│
└───manualNormalization
├───access
│ └───skip-transfer-directory
│ └───1er_Communion_CHATEL_2008
│ 1ere Communion annonce.pdf
│
└───preservation
└───skip-transfer-directory
└───1er_Communion_CHATEL_2008
1ere Communion annonce.odt
stdout is fine
Module createTransferMETS_v1.0
--sipUUID "b488255e-7366-41d4-9b94-5a4c5f9046f1" --basePath "/var/archivematica/sharedDirectory/watchedDirectories/workFlowDecisions/extractPackagesChoice/FD-KEHREN-OBERSON-ARCHNUMFR_6932-0128-b488255e-7366-41d4-9b94-5a4c5f9046f1/" --xmlFile "/var/archivematica/sharedDirectory/watchedDirectories/workFlowDecisions/extractPackagesChoice/FD-KEHREN-OBERSON-ARCHNUMFR_6932-0128-b488255e-7366-41d4-9b94-5a4c5f9046f1/"metadata/submissionDocumentation/METS.xml --basePathString "transferDirectory"
Let me know if you can reproduce it.
Thank you in advance,
Cheers,
Romain