Readpst

336 views
Skip to first unread message

Anthony Cocciolo

unread,
Mar 6, 2014, 5:33:33 PM3/6/14
to archiv...@googlegroups.com
Hello all,

I noticed that  Archivematica's default formats policy converts from PST to MBOX but notice that the commands and rules for PST to MBOX are not setup by default.  I was wondering if anyone had any luck creating commands to convert from PST to MBOX? 

I have used this command:
Command: readpst -r "%inputFile%" -o "%outputDirectory%"
Ouput location: %outputDirectory%

The only problem is that Archivematica seems to have trouble with the multiple folders/files that get generated from this command.

Readpst is otherwise working properly.

Any suggestions would be appreciated,
Anthony
--
Anthony Cocciolo, Ed.D.
Assistant Professor
Pratt Institute, School of Information and Library Science
144 West 14th Street, Room 604D
New York, NY, 10011-7301
+1 212-647-7702
acoc...@pratt.edu
http://www.thinkingprojects.org

sam...@willamette.edu

unread,
Apr 16, 2015, 4:58:16 PM4/16/15
to archiv...@googlegroups.com
Hi Anthony - Did you ever get this resolved?  We've just started receiving some .pst files and are wondering if we will need to convert them to mbox before ingesting (as you noted, there wasn't a rule set up by default to transform them.)

Also, does anyone have a recommendation for a viewer for mbox files?  We were trying Thunderbird, but it requires a valid email account to get set up (as far as we can tell.)   Any recommendations much appreciated. 

sam...@willamette.edu

unread,
Apr 28, 2015, 12:20:08 PM4/28/15
to archiv...@googlegroups.com
I've been playing around more with ingesting PST files.   I set up a normalization rule and command as Anthony describes below, and it appears to work perfectly until the verification step.   I.e. when looking at the normalization report, it shows the extracted mbox folders, but then fails the verification step with "output file does not exist".  I tried setting the Verification Command to None, but it still fails.     I also tried an Extraction rule, on the off chance that it would then expect multiple rather than a single file, but still no luck.        

Any ideas how to make this conversion from PST to MBOX work successfully on ingest?  For now I'm just going to use readpst at the command line to convert to mbox before ingesting....

Anthony Cocciolo

unread,
Apr 30, 2015, 9:54:12 AM4/30/15
to archiv...@googlegroups.com
Hi Samato,

This was the command that I ended up using for normalizing PST Files:

mkdir -p "%outputDirectory%%fileName%-mbox" && readpst -wr "%inputFile%" -o "%outputDirectory%%fileName%-mbox/" && zip -rm "%outputDirectory%%prefix%%fileName%%postfix%.zip" "%outputDirectory%%fileName%-mbox"

Since the readpst command doesn't create a single MBOX file but multiple files/directories (mirroring the directories in the PST), then all those files and directories are put into a single ZIP file.

For reading the output, I was using Windows MBOX Viewer - http://sourceforge.net/projects/mbox-viewer/ .

All the best,
Anthony

--
You received this message because you are subscribed to the Google Groups "archivematica" group.
To unsubscribe from this group and stop receiving emails from it, send an email to archivematic...@googlegroups.com.
To post to this group, send email to archiv...@googlegroups.com.
Visit this group at http://groups.google.com/group/archivematica.
For more options, visit https://groups.google.com/d/optout.



--
Anthony Cocciolo, Ed.D.
Associate Professor
Reply all
Reply to author
Forward
0 new messages