Announcing mbox-to-tiddlers


Jeremy Ruston

Jul 4, 2019, 1:03:42 PM
to TiddlyWikiDev
For a personal project I’ve lashed together a simple utility for converting email messages stored in mbox files into individual tiddlers.


Each message in the mbox file is converted into a separate tiddler with the following fields:

• title: message ID
• caption: message subject
• from: message sender
• text: message text
• type: text/plain
• modified: message date (in TiddlyWiki format)
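The field mapping above can be sketched with Python's standard `mailbox` module. This is only an illustration under stated assumptions, not the code of the announced utility: the function names (`tw_date`, `plain_text`, `message_to_tiddler`, `mbox_to_tiddlers`) are hypothetical, the first text/plain part is taken as the message text, and "TiddlyWiki format" is taken to mean the 17-digit UTC timestamp `YYYYMMDDHHMMSSmmm`.

```python
import mailbox
from datetime import timezone
from email.utils import parsedate_to_datetime


def tw_date(dt):
    """Format a datetime as a 17-digit UTC timestamp (YYYYMMDDHHMMSSmmm)."""
    if dt.tzinfo is None:
        # Assumption: dates without a usable zone are treated as UTC.
        dt = dt.replace(tzinfo=timezone.utc)
    dt = dt.astimezone(timezone.utc)
    return dt.strftime("%Y%m%d%H%M%S") + f"{dt.microsecond // 1000:03d}"


def plain_text(msg):
    """Return the first text/plain part of a message, or an empty string."""
    for part in msg.walk():
        if part.get_content_type() != "text/plain":
            continue
        payload = part.get_payload(decode=True)
        if payload is None:
            continue
        charset = part.get_content_charset() or "utf-8"
        try:
            return payload.decode(charset, errors="replace")
        except LookupError:
            # Unknown charset name: fall back to UTF-8.
            return payload.decode("utf-8", errors="replace")
    return ""


def message_to_tiddler(msg):
    """Map one email message onto the tiddler fields listed above."""
    fields = {
        "title": str(msg.get("Message-ID", "")).strip(),
        "caption": str(msg.get("Subject", "")),
        "from": str(msg.get("From", "")),
        "text": plain_text(msg),
        "type": "text/plain",
    }
    if msg.get("Date"):
        fields["modified"] = tw_date(parsedate_to_datetime(msg["Date"]))
    return fields


def mbox_to_tiddlers(path):
    """Convert every message in an mbox file into a tiddler dict."""
    return [message_to_tiddler(m) for m in mailbox.mbox(path)]
```

Calling `mbox_to_tiddlers("export.mbox")` (a hypothetical file name) would return a list of dicts that could then be serialised to JSON for import into a wiki.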

mbox files are a standard export and storage format that is offered by many email clients and services. For testing, I was using mbox files generated by the macOS Mail.app.

As an example, I’ve included in the repository an 18MB TiddlyWiki containing all the messages from the main TiddlyWiki discussion group over the last year. It contains 6,907 tiddlers.

I’ve also copied that example to TiddlySpot for easier access — open the “Recent” tab to see some messages:


The next step is a better framework for browsing and viewing the messages.

Any questions or feedback welcome,

Best wishes

Jeremy

@TiddlyTweeter

Jul 4, 2019, 2:01:54 PM
to TiddlyWikiDev
Ciao Jeremy

Very good and very interesting. I really appreciate this kind of "leverage" to bring stuff together.

One issue is that the titles are very user-unfriendly. Given that we are talking about emails that may NOT have unique Subjects, it is difficult to see a solution to that issue.
BUT maybe show the subject as a sub-title and keep the title (an index, basically) hidden?

Early thoughts
Josiah

Jeremy Ruston

Jul 4, 2019, 2:16:26 PM
to TiddlyWikiDev
Hi Josiah

> Very good and very interesting. I really appreciate this kind of "leverage" to bring stuff together.

Thanks. This sort of thing is surprisingly quick and easy to do when the necessary libraries exist.

> One issue is that the titles are very user-unfriendly. Given that we are talking about emails that may NOT have unique Subjects, it is difficult to see a solution to that issue.
> BUT maybe show the subject as a sub-title and keep the title (an index, basically) hidden?

Yes indeed. The demo doesn’t include any templates/scaffolding to make it easier to browse/use the messages; you’re just seeing the raw tiddlers.

The titles are ugly, but the message ID is the right choice because it is unique, and will in the future allow messages to be threaded together by their “reply-to” field. The subject line of the email is in the “caption” field.
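To illustrate why a unique message ID makes threading straightforward, here is a minimal sketch. It assumes each tiddler is a dict and that replies carry a field holding the parent's message ID; the field name `reply-to` follows the wording above and is an assumption (in raw email headers the parent's ID normally appears in `In-Reply-To`). The function `build_threads` is hypothetical.

```python
from collections import defaultdict


def build_threads(tiddlers, parent_field="reply-to"):
    """Group tiddler titles under the title of the message they reply to.

    Returns (roots, children): roots are titles with no known parent,
    children maps a parent title to the titles of its direct replies.
    """
    titles = {t["title"] for t in tiddlers}
    children = defaultdict(list)
    roots = []
    for t in tiddlers:
        parent = t.get(parent_field)
        if parent in titles:
            children[parent].append(t["title"])
        else:
            # No parent, or the parent is not in this collection.
            roots.append(t["title"])
    return roots, dict(children)
```

Keying on the subject line instead would merge unrelated messages that happen to share a subject, which is the problem raised in the quoted text.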

Best wishes

Jeremy

PMario

Jul 5, 2019, 1:30:20 AM
to TiddlyWikiDev
Hi Jeremy,

The TW contains e-mail addresses in plain text. So if an evil crawler finds them, they can end up in some spam databases, which isn't desirable.

So I think they should be obfuscated or removed.
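One possible way to obfuscate addresses before publishing is sketched below. It is an assumption, not something the utility is known to do: the regular expression is deliberately simple and will not match every valid address, and `obfuscate_emails` is a hypothetical helper. It keeps the first three characters of the local part and the full domain.

```python
import re

# Simplified pattern: local part, "@", then a domain with at least one dot.
EMAIL_RE = re.compile(r"([A-Za-z0-9._%+-]+)@([A-Za-z0-9.-]+\.[A-Za-z]{2,})")


def obfuscate_emails(text):
    """Truncate the local part of anything that looks like an email address."""
    def mask(match):
        local, domain = match.group(1), match.group(2)
        return local[:3] + "...@" + domain
    return EMAIL_RE.sub(mask, text)
```

Message IDs often look like email addresses too, so a helper like this would be applied to the "from" and "text" fields rather than to the title.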

-mario

PMario

Jul 5, 2019, 1:48:49 AM
to TiddlyWikiDev
Hi,

I think it should be the "subtitle" field instead of "caption". The caption is the short title.

The subtitle field is already a growing standard used in several plugins. It is meant for longer titles.

just a thought.
mario

Jeremy Ruston

Jul 5, 2019, 10:32:43 AM
to TiddlyWikiDev
Hi Mario


> The TW contains e-mail addresses in plain text. So if an evil crawler finds them, they can end up in some spam databases, which isn't desirable.
>
> So I think they should be obfuscated or removed.

Thanks for reminding me. I shall remove the data from TiddlySpot and GitHub.

Best wishes

Jeremy.