Analyzing Your Own Mail Data

The Enron mail data makes for great illustrations in a chapter on mail analysis, but youâll almost certainly want to take a closer look at your own mail data. Fortunately, many popular mail clients provide an âexport to mboxâ option, which makes it pretty simple to get your mail data into a format that lends itself to analysis by the techniques described in this chapter. For example, in Apple Mail, you can select some number of messages, pick âSave Asâ¦â from the File menu, and then choose âRaw Message Sourceâ as the formatting option to export the messages as an mbox file (see FigureÂ 3-7). A little bit of searching should turn up results for how to do this in most other major clients.

FigureÂ 3-7.Â Most mail clients provide an option for exporting your mail data to an mbox archive

If you exclusively use an online mail client, you could opt to pull your data down into a mail client and export it, but you might prefer to fully automate the creation of an mbox file by pulling the data directly from the server. Just about any online mail service will support POP3 (Post Office Protocol version 3), most also support IMAP (Internet Message Access Protocol), and Python scripts for pulling down your mail arenât very hard to whip up. One particularly robust command-line tool that you can use to pull mail data from just about anywhere is getmail ...

Get Mining the Social Web now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.

Start your free trial

Mining the Social Web by Matthew A. Russell

Analyzing Your Own Mail Data

Don’t leave empty-handed

It’s yours, free.

Check it out now on O’Reilly