1 [[!img notmuch-logo.png alt="Notmuch logo" class="left"]]
4 A corpus of about 108k messages is available for performance testing of
5 notmuch (or other uses).
7 The contents are as follows
9 - `Mail/notmuch-archive`: archive of the notmuch mailing list.
11 - last updated 2012-11-17
13 - converted from mbox with mb2md 3.20.
15 - `Mail/enron`: selected data from the EDRM v2 enron data set
17 - CC Attribution: "ZL Technologies, Inc. (http://www.zlti.com)"
19 - Downloaded via bittorrent
21 http://www.searchdaimon.com/community/dataset/
23 - massaged with scripts/unpack-enron.sh (in the corpus tarball)
25 The corpus is gpg signed by David Bremner with key fingerprint:
27 815B 6398 2A79 F8E7 C727 86C4 762B 57BB 7842 06AD
29 You can download the corpus from
31 - [notmuchmail.org](https://notmuchmail.org/releases/notmuch-email-corpus-0.3.tar.xz) [signature](https://notmuchmail.org/releases/notmuch-email-corpus-0.3.tar.xz.asc)
32 - [UNB](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz) [signature](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz.asc)