X-Git-Url: https://git.cworth.org/git?p=notmuch-wiki;a=blobdiff_plain;f=corpus.mdwn;h=89821b980f96f1f26af209bb19038cf46f50754d;hp=11d6a7449294312e37bbe51d2f23c2e9810c9596;hb=bbdcbf63583351af9c0954cb92316f5bb3a8509a;hpb=887275e83eb835513203ba1ce44f16fc389bf764 diff --git a/corpus.mdwn b/corpus.mdwn index 11d6a74..89821b9 100644 --- a/corpus.mdwn +++ b/corpus.mdwn @@ -1,6 +1,7 @@ -## Notmuch Email Corpus +[[!img notmuch-logo.png alt="Notmuch logo" class="left"]] +# Notmuch Email Corpus -A corpus of about 108k messages is available for performance testing of +A corpus of about 108k messages is available for performance testing of notmuch (or other uses). The contents are as follows @@ -14,11 +15,11 @@ The contents are as follows - `Mail/enron`: selected data from the EDRM v2 enron data set - CC Attribution: "ZL Technologies, Inc. (http://www.zlti.com)" - + - Downloaded via bittorrent http://www.searchdaimon.com/community/dataset/ - + - massaged with scripts/unpack-enron.sh (in the corpus tarball) The corpus is gpg signed by David Bremner with key fingerprint: @@ -27,8 +28,5 @@ The corpus is gpg signed by David Bremner with key fingerprint: You can download the corpus from -- [notmuchmail.org](http:///notmuchmail.org/releases/notmuch-email-corpus-0.2.tar.xz) [signature](http:///notmuchmail.org/releases/notmuch-email-corpus-0.2.tar.xz.asc) -- [UNB](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.2.tar.xz) [signature](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.2.tar.xz.asc) -- [Corpus 0.3](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz) [signature](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz.asc) - - +- [notmuchmail.org](https://notmuchmail.org/releases/notmuch-email-corpus-0.3.tar.xz) [signature](https://notmuchmail.org/releases/notmuch-email-corpus-0.3.tar.xz.asc) +- [UNB](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz) [signature](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz.asc)