X-Git-Url: https://git.cworth.org/git?a=blobdiff_plain;f=corpus.mdwn;h=8834572a4c54cd885f140e49450d7d75bb98c80d;hb=HEAD;hp=ed8a188672c0c92e44ab7b4126bcc3c68f347f36;hpb=77b0a8c1af8a0e4afeb3f8ce2a712e8939aee961;p=notmuch-wiki diff --git a/corpus.mdwn b/corpus.mdwn index ed8a188..8834572 100644 --- a/corpus.mdwn +++ b/corpus.mdwn @@ -1,7 +1,7 @@ [[!img notmuch-logo.png alt="Notmuch logo" class="left"]] # Notmuch Email Corpus -A corpus of about 108k messages is available for performance testing of +A corpus of about 209k messages is available for performance testing of notmuch (or other uses). The contents are as follows @@ -22,11 +22,12 @@ The contents are as follows - massaged with scripts/unpack-enron.sh (in the corpus tarball) +- `Mail/lkml`: lkml messages 1000000 to 1100000 from the gmane archive + The corpus is gpg signed by David Bremner with key fingerprint: 7A18 807F 100A 4570 C596 8420 7E4E 65C8 720B 706B You can download the corpus from -- [notmuchmail.org](https://notmuchmail.org/releases/notmuch-email-corpus-0.3.tar.xz) [signature](https://notmuchmail.org/releases/notmuch-email-corpus-0.3.tar.xz.asc) -- [UNB](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz) [signature](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.3.tar.xz.asc) +- [notmuchmail.org](https://notmuchmail.org/releases/notmuch-email-corpus-0.5.tar.xz) [signature](https://notmuchmail.org/releases/notmuch-email-corpus-0.5.tar.xz.asc)