From: David Bremner Date: Mon, 19 Nov 2012 17:37:36 +0000 (-0400) Subject: update formatting of corpus page X-Git-Url: https://git.cworth.org/git?p=obsolete%2Fnotmuch-wiki;a=commitdiff_plain;h=8a192eff3de0432916828607bdc042fa55ef2115 update formatting of corpus page --- diff --git a/corpus.mdwn b/corpus.mdwn index 5d262bc..af99c23 100644 --- a/corpus.mdwn +++ b/corpus.mdwn @@ -5,26 +5,27 @@ notmuch (or other uses). The contents are as follows -Mail/notmuch-archive +- `Mail/notmuch-archive`: archive of the notmuch mailing list. -archive of the notmuch mailing list -- last updated 2012-11-17 -- converted from mbox with mb2md 3.20. + - last updated 2012-11-17 -Mail/enron + - converted from mbox with mb2md 3.20. -selected data from the EDRM v2 enron data set -- CC Attribution: "ZL Technologies, Inc. (http://www.zlti.com)" -- Downloaded via bittorrent - http://www.searchdaimon.com/community/dataset/ -- massaged with scripts/unpack-enron.sh +- `Mail/enron`: selected data from the EDRM v2 enron data set + + - CC Attribution: "ZL Technologies, Inc. (http://www.zlti.com)" + + - Downloaded via bittorrent + + http://www.searchdaimon.com/community/dataset/ + + - massaged with scripts/unpack-enron.sh Because of the size of the archive, it is not currently available from http://notmuchmail.org, but can be downloaded from: -- http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.1.tar.gz +- [UNB](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.1.tar.gz) A signature from key "815B 6398 2A79 F8E7 C727 86C4 762B 57BB 7842 06AD" -can be found in +can be found [here](http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.1.tar.gz.asc) -- http://tesseract.cs.unb.ca/notmuch/notmuch-email-corpus-0.1.tar.gz.asc