dspam_train(1)
NAME
dspam_train - train a corpus of mail
SYNOPSIS
dspam_train [ username ] [ spam_dir ] [ nonspam_dir ]
DESCRIPTION
dspam_train is used to train and test a corpus of mail (in maildir format). This tool will present each message to dspam for a classification
and then retrain only if the message was incorrect. This provides close
to real-world training and should be used to build pretrained databases. Upon execution, the tool will automatically determine the ratio
of spam:nonspam and train based on that ratio to ensure both corpora
are trained consecutively. This tool can also be used as a test jig to
measure the efficiency and accuracy of a particular corpus against
dspam in a given configuration.
OPTIONS
- [username]
- Specifies the user to train.
- [spam_dir]
- Specifies the pathname to the directory containing the
corpus of spam. Each message should be separate in its
own file. - [nonspam_dir]
- Specifies the pathname to the directory containing the
corpus of nonspam. Each message should be separate in its
own file.
EXIT VALUE
0 Operation was successful.
other Operation resulted in an error.
AUTHORS
Jonathan A. Zdziarski
For more information, see http://www.nuclearelephant.com.