Re: Bayesian Spam Filtering and Bogofilter
from
Joseph Reagle <reagle@mit.edu>,
Fri, 13 Dec 2002 08:13:45 -0500
On Friday 13 December 2002 01:52 am, Hugo Haas wrote:
> How much training did it take you to get there?
It took about 5 seconds to do the initial training of good and bad spam:
for f in {A-Spam,junk}; do
sed -e "/X-Spam/d" -e "/X-Bogosity/d" -e "/ * /d" $f | bogofilter -s;
done
for f in {trash,inbox,Friends,W3-Legal,W3-PR,W3-xkms-WG}; do
sed -e "/X-Spam/d" -e "/X-Bogosity/d" -e "/ * /d" $f | bogofilter -n;
done
Then I already had performance exceeding SA. Now, like I said, when I nuke
my junk boxes I automatically take about ~1sec to refresh the training.