Re: Bayesian Spam Filtering and Bogofilter

Replies:

  • None.

Parents:

Joseph Reagle <[email protected]> writes:

> I'm fairly happy with bogofilter though it does let some stupid spam
> through occasionally, and isn't catching the latest spams which is
> just a non-spammy natural language sentence or two and a
> link.

Those are the ones that bug me the most, and don't know what to do
about them.  

I'm still SA and going to add bogofilter on my mail server.  My plan
is to do all filtering on the mail server, using it's cycles and
having more of my mail processing take place before my mail comes to
me.  For training purposes I'll resend a message to an alias on the
mail server which will procmail into bogofilter.  I'll probably have
the procmail recipe (at least for training on non-spam) look at other
headers (Received, MUA, etc.) to avoid outside influences from
tainting.

Actually I think the known sold (eg [email protected]) and harvested
addresses I will just send to the spam alias bucket.  Might even make
some honey pots for autotraining in this manner.

> (Also, it still misses some html mail, and I'm willing to consider
> that as very probable spam from the start.)

Ditto.

--
Ted Guild <[email protected]>
http://www.guilds.net

HURL: fogo mailing list archives, maintained by Gerald Oskoboiny