Hey Dean.
* Dean Jackson <
[email protected]> [2003-03-28 14:37+1100]
> Recently I've been disappointed with the performance of
> bogofilter and spamassassin. The combination of the two
> still means I don't get much spam, but they seem to
> be making more mistakes than normal. So, I'm seeking advice.
>
> - Do most people run bogofilter in auto-train mode?
I do, but only in the following case:
- spam and not on my white list.
- not spam and on my white list.
> - Do you use the default settings on bogofilter? I notice
> a lot of spam arriving with a bogorating of 0.85 or above, so
> not being marked as spam.
When I upgraded bogofilter to version 0.11.*, I started having more
spams go though it. So I tweaked the settings.
> - How big are your bogo databases?
hugo@homer ~> ls -l .bogofilter
total 11379
-rw------- 1 hugo www 2969600 Apr 17 21:06 goodlist.db
-rw------- 1 hugo www 8634368 Apr 17 21:06 spamlist.db
> - Is anyone doing any preprocessing on the email before handing
> it off? For example, SpamAssassin notices a message is from
> yahoogroups and thus compensates for the advertising that yahoo
> puts on the bottom. At the moment bogofilter is learning that
> those typically spam-like phrases are not spam.
I do some white list tagging as said above.
> Any tips or tricks appreciated.
I completely dropped SpamAssassin, and am invoking bogofilter as
(assuming that BOGOFILTER_REGISTER is set to yes at some point):
# Bogofilter
:0fw
| bogofilter -pe -o 0.5
# Register mail as spam?
:0c
* BOGOFILTER_REGISTER ?? yes
* ^X-Bogosity: Yes
| bogofilter -s
# Register mail as non-spam?
:0c
* BOGOFILTER_REGISTER ?? yes
* ^X-Bogosity: No
* ^X-HH-Whitelist: YES
| bogofilter -n
:0:
* ^X-Bogosity: Yes
spam
With version 0.11.2, I haven't had any problem with this setting.
Note that you can use bogoutil to remove tokens with a low count or
old tokens, and you may increase the quality of your filtering, but I
haven't done that.
--
Hugo Haas -
http://larve.net/people/hugo/