• Introducing WebPros Cloud - a fully managed infrastructure platform purpose-built to simplify the deployment of WebPros products !  WebPros Cloud enables you to easily deliver WebPros solutions — without the complexity of managing the infrastructure.
    Join the pilot program today!
  • Support for BIND DNS has been removed from Plesk for Windows due to security and maintenance risks.
    If a Plesk for Windows server is still using BIND, the upgrade to Plesk Obsidian 18.0.70 will be unavailable until the administrator switches the DNS server to Microsoft DNS.

SpamAssassin Training and Running System wide BAYES DB

S

SecondPhase

Guest
Hi There, wanted to ask a little about SA training.. when working with sa-learn from the command line it wants to only work with the system user home directory files and cannot deal with the virtual users that qmail uses..

What i did which seemed to work ok, although a lot of work which i haven't finished yet was to run sa-learn as root on a few archives of spam and ham.. then did a sync... and copied the bayes and auto-whitelist files to each qmail users home dir.. (chown popuser) I am now getting bayes test results in the spam subject for these users.

Does anybody know if running a system wide bayes deal would be better than the default way this works? or should i not worry about it now that i'm seeded.

Thanks
 
Hi SecondPhase,

Now that you are "seeded" it will work as well as a site wide. But if I remember correctly, Bayes entries do get aged off the system after a while. So what you are locking yourself into is a cycle of periodic updates to keep the user bayes working.

A site wide has the benefit of being easier to manually update. But the problem with a system wide bayes is that user submitted training will not update into it.

You can check this thread
Semi-automated site wide Bayes training
on the EV1servers forum on implementing a site wide bayes and getting the users to forward spam messages to a spam mailbox on your system which gets fed back into the bayes.

I find that the users in general do NOT do much training, so their personal bayes filters don't get optimised. As a result site wide works better for me.

In fact, one user has pointed out to me that he finds the way training is done on Plesk to be rather clunky. Eg; He has to login to webmail to check his email and if any of them are deemed spam which has been missed, then he has to open a separate window to login to Plesk to do the training.

What I am trying to figure out in the long run is whether it is possible to use BOTH the site wide bayes in conjunction with the personal bayes ......
 
Back
Top