sendmail_bayes(8) Expaminator sendmail_bayes(8) NAME sendmail_bayes SYNOPSIS sendmail_bayes [-D level] [-h] [-v] [-c config-file] DESCRIPTION A Bayesean sendmail 'milter' for spam sendmail_bayes is a sendmail 'milter' which normally runs as a daemon. Sendmail communicates with milters through either a unix or internet-domain socket, which must be specified in both sendmail's ".cf" file and send mail_bayes's configuration file. Command-line options: -D Set the Debug level; 'level' should be a positive inte ger 0..50. Higher values for debug-level cause more internal variables to be dumped to stdout. This is not associated with the error-logging level, ´debug´; '-D' should not be used during normal mail processing. -X Don't daemonise; useful only for debugging. -h Help; print the command-line options and exit. -v Version; print the version number and exit. -c specify a Configuration file. If '-c config-file' is omitted, the environment variable 'SPAMCONFIG' is used. CONFIGURATION sendmail_bayes' configuration file is composed of simple keyword-value pairs, one pair per line. Keywords are not case-sensitive; keyword and value are separated by one or more spaces or tabs. A comment symbol, '#' anywhere on a line causes all following text to be ignored. send mail_bayes will stop scanning for a keyword at the first occurence in the file. This configuration file is shared by the database mainte nance and testing utilities. approval_message <headername> optional; default is 'X-judged-non-spam' When a message has been judged legitimate, a header is added to it containing an indication that it has been passed through the filter. The host name, which may be overridden by 'force_hostname', is appended automatically. force_hostname <hostname> optional; defaults to the host's fully qualified domain name. ´force_hostname´ substitutes the specified string for the hostname in any headers the filter adds to the message. (Currently, only one header is added, an ´accep tance´ header, if the message is judged not to be spam.) force_domainname <domain-name> optional; defaults to the host's DNS domain name. When a ´rcpt to:´ command is received, and a user-hash exists, the filter will append the hosts domain to user names which are not fully qualified before looking up the name in the user-hash. ´force_domainname´ will cause the specified string to be appended, instead. guess <number> optional; defaults to 0.4 This supplies a probability to any word found in a message which cannot be found in the probability hash. Valid val ues are in the range 0..1.0 logfile <pathname> required. Normally, will be the same logfile used by send mail. loglevel <name> optional; defaults to "INFO". Valid values are, in order of decreasing severity, "EMERG", "ALERT", "CRIT", "ERROR", "WARNING", "NOTICE", "INFO", and "DEBUG". number_to_consider <integer> optional; defaults to 100 When all words in a message have been assigned a proba bilty, the probabilities are sorted according to the abso lute value of their difference from 0.5; ´number_to_con sider´ is the number of the highest-ranking probabilities to use in the calculation of "spam probability". Low values for number_to_consider may result in unreliable judgements; high values impose a slightly higher cpu load. probabilityhash <filename> required. This is the name of the probability hash (or more usually, the symbolic link to it) produced by ´make_new_database´. Only a simple filename is required; the directory is supplied by ´spamdatadir´. sendmail_listen <address-family>:<pathname | portnumber> required. The internet-domain port or unix-domain socket to be used by the sendmail milter library to communicate with sendmail. For a unix domain socket, use: sendmail_listen unix:/full/pathname/of/socket , or: sendmail_listen local:/full/pathname/of/socket For an internet-domain connection, use: sendmail_listen inet:portnumber This must, of course, match exactly the ´Xfilter´ configu ration line in sendmail.cf. spamdatadir required. The directory containing the probability hash and the optional username hash. spamlimit <directory> optional; defaults to 0.999 The value which the estimated spam probability must exceed before being condemned and rejected as spam. user <username | userid> optional; defaults to the userid of the process running the filter. If found, the filter will setuid to this user before entering the milter library code. username_db <filename> optional; the user ´opt-in´ database; no default. If no username hash is specified, the filter will be used on all messages, regardless of recipient. If one is specified, filtering will be done only for mes sages destined for names found in the hash. ENVIRONMENT $SPAMCONFIG can be used to supply the full pathname of the configuration file. (The ´-c config-file´ option will override $SPAMCONFIG) FILES Required: a configuration file and probability hash Optional(but recommended): a username hash COPYRIGHT Copyright (c) 2002, J.B.Ward <bward2@users.sourceforge.net> Expaminator Nov.22,2002 sendmail_bayes(8)