> The DSPAM Project, Interesting new SPAM filter
hschneider
Posted: Jun 1 2004, 05:59 AM

QUOTE

DSPAM (as in De-Spam) is an extremely scalable, open-source statistical hybrid anti-spam filter. While most commercial solutions provide only about 95% accuracy (1 error in 20), a majority of DSPAM users frequently see between 99.95% (1 error in 2000) and 99.985% (roughly 1 error in 7000). DSPAM is currently effective both as a server-side agent for UNIX email servers and as a developer's library for mail clients, other anti-spam tools, and similar projects requiring drop-in spam filtering. DSPAM has been implemented on many large- and small-scale systems, with the largest reported at about 125,000 mailboxes.


What is a Statistical Hybrid Filter?
Present-day language classifiers bear the responsibility of maintaining accuracy in the midst of ever-increasing sample complexity. In spam filtering, many types of intentional attacks have been introduced, such as obfuscation, word list injection, sample flooding, and broken RFC, all of which have caused commercial-grade filters to drop to around 95% accuracy or worse. As the complexity of classification text continues to multiply rapidly, many filter developers today are torn between increasing the complexity of their filter and the wise teaching from CS class that computer science is about controlling complexity, not creating it. At the rate complexity is rising, filters will become (and have already begun to become) so resource-intensive that they lose scalability, eventually leading to a second conflict of interest: fighting spam becomes more expensive than managing it.

DSPAM is a Statistical Hybrid filter and, as such, provides higher accuracy levels than commercial solutions with minimal resources. By employing algorithms designed to increase the filter's concept-identification abilities, the quality of the existing data is improved, resulting in a reduced feature set (rather than an increased one). DSPAM has managed to achieve nearly equal levels of accuracy to present-day Markovian-based filters and other types of filters that employ large feature sets, with the added benefit of using significantly fewer resources. DSPAM presently peaks at 99.985% accuracy, which is ten times more accurate than a human being [1], and is presently being used on implementations as large as 125,000+ mailboxes. Concept identification is important, and it is sometimes necessary to expand the feature set to identify these concepts (such as "free viagra"). However, statistical hybrid filters attempt to maximize the efficiency of these concepts rather than expand the feature set even further to cover all kinds of miscellaneous lexical data.
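
To put the accuracy figures quoted above side by side, here is a minimal sketch that converts an accuracy percentage into "1 error in N messages". It is plain arithmetic and has nothing to do with DSPAM's own code; the function name messages_per_error is made up for this illustration.

```python
from fractions import Fraction


def messages_per_error(accuracy_percent: str) -> Fraction:
    """Return N such that the given accuracy means 1 error in N messages.

    The percentage is passed as a string so that Fraction can parse it
    exactly and avoid floating-point rounding noise.
    """
    error_rate = (100 - Fraction(accuracy_percent)) / 100
    if error_rate <= 0:
        raise ValueError("accuracy must be below 100%")
    return 1 / error_rate


# Accuracy figures taken from the quoted project description.
for pct in ("95", "99.95", "99.985"):
    n = round(messages_per_error(pct))
    print(f"{pct}% accuracy is about 1 error in {n} messages")
```

By this arithmetic 99.985% works out to about 1 error in 6,667 messages, so the "1 error in 7000" in the quote is a rounded figure.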


--------------------
Bye,
Harald


Frankie
Posted: Nov 13 2004, 02:04 PM

Hi,

Yes, DSPAM looks promising, but is there a HOWTO or some other readme written specifically for XMail and DSPAM? Preferably on FreeBSD 5.2.1.
I am relatively new to FreeBSD, so any documentation about this combination would be highly appreciated.

Thanx,

Frank
hschneider
Posted: Nov 13 2004, 02:10 PM

Sorry - there is no HOWTO so far.


--------------------
Bye,
Harald

