From d8b82b24a479902c0ba142f8eb317e5ab6ba5d3c Mon Sep 17 00:00:00 2001 From: friendica Date: Wed, 15 Feb 2012 23:02:05 -0800 Subject: [PATCH] Documentation on Friendica changes to B8 and notice of source availability in accordance with LGPL --- library/spam/README | 12 ++++++++++++ 1 file changed, 12 insertions(+) create mode 100644 library/spam/README diff --git a/library/spam/README b/library/spam/README new file mode 100644 index 000000000..a0d67fca7 --- /dev/null +++ b/library/spam/README @@ -0,0 +1,12 @@ +B8 for Friendica + +B8 is an excellent bayesian spam implementation for PHP. However when evaluating it for use in Friendica there were a few shortcomings. B8's primary audience is guestbooks and blogs - single user situations. + +Friendica is a multi-user distributed social environment. So the first thing we need to add to b8 is a concept of user ID. + +Second we don't want to use a second stored set of DB login credentials so we're going to implemetn Friendica's MySQL driver and use our existing connection and credentials. + +The third requirement is that the B8 processing model is to load a set of word/data sets from the DB, perform processing (which may change the value of the data) and then store the results back to the DB. We're in a highly dynamic environment with lots of sometimes concurrent message processing. So the plan is to alter the storage architecture to read data in, do processing, and then apply a somewhat atomic change operation where the changes are performed in a single query using the current data in storage rather than something passed through outside processing and where the data may be outdated come time to store it. + +In accordance with the LGPL of the B8 package these changes are available in source form at http://github.com/friendica/friendica in the directory library/spam + \ No newline at end of file