Friday, September 12th, 2008

Random Spam Filter

I removed the RANDOM spam filter, which compared the letter frequencies of a comment with statistics for English, French and Finnish. It wasn't catching much spam anyway, and the spam it did catch was not random but contained lots of medicine names, which don't follow the statistical patterns of English, French or Finnish. It also filtered out one legitimate comment whose author's name was Finnish and whose content was in English (thus matching neither Finnish nor English statistics), which is not nice.
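
For readers curious what a letter-frequency filter of this kind might look like, here is a minimal sketch in Python. It is not the code that was removed. The function names, the distance measure (sum of absolute differences), the 0.8 threshold and the restriction to ASCII letters are all assumptions made for illustration, and the reference profile is built from a toy sample sentence rather than real language statistics.

```python
from collections import Counter
import string


def letter_frequencies(text):
    """Return the relative frequency of each ASCII letter a-z in text."""
    letters = [ch for ch in text.lower() if ch in string.ascii_lowercase]
    total = len(letters)
    counts = Counter(letters)
    return {ch: (counts[ch] / total if total else 0.0)
            for ch in string.ascii_lowercase}


def distance(freq_a, freq_b):
    """Sum of absolute differences between two frequency tables (0.0 to 2.0)."""
    return sum(abs(freq_a[ch] - freq_b[ch]) for ch in string.ascii_lowercase)


def looks_random(comment, profiles, threshold=0.8):
    """True if the comment is far from every reference language profile.

    The threshold is a hypothetical value; a real filter would need tuning.
    """
    freqs = letter_frequencies(comment)
    if not any(freqs.values()):
        return False  # no letters at all: nothing to judge
    return min(distance(freqs, p) for p in profiles.values()) > threshold


# Toy profile built from one sample sentence (an assumption, not real statistics).
profiles = {
    "english": letter_frequencies(
        "this is a simple sentence written in plain english"
    ),
}

print(looks_random("zzzzqqqq xxxx", profiles))
print(looks_random("this is a simple sentence written in plain english", profiles))
```

A comment is flagged only when it is far from every profile. That is also why such a filter can misfire on legitimate text: a comment mixing two languages, or one full of product names, can end up far from each individual profile.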

