Tuesday, September 9th, 2008

Breakfast at Tiffany's

Adapted from Wikipedia:

Holly Golightly meets Paul Varjak, a new tenant in her building. Paul helps Holly prepare to visit Sing Sing, a weekly routine from which she earns $100 for an hour's conversation with Sally Tomato, an incarcerated mob boss. Holly does not realize that she is passing coded messages for Sally's drug ring. Paul is a writer but hasn't been published since 1956. Holly ran away from home at 14. The next day Paul attends a party at Holly's where he is introduced to José da Silva Pereira, a handsome, rich Brazilian. Holly, always trying to improve her lifestyle by marrying rich men, plans to marry José, although she has started a relationship with Paul. Months later, Paul has moved out of the brownstone but is visiting Holly, who is leaving the next morning for Brazil. They go out for dinner and upon returning are arrested in connection with Holly's dealings with Sally Tomato. Holly spends the night in jail. Paul picks her up in the morning, along with a letter from José in which he breaks up with Holly because his position won't allow him to marry someone who has been in jail. Holly still refuses to marry Paul, but finally changes her mind.

Spam Blocking Statistics

Out of the last 6370 spams, 5798 (91.0%) were blocked based on the IP address of the sender (IPBLACKLIST).

Out of the other 572, 394 (68.9%) were blocked by a simple trap (COIN, a field that should be left empty), 83 (14.5%) were blocked because they contained the same URL more than twice (SAMEURL), 49 (8.57%) had too many URLs per word (TOOMANYURL), 15 (2.62%) were blocked by keyword (KEYWORD), 7 (1.22%) had the same values for title, blog name and excerpt (SAMETITLE), 5 (0.874%) had more than 4 URLs pointing to the same server (SAMESERVER), 3 (0.524%) were flagged as containing random data (RANDOM; none of them actually did, but they were spam nonetheless) and 2 (0.350%) contained only hex data (HEXDATA).
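To make these rules more concrete, here is a rough Python sketch of how such a cascade of checks could look. It is not the code running on this blog: the field names (coin, title, blog_name, excerpt), the order of the checks and the max_urls_per_word threshold are all assumptions made for illustration. KEYWORD and RANDOM are left out because their exact rules are not described above.

```python
import re
from collections import Counter
from urllib.parse import urlparse

# Very loose URL matcher, good enough for a sketch.
URL_RE = re.compile(r"https?://\S+")


def classify(submission, max_urls_per_word=0.2):
    """Return the name of the first rule that fires, or None.

    `submission` is a dict with hypothetical keys: coin, title,
    blog_name, excerpt. The threshold value is an assumption.
    """
    text = submission.get("excerpt", "")
    urls = URL_RE.findall(text)
    words = text.split()

    # COIN: a trap field that a human would leave empty.
    if submission.get("coin"):
        return "COIN"

    # SAMEURL: the same URL appears more than twice.
    if any(n > 2 for n in Counter(urls).values()):
        return "SAMEURL"

    # TOOMANYURL: too many URLs relative to the number of words.
    if words and len(urls) / len(words) > max_urls_per_word:
        return "TOOMANYURL"

    # SAMETITLE: title, blog name and excerpt are identical.
    title = submission.get("title")
    if title and title == submission.get("blog_name") == text:
        return "SAMETITLE"

    # SAMESERVER: more than 4 URLs pointing to the same server.
    hosts = Counter(urlparse(u).netloc for u in urls)
    if any(n > 4 for n in hosts.values()):
        return "SAMESERVER"

    # HEXDATA: nothing but hexadecimal digits and whitespace.
    if text and re.fullmatch(r"[0-9a-fA-F\s]+", text):
        return "HEXDATA"

    return None
```

Returning the name of the first rule that fires is what makes per-rule counts like the ones above possible, since each spam is attributed to exactly one filter.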

Overall, 14 spams had to be hand moderated, which makes a false negative percentage of 0.22%.
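For anyone who wants to check the arithmetic, the following Python snippet recomputes the figures from the counts given above. The variable names are mine; the numbers are the ones quoted in this post.

```python
total = 6370
by_ip = 5798  # blocked by IPBLACKLIST

# Spams that got past the IP blacklist, by the filter that caught them.
by_filter = {
    "COIN": 394,
    "SAMEURL": 83,
    "TOOMANYURL": 49,
    "KEYWORD": 15,
    "SAMETITLE": 7,
    "SAMESERVER": 5,
    "RANDOM": 3,
    "HEXDATA": 2,
}


def pct(part, whole):
    """Percentage of `whole` represented by `part`."""
    return 100.0 * part / whole


remaining = total - by_ip          # 572 spams left after the IP check
caught = sum(by_filter.values())   # 558 caught by the content filters
missed = remaining - caught        # 14 left for hand moderation

print("IPBLACKLIST: %.1f%%" % pct(by_ip, total))
for name, count in by_filter.items():
    print("%s: %.3g%%" % (name, pct(count, remaining)))
print("False negatives: %.2f%%" % pct(missed, total))
```

Note that the per-filter percentages are relative to the 572 spams that passed the IP blacklist, while the false negative percentage is relative to all 6370.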

The false positives I've had were caused by the TOOMANYURL filter, but it also catches a lot of spam. In most of the spams it caught, the URLs were not real but made of random letters.

