h a l f b a k e r y[marked-for-tagline]
add, search, annotate, link, view, overview, recent, by name, random
news, help, about, links, report a problem
browse anonymously,
or get an account
and write.
register,
|
|
|
Simple but may be effective. Correlate many different samples of junk mail (if only I had many samples of junk mail...) and find strong correlations. These could be common words, common word combinations, etc. The same sort of analysis used for speach recognition. If you wanted to get really fancy,
use markov chains or something but even something simple might do the trick. I haven't seen anything like this floating around but if it was effective, I'd swear by it.
Here ya go
http://thespamletters.com/about.shtml If I could save time in a bottle
I could throw away my gift lists. [reensure, Dec 30 2001]
A Plan for Spam
http://www.paulgraham.com/spam.html Bayesian spam filters are common nowadays (dunno if there were any in 2001) [quarl, Nov 09 2004]
Still stuff to be mined using this approach that Bayesian ignores
http://www.gtoal.co...he_best_of_bot.html You can find common substrings using an algorithm similar to LZ77 compression [gtoal, Mar 29 2006]
Please log in.
If you're not logged in,
you can see what this page
looks like, but you will
not be able to add anything.
Annotation:
|
|
ugh...that's _speech_ recognition. |
|
|
Are you suggesting the correlations could have some sort of meaning hitherto unseen? Messages from the gods? Government codes used to mystify and intrigue the general populace whilst dissemenating signals to suicide shoe bombers/undercover pedants/recipe card traders etc? |
|
|
Do people actually read junk email? I shall look closely at the next one I recieve in the slim chance it may reveal the location of the microfilm! |
|
|
[jeffie], the Halfbakery lets you edit what you wrote. You can fix your typo and delete the annotation you added at the top. |
|
|
«if only I had many samples of junk mail» Personally, I'd rather get sent less, not more. |
|
|
[benfrost] I'm assuming that this idea is for a system which automatically removes junk email. |
|
|
For some reason, I don't get junk mail. I think earthlink blocks it somehow. (Not complaining, by any means.) |
|
|
Earthlink does...but I'd rather deal with it myself; no filter is perfect, and will delete something I DO want to see. |
|
|
Besides, then I can be rude to them or forward them to mail fraud or the FBI... |
|
|
And if you want a lot of samples of spam, I can set my filters up to forward them to you rather than delete them... |
|
|
would someone please send me some junk - I have had no mail at all this weekend and feel so depressed. |
|
|
Baked. E-mail filters already look for '!!!' or 'GET RICH QUICK', etc. If your e-mail program doesn't, make your own filter. |
|
| |