spam assassin | Tech, Faltoo Fundae, and Vichitra ArBits

Self-Spam

May 30, 2008 at 5:27 pm | Posted in arbit, criticism, humour, image processing, sarcasm, spam, Technical, Visions | 9 Comments
Tags: akismet, captcha, email, gmail, google, image processing, nigeria, pranketh, recaptcha, scam, self-spam, spam assassin, spamming, spoof, waterloo

A small technical exercise testing the effectiveness of gmail to combat spam, which on their site says “Fast, searchable email with less spam”.

This is true mostly. However, there is a downside to gmail spam filters w.r.t false-positives, i.e (filter mistakenly flags a good mail as spam)
These numbers are particularly high in case of gmail. I’m comparing it to, say using Spam-Assassin , or Akismet as your spam filter. I’ve had a sufficient number of good mails, not so useful, but not spam either-> which have been flagged off as spam.
This is unacceptable. In my case, I’ve the patience, and also a weird interest to go through my spam messages everday. But, busy people would consider this to be a “costly nuisance”.
The Nuisance is obvious. It’s also costly as the flagged message might have critical business information.
Most people however do not worry much about this. Because they trust their spam filters to weed out only the bad guys. As Matt Cutts of Google says about Google Search so elegantly,

“Sure,I could stop all the spam in the world if I didn’t have to return any search results.” 🙂 [source]

What people don’t like, however, is the appearance of { stimulating/enlarging/Nigerian King bequeathing his 100 million $ /cheap adobe software or Rolex-es / mortgage loans at unbelievable rates } mails in their inbox. This is totally irritating, and this does more damage to the lay-user than the previously mentioned almost-rare case.
Spam filters involve many techniques which are constantly under development. These chiefly include heuristic[ drawing from previous experiences, a learning method], bayesian filters, simple database matching, matching ip’s with regular spammers, and lots more involving complex probability/statistical models beyond my wildest dreams.
So, being the evil chap that I am, I did this nonsensical thing. I sent a spam mail to myself.

Granted, this is stupid. Gmail should place a trust on the sender[ me], and the sending server[gmail.com], and hence classify this as a legitimate mail.
So I repeated this exercise with a well-known email spoofer, www.pranketh.com . What this nifty project [written by two brilliant chaps from University of waterloo] does is pretty simple.
As the site says – “It allows you to send an email, that looks like its sent from someone else.” Or in simple terms.
I can send an email from id’s like mukeshambani[at]reliance[dot]in, or soniagandhirocks[at]congressrocks[dot]gov[dot]in.
[ note the delicate usage of “rocks”, Blogger belongs to google. I don’t want no risks]. A screenshot of pranketh’s page.

Yes. I know your doubt. If its so easy, why do nefarious miscreants use stupid yahoo/gmail id’s to threaten people, or send smokescreen bomb-footage, to the extremely retarded tv channel Aaj Tak[ self-proclaimed to be “Sarv-Shresht].
The point is the email server, folks. Your id is spoofed. But the smtp server name. No no. I couldn’t dream in my distant dreams to get a gov.in smtp[ Since I don’t happen to be a chinese (govt-sponsored) hacker 😦 ]. As a test case, try sending a prankethmail to your own id. And check the “original mail” option in gmail.
Ah. The post digresses from its core issue. Lets come back to the main point, shall we.
An important reason to gmail, not blocking my self-spam, was that it trusts my id, and its servers.
I tried sending the same text through pranketh. Guess what. It thrashes the mail left and right, before it even leaves their servers. Why?. They use Aksimet.

My advice to gmail [ In the remotest probability that Matt Cutts is reading this,], and other mail providers is this.
Google’s servers are checking all our mail-contents for generating their automated ads and stuff anyways. So, there is no illusion of privacy. So, the next time, I’m sending a mail, check the contents before hand. Warn me if its spammish. Keep the thresholds appropriately such that I’m not regularly annoyed with these warnings.
If all smtp servers start this routine, we can see at least some major changes.

1. All the worlds emails would take a longer time to reach their destination. [ There’s got to be some catch.This is it]

2. Say I send one mail to a big bunch of people, it’d be scanned for spam-behaviour only once. Then some certification can be piggy-backed along, saying its reliable, and not spam. The experts can handle that bit. Not too difficult.

3. Botnets prevention. Say, some dumbo privately runs a smtp server, and has been been subjected to a backdoor/trojan attack. And this is currently acting as a zombie sending out bunch of viagrish mails to innocent people, who’ve left their email id’s lying out in the open.

I’m not saying you give up your earlier approach. That’d be foolish. But if its absolutely obvious that a mail is spammy[ self-spam for eg.]. Block it before it leaves your grounds.

Now, a general warning to all those who think spams are obnoxious. Your bigdaddydog@rediffmail.com might be the prettiest email id around. But don’t leave it on some arbit website, for all the world to see. One syntax-based text crawler and you get thousands of them.
Believe me, some of these spammers are millionaires[ Not the Nigerian kind]. And run their business professionally. And have awesome technical expertise too.
If anything, don’t make their job easier. Let them just fight it out with the big-guys[ yahoo, google, msft et al].

I reiterate. If you desperately want to put your email id on the net, use images like these.

And, a word of caution. Even this is not safe. Within 1-2 years, google image search is going to search the contents in images. And character recognition. Piece of cake.
So, what do you do next. Do not fear, I have the ideas.

1. When you must, put your email id’s with re-captcha. I’d written a post about this some time back. My email id through this schema would be abhi…@gmail.com .Go to their website and register for their free service. The only reason this idea is safe, however, is because Spammers, like all of us, are average-ramesh hard-working people. They do not have time to fill your captchas. Where as, your friends and people who want to see your email id so desperately, do.

2. Use images, but this time, write them with 3-d blocks. Even by extreme image processing hacking standards, this is nearly safe for 5-6 years.

3. Do not put email id’s on the internet.

This is probably the first in a series of spam-related posts to come.

P.S: I kinda remembered the first word of my blog’s title, and how I hadn’t paid any attention to it, for the past few months.

And, let me clarify. I love gmail.

Meta
Blogroll
Timepass
Mindless Sheep
- 27,395 and counting...
arbit Bengaluru Carnatic chappar criticism Ethics fakereviews humour image processing Internet introduction Linux literary Microsoft music narcissism news wagon nitk nostalgia Poe poetry Politics protocol sarcasm spam Technical travel twitter Uncategorized Visions
afro-american akismet amol palekar anorexia audiophish audio processing bengalooru Bengaluru blackberry bosey bsnl call center captcha Carnatic cartoons cheap humour chetan Bhagat coffee communist china davangere delhi bomb blasts deve gowda dinga email farewell figures of speech five point someone free tibet geekiness gmail google grammar nazis himesh reshammiya hitchhiker's guide to the galaxy hotels image processing India fights back istwitterdown Linux mandolin shrinivas narendra modi nigeria nitk noam chomsky notareview phishing scams placement pranketh recaptcha review richmond road Sakkut scam self-spam senti shashi tharoor shivraj patil site-down slashdot sonu nigam spam assassin spamming spoof temple-tours terror emails tetris Three mistakes tronix turing twitter upma sucks vacation waterloo women's liberation woodlands
Top Posts
An Indian Blogger, I am…
Archives
- October 2012
- August 2012
- June 2011
- March 2011
- August 2009
- July 2009
- June 2009
- January 2009
- November 2008
- October 2008
- September 2008
- August 2008
- July 2008
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- July 2007
- June 2007
- May 2007
- April 2007
- March 2007
- February 2007
- January 2007
- December 2006
May 2024

M T W T F S S

1 2 3 4 5

6 7 8 9 10 11 12

13 14 15 16 17 18 19

20 21 22 23 24 25 26

27 28 29 30 31

« Oct
Stats
Chirpy Twitter, Twirpy Chitter
Tweets by _logik
Pages
- About me, this Blog, and everything else
- googlef09ecbc4298a5bf3.html

May 2024
M	T	W	T	F	S	S
	1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Blog at WordPress.com.
Entries and comments feeds.

	sup murthy on Chetan Bhagat – The thre…
	Mustafa on Chetan Bhagat – The thre…
	Logik on Fire in Babylon – Caribb…
	Shamanth on Fire in Babylon – Caribb…
	SWATI on The Full Monty Problem…

Tech, Faltoo Fundae, and Vichitra ArBits

Self-Spam

Benda Kalu

Search. Find Posts. Read

Recent Posts

Recent Comments

Meta

Blogroll

Timepass

Mindless Sheep

Top Posts

An Indian Blogger, I am…

Archives

Stats

Chirpy Twitter, Twirpy Chitter

Pages