tech support 13

  • Subscribe to our RSS feed.
  • Twitter
  • StumbleUpon
  • Reddit
  • Facebook
  • Digg

Monday, 17 October 2005

On Spam

Posted on 15:44 by Unknown
Spam is a tricky problem. Or as Matt Haughey says "spam bloggers sure are resourceful little bastards."

For a while now, the Blogger team has been contending with spam on Blog*Spot through mechanisms like Flag as Objectionable and comment/blog creation CAPTCHAs. The spam classifier that Pal described has also dramatically reduced the amount of spam that folks experience when browsing NextBlog.

However, spam is still being created and, as was widely noted, Blogger was especially targeted this weekend.

One group of folks who are particularly affected by blog spam are those who use blog search services and those who subscribe to feeds of results from those services. When spam goes up, it directly affects the quality of those results. I'm exceedingly sympathetic with these folks because, well, we run one of those services ourselves.

So given that the problems is hard, what more are we doing? One thing we can do is improve the quality of the Recently Updated information we publish.

Recently Updated lists like the one Blogger publishes are used by search services to determine what to crawl and index. A big goal in deploying the filtered NextBlog and Flag as Objectionable was to improve our spam classifiers. As we improve these algorithms, we plan to pass the filtered information along automatically. Just as a first step, we're publishing a list of deleted subdomains that were created this weekend during the spamalanche.

Greg from Blogdigger (one of the folks who consumes blog data) points out that "ultimately the responsibility for providing a quality service rests on the shoulders of the individual services themselves, not Google and/or Blogger." However, we think by sharing what we've learned about spam on Blogger we can hopefully improve the situation for everyone.

We can also make it more difficult for suspected spammers to create content. This includes placing challenges in front of would-be spammers to deter automation.

Of course, false positives are an unavoidable risk with automatic classifiers. And it's important to remember that the majority of content being posted on Blog*Spot is not spam (we know this from the ongoing manual reviews used to train the spam classifier).

Some have suggested that we go a step farther and place CAPTCHA challenges in front of all users before posting. I don't believe this is an acceptable solution.

First off, CAPTCHAs represent a burden for all users (the majority of whom are legit), an impossible barrier for some, and are incompatible with API access to Blogger.

But, most importantly, wrong-doers are already breaking CAPTCHAs on a daily basis. And not through clever algorithmic means but via the old-fashioned human-powered way. We've actually been able to observe when human-powered CAPTCHA solvers come on-line by analyzing our logs. You can even use the timestamps to determine from whence this CAPTCHA-solving originates.

One thing we've learned from Blog Search, is that even if spam were completely solved on Blog*Spot, there would still be a problem. As others have concluded, we've realized that this is going to be an on-going challenge for Blogger, Google and all of us who are interested in making it easier for people to create and share content online.
Email ThisBlogThis!Share to XShare to FacebookShare to Pinterest
Posted in | No comments
Newer Post Older Post Home

0 comments:

Post a Comment

Subscribe to: Post Comments (Atom)

Popular Posts

  • Partner Profile: Lijit
    Periodically, we profile a Blogger partner that can add functionality to your blog. This week we'd like to spotlight Lijit , a company b...
  • One-click Blogging with BlogThis! Chrome Extension
    by Chang Kim, Product Manager, Blogger More and more of you are using Google Chrome ( more than 30 million active users now !), and we want ...
  • Stuff to Think About
    Check out this post by CJ , who's already integrated Google's AJAX Search API into his Blogger blog : "A little HTML, a little...
  • Go Mobile
    A while back we mentioned that we've been working with Sony Ericsson to incorporate blogging into their new generation of cameraphones....
  • Monetize!
    You may have noticed that about a week ago a new tab showed up in Blogger for your blog. The tab is called Monetize, and in case it isn'...
  • Bloggers wanted!
    Have opinions about Blogger? If so, we'd like to meet you. We are looking for participants willing to document their blogging practices ...
  • Blogger Widget for Mac Dashboard
    I’m really excited to introduce you to the Blogger widget, one of the three widgets for Mac OS X 10.4 now available on Google Labs . This l...
  • Blogger Help Group
    Hi, everyone - it's Andrea from Blogger Support here. I'm very excited to announce the launch of our new Blogger User-to-User Help G...
  • Keeping Your Blog Secure
    While October is to many a month of candy and costumes, it also happens to be National Cyber Security Awareness Month in the U.S. In that s...
  • The Future of Moviegoing
    MobMov is a grass roots guerilla drive-in movie event type of thing. They represent the future of going out to watch movies with friends. D...

Categories

  • +1
  • 10th Birthday
  • 2010
  • accessibility
  • ads
  • adsense
  • Amazon
  • Android
  • Blog2Print
  • Blogger
  • Blogger birthday
  • Blogger Fiesta
  • Blogger Meetup
  • Blogger Stats
  • Blogger Template Designer
  • Blogger2Print
  • blogspot
  • BlogThis
  • blogworld
  • Buzz
  • calendar
  • Chrome
  • code
  • commenting
  • community
  • conference
  • custom domain
  • developers
  • DMCA
  • draft
  • dynamic views
  • events
  • feedburner
  • feeds
  • firefox
  • follow by email
  • following
  • foxytunes
  • FTP
  • gadgets
  • GAN
  • Google Analytics
  • Google Buzz
  • Google Sites
  • google+
  • grandcentral
  • help
  • ios
  • jump
  • knol
  • lightbox
  • mobile
  • monetize
  • music
  • navbar
  • New UI
  • next blog
  • OneTrueFan
  • openid
  • OpenSky
  • Page Creator
  • pages
  • pixelodeon
  • polls
  • post summaries
  • read more
  • recommend
  • SEO
  • Share
  • support
  • SXSW
  • template designer
  • twitter
  • video
  • videoblogging
  • Viglink
  • web fonts
  • webcall
  • youtube
  • zemanta

Blog Archive

  • ►  2013 (5)
    • ►  September (1)
    • ►  August (1)
    • ►  June (1)
    • ►  April (2)
  • ►  2012 (18)
    • ►  December (1)
    • ►  November (2)
    • ►  October (1)
    • ►  September (1)
    • ►  August (1)
    • ►  July (2)
    • ►  June (1)
    • ►  May (2)
    • ►  April (3)
    • ►  March (1)
    • ►  February (1)
    • ►  January (2)
  • ►  2011 (47)
    • ►  December (2)
    • ►  November (5)
    • ►  October (6)
    • ►  September (4)
    • ►  August (2)
    • ►  July (8)
    • ►  June (4)
    • ►  May (3)
    • ►  April (4)
    • ►  March (6)
    • ►  February (2)
    • ►  January (1)
  • ►  2010 (40)
    • ►  December (6)
    • ►  November (2)
    • ►  October (4)
    • ►  September (5)
    • ►  August (6)
    • ►  July (1)
    • ►  June (5)
    • ►  May (1)
    • ►  April (1)
    • ►  March (3)
    • ►  February (4)
    • ►  January (2)
  • ►  2009 (42)
    • ►  December (1)
    • ►  November (3)
    • ►  October (5)
    • ►  September (9)
    • ►  August (7)
    • ►  July (2)
    • ►  June (4)
    • ►  May (3)
    • ►  April (3)
    • ►  March (2)
    • ►  February (2)
    • ►  January (1)
  • ►  2008 (34)
    • ►  December (2)
    • ►  November (2)
    • ►  October (3)
    • ►  September (2)
    • ►  August (7)
    • ►  July (2)
    • ►  June (2)
    • ►  May (3)
    • ►  April (4)
    • ►  March (2)
    • ►  February (4)
    • ►  January (1)
  • ►  2007 (47)
    • ►  December (3)
    • ►  November (5)
    • ►  October (6)
    • ►  September (2)
    • ►  August (5)
    • ►  July (3)
    • ►  June (5)
    • ►  May (7)
    • ►  April (3)
    • ►  March (5)
    • ►  February (1)
    • ►  January (2)
  • ►  2006 (91)
    • ►  December (7)
    • ►  November (6)
    • ►  October (6)
    • ►  September (9)
    • ►  August (7)
    • ►  July (5)
    • ►  June (5)
    • ►  May (8)
    • ►  April (7)
    • ►  March (12)
    • ►  February (11)
    • ►  January (8)
  • ▼  2005 (176)
    • ►  December (9)
    • ►  November (11)
    • ▼  October (13)
      • Unreliable Narrator
      • If you can't say something nice ...
      • Spam, APIs and Whitelisting
      • Blog Book Deal
      • Spam Barriers (Redux)
      • Spam Barriers
      • On Spam
      • Weblog Usability
      • Photolightning
      • Blogging in the Early Republic
      • Introducing Backlinks
      • Google Reader in the Wild
      • Write a Novel!
    • ►  September (29)
    • ►  August (23)
    • ►  July (13)
    • ►  June (22)
    • ►  May (25)
    • ►  April (29)
    • ►  March (2)
Powered by Blogger.

About Me

Unknown
View my complete profile