Spider Spamming
As Sam repeatedly reminds us, HTTP GETs should be idempotent.
Usually, it’s “granny” who must be protected from GETting some unsafe resource. In the case of blogs, however, it’s not granny you need to worry about. It’s search engine spiders.
sv-crawl.looksmart.com - - [03/Nov/2005:14:37:47 -0600] "GET /cgi-bin/MT-2.5/sxp-comments.pl?comments_form&static=1&entry_id=75&author=affiliate%20software&email=ciali@mail.com&url=http://www.partnersmanager.com&text=great%20site&bakecookie=1&post=%20POST HTTP/1.1" 302 571 "-" "Mozilla/4.0 compatible ZyBorg/1.0 (wn-14.zyborg@looksmart.net; http://www.WISEnutbot.com)" sv-crawl.looksmart.com - - [03/Nov/2005:14:37:47 -0600] "GET /cgi-bin/MT-3.0/sxp-comments.pl?comments_form&static=1&entry_id=75&author=affiliate%2520software&email=ciali@mail.com&url=http://www.partnersmanager.com&text=great%2520site&bakecookie=1&post=%2520POST HTTP/1.1" 200 6139 "-" "Mozilla/4.0 compatible ZyBorg/1.0 (wn-14.zyborg@looksmart.net; http://www.WISEnutbot.com)" sv-crawl.looksmart.com - - [03/Nov/2005:22:10:26 -0600] "GET /cgi-bin/MT-2.5/sxp-comments.pl?comments_form&static=1&entry_id=75&author=levitra&email=ciali@mail.com&url=http://www.one-levitra.com&text=great%20site&bakecookie=1&post=%20POST HTTP/1.1" 302 552 "-" "Mozilla/4.0 compatible ZyBorg/1.0 (wn-14.zyborg@looksmart.net; http://www.WISEnutbot.com)" sv-crawl.looksmart.com - - [03/Nov/2005:22:10:26 -0600] "GET /cgi-bin/MT-3.0/sxp-comments.pl?comments_form&static=1&entry_id=75&author=levitra&email=ciali@mail.com&url=http://www.one-levitra.com&text=great%2520site&bakecookie=1&post=%2520POST HTTP/1.1" 200 6084 "-" "Mozilla/4.0 compatible ZyBorg/1.0 (wn-14.zyborg@looksmart.net; http://www.WISEnutbot.com)"
Does your blogging system accept comments via HTTP GET?