Welcome to the GoFuckYourself.com - Adult Webmaster Forum forums.

You are currently viewing our boards as a guest which gives you limited access to view most discussions and access our other features. By joining our free community you will have access to post topics, communicate privately with other members (PM), respond to polls, upload content and access many other special features. Registration is fast, simple and absolutely free so please, join our community today!

If you have any problems with the registration process or your account login, please contact us.

Post New Thread Reply

Register GFY Rules Calendar
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >
Discuss what's fucking going on, and which programs are best and worst. One-time "program" announcements from "established" webmasters are allowed.

 
Thread Tools
Old 04-27-2018, 06:25 AM   #1
Cyber Fucker
Hmm
 
Cyber Fucker's Avatar
 
Industry Role:
Join Date: Sep 2005
Location: On an endless road around the world for rock and roll.
Posts: 12,642
:2cents DotBot from moz.com is not obeying robots.txt directives

It was pounding like crazy 1000s of my pages daily, I had not choice but to ban on webserver level.

Besides what good it does? It's just spying on your keywords and links.

Well, even after ban it is still trying to access my sites. (I allowed only robots.txt for it. It reads it and does not give a fuck.) So their shit is obviously broken.

What I’m trying to say is that, perhaps you should review your access logs from time to time and see what is wasting your bandwidth and server resources.


For example, my other observation is that within last 3 years I have received from 500 000 to 1000 000 hacking and exploiting attempts per each live website. While all of them were not successful, it must have some impact on server performance.

Well configured webserver can reduce all this bad stuff by 90-99%.
__________________
Cyber Fucker is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 04-27-2018, 09:55 AM   #2
rowan
Too lazy to set a custom title
 
Join Date: Mar 2002
Location: Australia
Posts: 17,393
What directive(s) was the bot ignoring?
rowan is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 04-27-2018, 09:58 AM   #3
Cyber Fucker
Hmm
 
Cyber Fucker's Avatar
 
Industry Role:
Join Date: Sep 2005
Location: On an endless road around the world for rock and roll.
Posts: 12,642
Code:
User-agent: dotbot
Disallow: /
__________________
Cyber Fucker is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 04-27-2018, 10:02 AM   #4
freecartoonporn
Confirmed User
 
freecartoonporn's Avatar
 
Industry Role:
Join Date: Jan 2012
Location: NC
Posts: 7,683
disallow all

allow google.

or contact the dotbot guys
freecartoonporn is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 04-28-2018, 10:55 AM   #5
Bladewire
StraightBro
 
Bladewire's Avatar
 
Industry Role:
Join Date: Aug 2003
Location: Monarch Beach, CA USA
Posts: 56,229
Have you blocked it by IP? If so , has it come back under other IP's?
Bladewire is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 04-28-2018, 11:21 AM   #6
TFCash
Confirmed User
 
Industry Role:
Join Date: Apr 2001
Posts: 1,738
Try this little script out, it seems to grab most of the bad bots.

Bot Black Hole
__________________
TeenFlood.com Online since 1998.

TFCash KissMeGirl
VirginRiches MondoBucks

tim at tfcash.com or submit a ticket at our HelpDesk
TFCash is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 04-28-2018, 12:10 PM   #7
Cyber Fucker
Hmm
 
Cyber Fucker's Avatar
 
Industry Role:
Join Date: Sep 2005
Location: On an endless road around the world for rock and roll.
Posts: 12,642
Nah, I've blocked it by user-agent header, I got no problem with it now since it’s blocked.
This thread was more like an educational one and a suggestion for everyone to monitor webserver logs, at least from time to time.
I'm fine.

Less resources eaten by bad stuff = more resources for good traffic = speed and speed = better SEO.
__________________
Cyber Fucker is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Old 04-28-2018, 09:18 PM   #8
rowan
Too lazy to set a custom title
 
Join Date: Mar 2002
Location: Australia
Posts: 17,393
Even "good" bots can cause issues.

At one point GoogleBot was fetching 150k+ pages per day from one of my sites. The site is heavily database driven so fetching two pages per second continuously did cause some server load issues.

You can dial back the crawl rate in webmaster tools, but that setting expires after 90 days, and then Googlebot just starts pounding away again. They deliberately ignore the Crawl-Delay robots.txt directive.
rowan is offline   Share thread on Digg Share thread on Twitter Share thread on Reddit Share thread on Facebook Reply With Quote
Post New Thread Reply
Go Back   GoFuckYourself.com - Adult Webmaster Forum > >

Bookmarks

Tags
time, robots.txt, access, ban, sites, allowed, reads, logs, wasting, bandwidth, resources, server, review, i’m, fuck, 1000s, crazy, pages, daily, pounding, moz.com, obeying, directives, choice, spying



Advertising inquiries - marketing at gfy dot com

Contact Admin - Advertise - GFY Rules - Top

©2000-, AI Media Network Inc



Powered by vBulletin
Copyright © 2000- Jelsoft Enterprises Limited.