Page 1 of 3 123 LastLast
Results 1 to 20 of 45

Thread: How do you detect "Hit Bots"

  1. #1
    I love AskDamageX.com prono's Avatar
    Join Date
    May 2009
    Location
    South Bitch Miami, Fla
    Posts
    94

    Default How do you detect "Hit Bots"

    I have been hearing about Hit Bots and I guess the term is pretty self explanatory. A program that generates hits by using automated bots... But how do you detect if somebody is sending bots to your site?

    Also, I have been noticing on my Google Analytics that I am getting one visit per key word phrase. Some phrases are actually ridiculous and I find it hard to believe that somebody found my site using these keywords. Is this a bot? Is Google Crawling itself? Thanks to any answers you may bring to the table.

  2. #2
    LifestyleAmateurs.com nation-x's Avatar
    Join Date
    Oct 2005
    Location
    Rock Hill, SC
    Posts
    8,779

    Default

    Hitbot Detector Instructions:

    Go to the kitchen and get some Aluminum Foil
    Make yourself an antenna like this:

    login to your trade script and refresh every 10 seconds until you see the antenna start to vibrate

  3. #3
    In hypno-vision morient's Avatar
    Join Date
    Nov 2007
    Location
    The penis of America
    Posts
    1,431

    Default

    Quote Originally Posted by prono View Post
    I have been hearing about Hit Bots and I guess the term is pretty self explanatory. A program that generates hits by using automated bots... But how do you detect if somebody is sending bots to your site?

    Also, I have been noticing on my Google Analytics that I am getting one visit per key word phrase. Some phrases are actually ridiculous and I find it hard to believe that somebody found my site using these keywords. Is this a bot? Is Google Crawling itself? Thanks to any answers you may bring to the table.
    One way is to set up an invisible link on your site that the visitors can't see or click on and see if it gets clicked anyway. Check the site/ip that click was generated by and then block them.

  4. #4
    I love AskDamageX.com prono's Avatar
    Join Date
    May 2009
    Location
    South Bitch Miami, Fla
    Posts
    94

    Default thanks

    Quote Originally Posted by morient View Post
    One way is to set up an invisible link on your site that the visitors can't see or click on and see if it gets clicked anyway. Check the site/ip that click was generated by and then block them.
    Wow, good trick... I was twisting up my foil until I read this..

  5. #5
    Jack-of-all-trades JACOBKELL's Avatar
    Join Date
    Dec 2006
    Location
    In motion
    Posts
    4,565

    Default

    I will tell you what aheib told me:use common sense.And it working as a charm.

  6. #6
    Who does #2 work for? AmateurFlix's Avatar
    Join Date
    Oct 2005
    Posts
    13,922

    Default

    Quote Originally Posted by prono View Post
    Also, I have been noticing on my Google Analytics that I am getting one visit per key word phrase. Some phrases are actually ridiculous and I find it hard to believe that somebody found my site using these keywords. Is this a bot? Is Google Crawling itself? Thanks to any answers you may bring to the table.
    they're probably real searches. the idea of using a hitbot is generally to credit the referring site as having sent a hit, so they'll typically come from trades, not from search engine referrers.

    people search for lots of really weird phrases.

  7. #7
    I love AskDamageX.com prono's Avatar
    Join Date
    May 2009
    Location
    South Bitch Miami, Fla
    Posts
    94

    Default thanks

    Quote Originally Posted by JACOBKELL View Post
    I will tell you what aheib told me:use common sense.And it working as a charm.
    What is that? Common sense is not always so common. That's what the magic eight ball has told me...



    Quote Originally Posted by AmateurFlix View Post
    they're probably real searches. the idea of using a hitbot is generally to credit the referring site as having sent a hit, so they'll typically come from trades, not from search engine referrers.

    people search for lots of really weird phrases.
    Thanks for your reply,

    But I think you may have misunderstood my question, or maybe I did not phrase it correctly. I was not implying that people are using hit bots to use google to access my site. These were not to be confused with one another. Two separate questions..

    1st question, how to detect hit bots
    2nd question, does google use bots to crawl itself

  8. #8
    In hypno-vision morient's Avatar
    Join Date
    Nov 2007
    Location
    The penis of America
    Posts
    1,431

    Default

    Quote Originally Posted by prono View Post
    2nd question, does google use bots to crawl itself
    Google uses a bot to scan your sites... as do all other SE's.

    When you block bots make sure they aren't the "useful" ones.

  9. #9
    D'oh!! willwank's Avatar
    Join Date
    Oct 2006
    Location
    Hamilton, ON
    Posts
    4,077

    Default

    Quote Originally Posted by nation-x View Post
    Hitbot Detector Instructions:

    Go to the kitchen and get some Aluminum Foil
    Make yourself an antenna like this:

    login to your trade script and refresh every 10 seconds until you see the antenna start to vibrate
    I would personally used a Maxtor array as a conductor since Chinese bots travels such a long distance and have weaker signals, but thats just me
    "If you put a thing in the center of your life, who lacks power to nourish, it will eventually destroy you, and everything you are"
    Oh btw, it's come full circle. Node/V8 - Low Level Server Side JavaScript. Benchmarked here - libs & packages here - read up here
    Other stuff:: Textpattern CMS | Vim | Douglas Crockford | GT.M db |@willwankman | 437654594

  10. #10
    Jack-of-all-trades JACOBKELL's Avatar
    Join Date
    Dec 2006
    Location
    In motion
    Posts
    4,565

    Default

    Quote Originally Posted by prono View Post
    What is that? Common sense is not always so common. That's what the magic eight ball has told me...
    Yes you are correct

  11. #11
    Who does #2 work for? AmateurFlix's Avatar
    Join Date
    Oct 2005
    Posts
    13,922

    Default

    1. there are some scripts with prepackaged hitbot detection methods. they work because no one but the author knows how they work. if you find a way to detect hitbots effectively, then you shouldn't post about it either because someone will write a hitbot to defeat your methods. except for some basic things like the hidden link technique mentioned, you won't find much information about this topic, and for good reason. making it easier for the average webmaster to figure this stuff out would make it much harder for everyone else to detect hitbots. it's a catch 22.

    2. AFAIK, googlebot would not send a referer containing search terms to your site. I doubt if google crawls itself. It is entirely possible for someone else to write a spider which would crawl google's search results and end up on your site, although I doubt if that is happening with any significant frequency for many terms on your one site.

  12. #12
    I love AskDamageX.com prono's Avatar
    Join Date
    May 2009
    Location
    South Bitch Miami, Fla
    Posts
    94

    Default Once again

    Quote Originally Posted by morient View Post
    Google uses a bot to scan your sites... as do all other SE's.

    When you block bots make sure they aren't the "useful" ones.
    Again, I will try to rephrase this so it is better understood.

    Does google send out those little invisible guys called bots to google.com... Wait, I may have lost somebody there so lets back up....

    Google bots go here ----> Google
    Google bots crawl Google
    Yes or No?

    I know that SE bots crawl my site but do they crawl themselves? Does Google crawl Google, Does yahoo crawl yahoo, does msn crawl msn?


    Quote Originally Posted by AmateurFlix View Post
    2. AFAIK, googlebot would not send a referer containing search terms to your site. I doubt if google crawls itself. It is entirely possible for someone else to write a spider which would crawl google's search results and end up on your site, although I doubt if that is happening with any significant frequency for many terms on your one site.
    Ok, I see you understood what I was getting at.. Thanks for your reply...
    Last edited by prono; June 4th, 2009 at 10:02 AM.

  13. #13
    D'oh!! willwank's Avatar
    Join Date
    Oct 2006
    Location
    Hamilton, ON
    Posts
    4,077

    Default

    Quote Originally Posted by AmateurFlix View Post
    1. there are some scripts with prepackaged hitbot detection methods. they work because no one but the author knows how they work. if you find a way to detect hitbots effectively, then you shouldn't post about it either because someone will write a hitbot to defeat your methods. except for some basic things like the hidden link technique mentioned, you won't find much information about this topic, and for good reason. making it easier for the average webmaster to figure this stuff out would make it much harder for everyone else to detect hitbots. it's a catch 22.
    Continous calculation of a trades statistical deviation is an alternative to the "hidden link(s)" approach. Humans tend to be sheeps, bots does not. Not failsafe but used together with common sense its a pretty darn good tool.
    "If you put a thing in the center of your life, who lacks power to nourish, it will eventually destroy you, and everything you are"
    Oh btw, it's come full circle. Node/V8 - Low Level Server Side JavaScript. Benchmarked here - libs & packages here - read up here
    Other stuff:: Textpattern CMS | Vim | Douglas Crockford | GT.M db |@willwankman | 437654594

  14. #14
    Lonewolf Internet Sales Toby's Avatar
    Join Date
    Oct 2006
    Location
    Texas
    Posts
    1,220

    Default

    Quote Originally Posted by willwank View Post
    I would personally used a Maxtor array as a conductor since Chinese bots travels such a long distance and have weaker signals, but thats just me
    Try using titanium foil. It's more expensive and harder to find than aluminum foil, but it will improve your reception about ten fold.

  15. #15
    Uhn Tiss Uhn Tiss Uhn Tis Cyberpod's Avatar
    Join Date
    Apr 2006
    Location
    NE PA
    Posts
    1,601

    Default

    Since we are on the topic of bots and bot like hits, maybe someone else has seen this. I am now blocking 4-5 IP's a week from teen-hot.com for bad hits. I just brought up my draupnir "no ref" logs and here is my top 3 hitters for the last 24 hours. The first two have whois info that shows Germany and Russia and the third is always high in the list and comes back as a Google crawl server.
    I have been blocking these IP's (not the google one) as they show up since they also usually come in at about 50% productivity. Any thoughts as to what might be going on? the IP's change all the time and are usually in different C blocks and countries.

    3965 42.5 % 91.96.193.213
    1053 11.3 % 93.158.151.25
    510 5.5 % 66.249.66.161

  16. #16
    Who does #2 work for? AmateurFlix's Avatar
    Join Date
    Oct 2005
    Posts
    13,922

    Default

    Cyberprod, I just had to block 91.205.124.9 for the same reason. ~10k noref hits, doesn't appear that they were generating any clicks at all.

    googlebot has been going bonkers on my site lately as well, however I suspect that might have something to do with my adding ~50k pages to my domain.

  17. #17
    LifestyleAmateurs.com nation-x's Avatar
    Join Date
    Oct 2005
    Location
    Rock Hill, SC
    Posts
    8,779

    Default

    I wrote a script to detect hitbots... but I decided not to release it... because I didn't want the drama... some of the sites that sent me 100% hitbot traffic are very well known. Another reason I didn't release it is because I think that if I did the cheaters would figure out a way around it anyway... so I have just been using it for my own use. I just had a long phone conversation with another developer who wrote something similar (for a trade script he was planning on releasing) and when he realized that as much as 40% of the traffic coming to his sites was fake... alot from zombie machines infected with trojans. he realized that the only way that his script would be successful would be if he didn't include that feature.... simply because noone could grow a site using the script with that feature in it.

    It comes down to this... do you want to build a large traffic site or combat hitbots? I am here to tell you that a good percentage of the traffic pool on most skim (and probably noskim too) sites is fake traffic. When I started filtering the traffic my network went from 300k/day to 120k/day. It's kind of a no win situation.

  18. #18
    D'oh!! willwank's Avatar
    Join Date
    Oct 2006
    Location
    Hamilton, ON
    Posts
    4,077

    Default

    Quote Originally Posted by nation-x View Post
    .... simply because noone could grow a site using the script with that feature in it.

    It comes down to this... do you want to build a large traffic site or combat hitbots? I am here to tell you that a good percentage of the traffic pool on most skim (and probably noskim too) sites is fake traffic.
    QUOTED FOR TRUTH

    I've opted to combat bots, be extremely selective and seo my sites, and for that my sites grow extremely slow compared to many others. But on the other hand, I challenge anyone with tgps/mgps to beat my signup ratios.
    Last edited by willwank; June 4th, 2009 at 10:35 AM.
    "If you put a thing in the center of your life, who lacks power to nourish, it will eventually destroy you, and everything you are"
    Oh btw, it's come full circle. Node/V8 - Low Level Server Side JavaScript. Benchmarked here - libs & packages here - read up here
    Other stuff:: Textpattern CMS | Vim | Douglas Crockford | GT.M db |@willwankman | 437654594

  19. #19
    Jack-of-all-trades JACOBKELL's Avatar
    Join Date
    Dec 2006
    Location
    In motion
    Posts
    4,565

    Default

    Quote Originally Posted by AmateurFlix View Post
    Cyberprod, I just had to block 91.205.124.9 for the same reason. ~10k noref hits, doesn't appear that they were generating any clicks at all.

    googlebot has been going bonkers on my site lately as well, however I suspect that might have something to do with my adding ~50k pages to my domain.
    According to google 91.205.124.9 is 91.205.124.9
    Bot Yanga WorldSearch Bot v1.1/beta (Yanga – United Kingdom)

  20. #20
    Jack-of-all-trades JACOBKELL's Avatar
    Join Date
    Dec 2006
    Location
    In motion
    Posts
    4,565

    Default

    Quote Originally Posted by nation-x View Post
    I wrote a script to detect hitbots... but I decided not to release it... because I didn't want the drama... some of the sites that sent me 100% hitbot traffic are very well known. Another reason I didn't release it is because I think that if I did the cheaters would figure out a way around it anyway... so I have just been using it for my own use. I just had a long phone conversation with another developer who wrote something similar (for a trade script he was planning on releasing) and when he realized that as much as 40% of the traffic coming to his sites was fake... alot from zombie machines infected with trojans. he realized that the only way that his script would be successful would be if he didn't include that feature.... simply because noone could grow a site using the script with that feature in it.

    It comes down to this... do you want to build a large traffic site or combat hitbots? I am here to tell you that a good percentage of the traffic pool on most skim (and probably noskim too) sites is fake traffic. When I started filtering the traffic my network went from 300k/day to 120k/day. It's kind of a no win situation.
    Yes i noticed that too,for example if you check no img stats on tp you will see how almost every trade have 50% of noimg traffic so imagine what would be if you block or redirect such traffic.Also there are some trades which generates extremely high productivity from bad countries(which is probably infected computers)and you site is 5x bigger thanks to it but i still delete such trades anyway,i don't wanna support cheaters at any cost.

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •