#1
  1. No Profile Picture
    Registered User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Aug 2005
    Location
    Hungary
    Posts
    20
    Rep Power
    0

    Mysterious links crawled by Google and sitemap builders


    Hi, this is not really a problem, but it is an interesting issue.
    When I crawled my site (pyrocenter.hu) with a sitemap builder called GSiteCrawler, it turned up some really strange URLs, like http://pyrocenter.hu/forum/mnt/disk0-SAMSUNG_SP0802N-part6/Home/Hp/hpim3245 , linked from http://pyrocenter.hu/forum/forum.php?open=3&page=14
    The strange thing is that the URL exposes part of the server's internal filesystem, including the model of the hard disk! The link mentioned is, BTW, a _blank link to another site...
    These links aren't included in the sitemap; GSiteCrawler leaves them out.
    But Google also crawled my site and reported a URL with an HTTP error: http://pyrocenter.hu/%5C'news/news.php?open=133
    That's really funny. All links pointing to pages in the news section are dynamically generated and are, of course, valid links. I don't know how Google got such a URL from my page, and I don't know how GSiteCrawler got its strange URL either, but I suspect both come from the same unknown problem.
    BTW, spider simulators also sometimes return such URLs, but never in the same place and never the same URL, so it must be some strange server error.
    Does anyone else have such paranormal links? :-) Or am I the only one who has to deal with UFOs? :-)
  2. #2
  3. No Profile Picture
    Contributing User
    SEO Chat Discoverer (100 - 499 posts)

    Join Date
    Aug 2005
    Location
    down by the seside.net
    Posts
    153
    Rep Power
    14
    What planet are you hosting from? Perhaps it's the radiation :-)

    Just kidding :-) If you notice strange things with the GSiteCrawler, send me a mail; it's usually easy and fast to clear up, and the things I can't find you can post here (or in a conspiracy-theory forum) :-)

    Regarding your page, it includes the following:
    <img src="mnt/disk0-SAMSUNG_SP0802N-part6/Home/Hp/hpim3245">

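    Since Google now accepts all kinds of files, the GSiteCrawler picked up this image as well. The src is a relative path, so a crawler resolves it against the URL of the forum page, which gives the /forum/mnt/... address you saw. I'm not sure whether it should be there, but in any case it's on your site.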

    Regarding your error link, I'm just guessing that it should be http://pyrocenter.hu/news/news.php?open=133 and perhaps you have an incorrect link somewhere :-).

    Crawlers like the GSiteCrawler are really good at helping to find these types of mistakes. It's not as if your site will get thrown out of Google for them, but it's good practice to clean them up anyway.
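
Both observations in the reply above can be reproduced with Python's standard urllib.parse module. This is only a sketch of what a crawler is likely doing, not a statement about how the GSiteCrawler or Googlebot is actually implemented: a relative src is resolved against the page URL, and the %5C' in the broken link is just the percent-encoded form of a backslash followed by a quote.

```python
from urllib.parse import urljoin, unquote

# Page and image source exactly as quoted in the thread.
page_url = "http://pyrocenter.hu/forum/forum.php?open=3&page=14"
img_src = "mnt/disk0-SAMSUNG_SP0802N-part6/Home/Hp/hpim3245"

# A relative reference replaces the last path segment of the page URL,
# so the image ends up "inside" the /forum/ directory.
resolved = urljoin(page_url, img_src)
print(resolved)

# The URL Google reported. %5C is a percent-encoded backslash.
reported = "http://pyrocenter.hu/%5C'news/news.php?open=133"
decoded = unquote(reported)
print(decoded)
```

The decoded path starts with \' before news/, which looks like an escaped quote that leaked into a link. That would fit a quoting slip in whatever generates the markup, but it is a guess; the thread never shows the offending markup.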
  4. #3
  5. No Profile Picture
    Registered User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Aug 2005
    Location
    Hungary
    Posts
    20
    Rep Power
    0
    Thanks, I think I must get a hosting service in another galaxy :-).
    Really, this image is there, but I don't know how it got there; it really shouldn't happen... But I can't find the news link anywhere... Maybe aliens have manipulated my browser :-).
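
For a link that cannot be found by eye, one option is to scan the generated HTML for suspicious href and src values. The following is a minimal sketch using only the Python standard library. The LinkAuditor class, the audit helper, and the two heuristics (a stray backslash or quote, and an image source without a file extension) are assumptions made for illustration. The sample markup is also hypothetical, apart from the img tag quoted earlier in the thread.

```python
import posixpath
from html.parser import HTMLParser
from urllib.parse import urlsplit


class LinkAuditor(HTMLParser):
    """Collect href/src attribute values that look suspicious."""

    def __init__(self):
        super().__init__()
        self.findings = []  # list of (tag, attribute value, reason)

    def handle_starttag(self, tag, attrs):
        for name, value in attrs:
            if name not in ("href", "src") or not value:
                continue
            # Heuristic 1: a backslash or quote has leaked into the URL.
            if "\\" in value or "'" in value or "%5c" in value.lower():
                self.findings.append((tag, value, "stray backslash or quote"))
            # Heuristic 2: an image whose path has no file extension.
            elif tag == "img" and not posixpath.splitext(urlsplit(value).path)[1]:
                self.findings.append((tag, value, "image source without a file extension"))


def audit(html):
    """Return the suspicious links found in an HTML string."""
    auditor = LinkAuditor()
    auditor.feed(html)
    auditor.close()
    return auditor.findings


# Hypothetical sample page: only the <img> tag comes from the thread.
SAMPLE = r"""
<img src="mnt/disk0-SAMSUNG_SP0802N-part6/Home/Hp/hpim3245">
<a href="\'news/news.php?open=133">broken news link</a>
<a href="news/news.php?open=133">valid news link</a>
"""

findings = audit(SAMPLE)
for tag, value, reason in findings:
    print(f"<{tag}> {value!r}: {reason}")
```

Running this over the saved output of each dynamically generated page (rather than the source templates) is the point of the exercise, since a link that is built at request time only appears in the rendered HTML.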
