|
|
|||||||||
|
|||||||||
|
|||||||||
| |
||
| |||||||||
![]() |
|
|
«
Previous Thread
|
Next Thread
»
|
Thread Tools | Search this Thread | Rate Thread | Display Modes |
|
|
|
Speeding Shipping, Improving Compliance – Read the ScanCode Systems Case Study |
|
#1
|
|||
|
|||
|
How did the spiders find me?
I have a one page site with no outgoing or incoming links from it or to it. It is a static page that I just put up on my host about 3 weeks ago to see how it looked.
My url is not and has not been written out on any other site, forum, chat room, etc. How the heck did they index my site?????? |
|
#2
|
|||
|
|||
|
Are you entirely sure your site isn't in any backlink anywhere on the web? :confused:
|
|
#3
|
|||
|
|||
|
absolutely positive. When searching for links to my site on Google - no links are found. However, when I type in the name of my site, I find its indexed.
Last edited by BillyA : May 21st, 2007 at 07:29 PM. |
|
#4
|
||||
|
||||
|
No big mystery. Google is not showing your link(s). If you did not submit manually to Google, try a link command on Yahoo and/or MSN. I guarantee, if you are truly indexed, you will find one or more backlinks. You will not find all backlinks using Google.
__________________
SEO Tips for Newbies Beginner's Guide to Search Engine Optimization How to improve your rank in the SERPs Link Building 101 |
|
#5
|
|||
|
|||
|
I've used this to check:
http://www.prsearch.biz/inbx.php and have used the link: function in Google and Yahoo to try to find any links. There are none. However, when I type in the name of my site, the page is indexed. With no links in and no links out, how does Google even know the site is there? The reason I bring it up is that its a download page for sensitive files and I don't want it indexed. I now know I have to include the meta tag robot command to stop the indexing, but since there were no links in or out, I never thought it would even get indexed. The only thing I can think of - could it be my hosting company in some way? Last edited by BillyA : May 21st, 2007 at 08:22 PM. |
|
#6
|
||||
|
||||
|
Personally I agree with SEO_AM.
I have a feeling that Google can sometimes spider a server and log any domains without .htaccess or robots.txt exclusion... but I wouldn't expect them to index a site unless they found links to it somewhere. It could be that your host/server has indexed pages (stats etc) which include links to all domains or it could be something like the domain name was previously used etc. Quite honestly - now you know you're indexed - I don't see any point in trying to second guess the reason Be aware that "the link: command" in Google is no use at all in determining whether a site has backlinks or not... it shows only a random sample of links - thus may show you "0 of 3" !
__________________
ClickyB "The quality of the visitor is more important than the volume".. Egol 22nd Feb 2008
New to SEO? Start Here: SEO FAQForum Rules & Posting Guidelines Last edited by ClickyB : May 21st, 2007 at 08:28 PM. |
|
#7
|
|||
|
|||
|
Yeah, thanks for the responses....
Just a little peeved as its cost me a domain name. |
|
#8
|
||||
|
||||
|
It happens a lot to me. When I buy a domain name which is usually through godaddy or network solutions and then put a hosting account on it it gets spidered, with no links. Never really thought about it but it could have to do with being on shared Ip addresses or from google looking at whois records. or server domain lists
|
|
#9
|
|||
|
|||
|
Another reason for the Indexing could be the hosting service you used. When you registered the domain it could have been listed on the site which would cause a backlink that you can not find out about.
The only way to prevent this from happening from what I have been able to find out is by using the robots.tx and including your site as an excluded page. |
|
#10
|
|||
|
|||
|
For a one page site do I need to have a robot.tx file and list the page to be excluded? Or will the meta tag instruction on that page be fine?
|
|
#11
|
||||
|
||||
|
Quote:
robots.txt is the best option always. Coz though chances are very less, robots meta tag can still be overlooked by the SE spiders. So robots.txt is the best exclusion tool you have
__________________
Social Mobile Messaging SEO FAQs - You might find your answer here. SEOchat Forum Rules - Read Before You Post **Do what you feel in your heart to be right- for you'll be criticized anyway. You'll be damned if you do, and damned if you don't.** |
|
#12
|
||||
|
||||
|
User-agent: *
Disallow: / Thats it save it as robots.txt and upload it to the root of your webserver. Some rouge bots ignore robots.txt though so if the info is very sensitive you may want to lock the folder and put a User Name and Password on it |
|
#13
|
|||
|
|||
|
Thanks everyone! You guys have been a great help!!
|
|
#14
|
|||
|
|||
|
i had the same problem
real hidden webpage ... and the google found it from my google toolbar ... |
|
#15
|
|||
|
|||
|
One more concern though
Okay, so I used robot.txt to have my website crawled. What is interesting is that I find some of the of the robots.txt file has folders that are usually not shown or found like the /admin or /adminonline or /superuser in the "disallow" part.
So, why is it that users do use these folder names here so openly? How can I prevent any body from just seeing it? Its so easy you see, if I type a website with /robot.txt it would simply show me the contents. |
![]() |
| Viewing: SEO Chat Forums > Search Engine Strategies > Search Engine Spiders > How did the spiders find me? |
| Thread Tools | Search this Thread |
| Display Modes | Rate This Thread |