|
|
|||||||||
|
|||||||||
|
|||||||||
| |
||
| |||||||||
![]() |
|
|
«
Previous Thread
|
Next Thread
»
|
Thread Tools | Search this Thread | Rate Thread | Display Modes |
|
#1
|
||||
|
||||
|
robot.txt file
what is robot.txt file and how it benefit for SEO .....
explain in detail |
|
#2
|
|||
|
|||
|
Hi,
Because there are many questions already asked before it could be wise to check out previous postings about robots.txt The quality of this board would go down if the same questions would be repeated over and over again with the same answers given. So please use the search function....and type robots.txt , sure you will find the information to answer your question. regards, hans |
|
#3
|
|||
|
|||
|
Quite right - heard of google?
Yes, and also take the first 3 words of your question and...
http://www.google.com/search?q=what+is+robot.txt&sourceid=mozilla-search&start=0&start=0 |
|
#4
|
||||
|
||||
|
What is the benefit - if you do not want to disallow any of your pages?
I don't currently use a robot.txt file b/c I don't care if it crawls everything. Things the spiders wont get to are password protected.
__________________
RustyBrick Web Development - The Search Engine Roundtable Google Keyword Position Reporting - Advanced Link Analysis - Vonage Internet Phone - Third Party SEO Directory Need 1,000s of links? Free Coop Ad Network |
|
#5
|
||||
|
||||
|
Quote:
there are some reasons to not allow spiders to visit some pages, it could be e.g. if you don't want visitors to find page in SE results but want only home page to be visible in SE, or maybe page contain something which couldn't be good for spider e.g. if you're using anti-SPAM scripts like WPoison and in order to not let legitimate spiders to be trapped in bogus pages you should disallow this pages to be crawled so crawlers which obey robot instructions will not visit them(SPAM Bots probably ignore this rules anyway and would be trapped...). Also you can do this in order to simply save bandwidth(of course it's pretty drastic measure |
|
#6
|
||||
|
||||
|
Quote:
But if I want it to spider every page of mine then why have it? Do I need to disallow a .css or .js page? They wouldn't get anything from it anyway. Let's talk about a fairly small >20 page site with fairly static content. |
|
#7
|
||||
|
||||
|
Rustybrick
Why not just have a robots.txt file that allows all? User-agent: * Disallow: That will acomplish the same thing and the robots will actually find a file on your server when they go looking for it.
__________________
Soli Deo Gloria, Tony |
|
#8
|
|||
|
|||
|
yep - having one that allows all just prevents a 404 error which is worthwhile i suppose. gotta keep those spiders happy!
|
|
#9
|
||||
|
||||
|
Quote:
I have a 404 page. Are search engines happier if you have a robot.txt file that allows everything? Whats the point. Anyone know what is the point for a robot.txt file for a site that wants to be fully crawled? |
|
#10
|
|||
|
|||
|
Quote:
Even IF a search engine should index all....it's wise to have a robots.txt file in the root of your website. First to prevent a 404 error (not only one but many since SE are looking for this file first before indexing your website) ....and to see how many spiders visits are coming to your website, since normal visitors will not look for it !!! Your stats will show it.....!!! regards, hans |
|
#11
|
||||
|
||||
|
thanks
|
![]() |
| Viewing: SEO Chat Forums > Other > HTML Coding > robot.txt file |
| Thread Tools | Search this Thread |
| Display Modes | Rate This Thread |
|
|
|
|