Discuss Disallow in Robots.txt VS NoIndex tag in the Search Engine Optimization forum on SEO Chat. Disallow in Robots.txt VS NoIndex tag Search Engine Optimization forum discussing general tips and tricks to optimize your website for the search engines as a white hat or black hat. Find ideas to fine tune your website for top rankings using keywords, meta tags, SEF, and more.
Our weekly Search Engine newsletter covers the constantly evolving world of search engine optimization like no one else. To stay abreast of all the latestest news, reviews, and how-to's, subscribe today!
BuySellLinks.com is a Text Link Advertising Brokerage by the founder of LinkAdage that specializes in high end quality text links and a controlled membership base. To quickly build inventory we are paying webmasters who add quality domains that we can accepted into our inventory. We also offer free installation.
Tired of dealing with shady text link buyers and/or sellers? Try BuySellLinks.
This month's featured article takes a peek at Google Adsense and teaches you everything you need to know to properly maintain your account and get the most bang for your buck.
Posts: 73
Time spent in forums: 20 h 28 m 42 sec
Reputation Power: 4
Disallow in Robots.txt VS NoIndex tag
I have a page on my site that I do not want indexed and want to know the most effective way to go about this. Should I put the NoIndex tag in the HTML of the page, or should I disallow the page in my Robots.txt file. I remember hearing that one strategy is more effective than the other. What are your thoughts?
Posts: 465
Time spent in forums: 3 Days 21 h 59 m 40 sec
Reputation Power: 55
robots.txt will work just fine. Actually, I don't see a reason to not using it.
In theory, you can even save a bit of bandwidth, because a bot will need to load a page before it gets access to meta, while robots.txt is processed without docs being opened at all. Not a big deal, though.
And still both methods do not guarantee that a file won't be actually crawled. They merely tell bots to not do this, so some not so civilized bots still may ignore these instructions. If you really want to block a file from being opened, use rewriterule in your apache conf.
Posts: 527
Time spent in forums: 1 Week 10 h 10 m 54 sec
Reputation Power: 33
Quote:
Originally Posted by ROGERKLOTZ
Should I put the NoIndex tag in the HTML of the page, or should I disallow the page in my Robots.txt file.
The "noindex" attribute and the "disallow" directive do different things.
1. "noindex" attribute: Google robots will crawl the page and see that you do not want it to be indexed. Google will not show the page in the SERP's.
2. "disallow" directive: Google robots will not crawl the page. The URL of the page will be indexed by Google, but not its content (as Google did not crawl the page). In some particular circumstances, the URL of the page could appear in the SERP's.
So the "noindex" attribute is the safest solution. If the page should not be seen by unauthorized persons, you better use a user/password authentication system to give access to the page rather to rely on search engines.
Jean-Luc
__________________ AWStats Support : add-on's, extra sections, forum, installation assistance + Professional AWStats Services 200ok.eu Broken Link Checkerfinds 404 errors, error pages with 200 ok status, missing images, protocol errors, password protected pages, bad domain names, redirect loops, parking pages and more