#1
Robots.txt syntax for wildcards?
I would like to exclude all pages ending in /print.html and all pages with a ? in the url string in my robots.txt file. Is the following syntax correct for Yahoo, Google and MSN?
Disallow: /*print.html$
Disallow: /*?
#2
Quote:
Put these in your robots.txt file:
user-agent: *
disallow: /*?
disallow: /print.html
#3
Marfola, something like that looks like your best bet.
However, personally I'd prefer something like this in the <head> section of my pages: PHP Code:
That would also get rid of files that have already (but inadvertently) been indexed. You could even make those files 301-redirect to their indexable equivalents.
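The PHP snippet from this post did not survive, so the following is not dzine's code. It is only a minimal Python sketch of the general idea, assuming the snippet emitted a robots meta tag with `noindex` for print pages and URLs containing a `?`. The function name `robots_meta_tag` and the exact tag content are hypothetical.

```python
from urllib.parse import urlsplit

# Assumed tag: tells crawlers not to index the page but still follow its links.
NOINDEX_TAG = '<meta name="robots" content="noindex, follow">'


def robots_meta_tag(url: str) -> str:
    """Return a noindex meta tag for print pages and URLs with a query string.

    Returns an empty string for every other URL, so the page stays indexable.
    """
    path = urlsplit(url).path
    is_print_page = path.endswith("/print.html")
    has_query = "?" in url
    return NOINDEX_TAG if (is_print_page or has_query) else ""


if __name__ == "__main__":
    for url in ("/articles/print.html", "/page?id=3", "/articles/index.html"):
        print(url, "->", robots_meta_tag(url) or "(no tag)")
```

One thing to keep in mind with this approach: a crawler has to be able to fetch the page to see the meta tag, so the same URLs should not also be blocked in robots.txt.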
Last edited by dzine : May 12th, 2008 at 06:43 AM.
#4
I think it should be:
user-agent: *
disallow: /*?
disallow: /*print.html
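To see how the rules suggested in this thread differ, here is a rough Python sketch of wildcard matching as Google documents it: `*` matches any run of characters, a trailing `$` anchors the end of the URL, and everything else is a prefix match. The helper names are made up for illustration, and other crawlers may not behave the same way, so treat it as a way to reason about the patterns rather than a guarantee.

```python
import re


def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumes Google-style semantics: '*' is a wildcard, a trailing '$'
    anchors the end of the URL, otherwise the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_blocked(path: str, patterns: list) -> bool:
    """True if any Disallow pattern matches the given URL path."""
    return any(robots_pattern_to_regex(p).search(path) for p in patterns)


if __name__ == "__main__":
    urls = ["/print.html", "/articles/print.html", "/page?id=3", "/articles/"]
    for rule in ("/print.html", "/*print.html", "/*print.html$", "/*?"):
        hits = [u for u in urls if is_blocked(u, [rule])]
        print(f"Disallow: {rule:<16} blocks {hits}")
```

Under these assumptions, `/print.html` only covers a print page at the site root, while `/*print.html` also covers print pages in subdirectories. Adding `$` stops the rule from also matching longer URLs that merely contain `print.html` followed by more characters.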