Search Engine Optimization
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
 
 
User Name:
Password:
Remember me
Go Back   SEO Chat ForumsSearch Engine StrategiesSearch Engine Optimization

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread SEO Chat Forums Sponsor:
  #1  
Old January 22nd, 2004, 08:11 AM
neil_rutherford neil_rutherford is offline
Registered User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Nov 2003
Posts: 12 neil_rutherford User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 0
After being spidered....

I'm using the VB forum on a site I'm developing and the spiders having been running mental on the site for a while now... everything is fine apart from the forum links... there are pages that don't need to be indexed, like usercp.php in VB which are getting multiple hits because each URL like has a different session ID in it.... Inktomi seems to be the worst.

I've added the pages that the spiders don't need to index and put them in the robots.txt file.

Because the spiders have been going mental for a few months now, 90% of the existing links are irrelevant pages that all have different URL parameters due to the session ID.

Now that the spiders know about previous links, and now they are being informed to ignore them.

Q: Would the spiders eventually forget about these links never to return on old previously spidered links?

Q: Does the disallow in the robots file inform them to remove the any existing content over time?

Or simply put, what would the net effect be by doing this?

Reply With Quote
  #2  
Old January 30th, 2004, 12:35 AM
requiem requiem is offline
Contributing User
SEO Chat Novice (500 - 999 posts)
 
Join Date: Oct 2003
Posts: 532 requiem User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 12 m 59 sec
Reputation Power: 6
Sessionids are a bad idea when it comes to spiders, it is what we would like to call a spider trap. Because of the sessionid changes with each session, you will have spiders indexing the same content over and over and over again.

The best id is not to serve spiders sessionids or not to pass the session variables
as a part of the URL string. I am not sure if I followed your question about robots.txt so I will not address it.

Reply With Quote
Reply

Viewing: SEO Chat ForumsSearch Engine StrategiesSearch Engine Optimization > After being spidered....


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread 
Rate This Thread:


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

vB code is On
Smilies are On
[IMG] code is On
HTML code is Off
View Your Warnings | New Posts | Latest News | Latest Threads | Shoutbox
Forum Jump


Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
  
 





© 2003-2008 by Developer Shed. All rights reserved. DS Cluster 5 hosted by Hostway
Stay green...Green IT