
January 22nd, 2004, 08:11 AM
|
|
Registered User
|
|
Join Date: Nov 2003
Posts: 12
Time spent in forums: < 1 sec
Reputation Power: 0
|
|
|
After being spidered....
I'm using the VB forum on a site I'm developing and the spiders having been running mental on the site for a while now... everything is fine apart from the forum links... there are pages that don't need to be indexed, like usercp.php in VB which are getting multiple hits because each URL like has a different session ID in it.... Inktomi seems to be the worst.
I've added the pages that the spiders don't need to index and put them in the robots.txt file.
Because the spiders have been going mental for a few months now, 90% of the existing links are irrelevant pages that all have different URL parameters due to the session ID.
Now that the spiders know about previous links, and now they are being informed to ignore them.
Q: Would the spiders eventually forget about these links never to return on old previously spidered links?
Q: Does the disallow in the robots file inform them to remove the any existing content over time?
Or simply put, what would the net effect be by doing this?
|