Search Engines - Classic
 
Forums: » Register « |  User CP |  Games |  Calendar |  Members |  FAQs |  Sitemap |  Support | 
 
 
User Name:
Password:
Remember me
Go Back   SEO Chat ForumsSearch EnginesSearch Engines - Classic

Reply
Add This Thread To:
  Del.icio.us   Digg   Google   Spurl   Blink   Furl   Simpy   Y! MyWeb 
Thread Tools Search this Thread Rate Thread Display Modes
 
Unread SEO Chat Forums Sponsor:
  #1  
Old April 1st, 2003, 03:34 PM
Blake Blake is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Posts: 89 Blake User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 6
STUPID Altavista


[Tue Apr 1 14:08:14 2003] [error] [client 216.39.48.30] File does not exist: /home/dnware/public_html/news/archive/feb/htt

[Tue Apr 1 13:39:05 2003] [error] [client 216.39.48.1] File does not exist: /home/dnware/public_html/news/archive/dec/htt

[Tue Apr 1 11:46:22 2003] [error] [client 216.39.48.30] File does not exist: /home/dnware/public_html/news/archive/articles/pressrelease02062003.php

[Tue Apr 1 09:40:30 2003] [error] [client 216.39.48.61] File does not exist: /home/dnware/public_html/products/databas

[Tue Apr 1 01:28:41 2003] [error] [client 216.39.48.30] File does not exist: /home/dnware/public_html/products/databas

[Tue Apr 1 01:10:48 2003] [error] [client 216.39.48.61] File does not exist: /home/dnware/public_html/news/archive/articles/htt

[Mon Mar 31 22:44:11 2003] [error] [client 216.39.48.61] File does not exist: /home/dnware/public_html/company/htt

[Mon Mar 31 19:05:56 2003] [error] [client 216.39.48.30] File does not exist: /home/dnware/public_html/news/archive/dec/htt

Reply With Quote
  #2  
Old April 1st, 2003, 07:53 PM
theBear's Avatar
theBear theBear is offline
Contributing User
SEO Chat Novice (500 - 999 posts)
 
Join Date: Mar 2003
Location: Maine USA
Posts: 524 theBear User rank is Private First Class (20 - 50 Reputation Level)theBear User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 25 m 38 sec
Reputation Power: 6
Send a message via AIM to theBear
Re: STUPID Altavista

Yep, they can't even parse out all the links to spider or are you talking about Scooter it has the same problem. Wait till you see grub.

Cheers,
__________________
theBear

Reply With Quote
  #3  
Old April 2nd, 2003, 02:52 AM
stuijts's Avatar
stuijts stuijts is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Location: Kerken, Germany
Posts: 115 stuijts User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 m 10 sec
Reputation Power: 6
What does this mean? (and what means "grub"?)

I wonder, if it also could apply to a problem I have.

I have my site bloggitt.de also submitted through the Express Inclusion, and since two weeks (so from the beginning) they give me error 700. They also say, the problem is on my side - but I cannot figure out what could be wrong.

What especially wonders me, is the "normal" crawling from other spiders...
__________________
Birthe

Reply With Quote
  #4  
Old April 2nd, 2003, 03:48 PM
theBear's Avatar
theBear theBear is offline
Contributing User
SEO Chat Novice (500 - 999 posts)
 
Join Date: Mar 2003
Location: Maine USA
Posts: 524 theBear User rank is Private First Class (20 - 50 Reputation Level)theBear User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 25 m 38 sec
Reputation Power: 6
Send a message via AIM to theBear
Quote:
Originally posted by "stuijts"

What does this mean? (and what means "grub"?)

I wonder, if it also could apply to a problem I have.

I have my site bloggitt.de also submitted through the Express Inclusion, and since two weeks (so from the beginning) they give me error 700. They also say, the problem is on my side - but I cannot figure out what could be wrong.

What especially wonders me, is the "normal" crawling from other spiders...


I'll answer the grub question first---- grub is yet another web crawling robot (243 clients running - crawling 18,958,378 URLs in the last 24 hours). Grub has joined forces with LookSmart to take community web crawling to the next level. Our mission to eventually crawl and assemble the latest state information for every document on the Internet remains unchanged.

There are still issues to be worked out. Folks this puppy should prove interesting.

http://www.grub.org

Now to help with those other questions could you show us the log entries for the crawler that is having problems with your site? A number of crawlers aren't too smart, in fact some are down right stupid.

Cheers,

Reply With Quote
  #5  
Old April 2nd, 2003, 04:19 PM
stuijts's Avatar
stuijts stuijts is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Location: Kerken, Germany
Posts: 115 stuijts User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 m 10 sec
Reputation Power: 6
First thanks for the info about grub. Thats interesting!

On my log entries I find following a lot the last couple of days:
216.39.48.91 bloggitt.de - [02/Apr/2003:05:28:12 +0200] "GET /robots.txt HTTP/1.1" 200 37 "-" "Scooter/3.2"
216.39.48.91 bloggitt.de - [02/Apr/2003:05:28:17 +0200] "GET /gebloggitt70.html HTTP/1.1" 200 7366 "-" "Scooter/3.2"

This must be the regular Scooter, as (to my info) the IP for the Express Submission is 216.39.50.xxx

Nothing to be found on the error log.

From the Express Submission I get each morning the message, the site couldn't be spidered because of Error 700. Once they meant, it's because of the ISO-Code I defined on the top. So I deleted it to test - but no result. The server is not blocking their IP.
I really can't figure out why - especially as the regular Scooter seems to have no problems! (no results to be found on AV, but I guess that still takes some time)

Quote:
A number of crawlers aren't too smart, in fact some are down right stupid.

That's what I am beginning to believe reg. the AV EI!

Reg,
Birthe

Reply With Quote
  #6  
Old April 2nd, 2003, 04:45 PM
theBear's Avatar
theBear theBear is offline
Contributing User
SEO Chat Novice (500 - 999 posts)
 
Join Date: Mar 2003
Location: Maine USA
Posts: 524 theBear User rank is Private First Class (20 - 50 Reputation Level)theBear User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 25 m 38 sec
Reputation Power: 6
Send a message via AIM to theBear
Quote:
Originally posted by "stuijts"

First thanks for the info about grub. Thats interesting!

On my log entries I find following a lot the last couple of days:
216.39.48.91 bloggitt.de - [02/Apr/2003:05:28:12 +0200] "GET /robots.txt HTTP/1.1" 200 37 "-" "Scooter/3.2"
216.39.48.91 bloggitt.de - [02/Apr/2003:05:28:17 +0200] "GET /gebloggitt70.html HTTP/1.1" 200 7366 "-" "Scooter/3.2"

This must be the regular Scooter, as (to my info) the IP for the Express Submission is 216.39.50.xxx

Nothing to be found on the error log.

From the Express Submission I get each morning the message, the site couldn't be spidered because of Error 700. Once they meant, it's because of the ISO-Code I defined on the top. So I deleted it to test - but no result. The server is not blocking their IP.
I really can't figure out why - especially as the regular Scooter seems to have no problems! (no results to be found on AV, but I guess that still takes some time)

Quote:
A number of crawlers aren't too smart, in fact some are down right stupid.

That's what I am beginning to believe reg. the AV EI!

Reg,
Birthe



Scooter has a few problems it doesn't properly parse out the links for retrieval. It results in 404 errors because the page it tries to get isn't a real page Scooter is droping the last part of the urls in the link.

That is why this happens ...

Tue Apr 1 01:10:48 2003] [error] [client 216.39.48.61] File does not exist: /home/dnware/public_html/news/archive/articles/htt

and

[Tue Apr 1 03:18:35 2003] [error] [client 216.39.48.102] File does not exist: /home/virtual/site8/fst/var/www/html/Cata

The last one should read:

/home/virtual/site8/fst/var/www/html/Catalog/

What is an error 700 ???? I haven't run into that one.

Cheers,

Reply With Quote
  #7  
Old April 2nd, 2003, 05:26 PM
stuijts's Avatar
stuijts stuijts is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Location: Kerken, Germany
Posts: 115 stuijts User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 m 10 sec
Reputation Power: 6
Error 700 ist driving me crazy

Their first explanation:

Quote:
Technical situations that block indexing include, but are not limited to:
1) Indexing disallowed or Access Forbidden by server, meta robot tag or robot.txt file
2) Server sided redirect: url location change
3) Host Server busy or non-responsive
4) Problems with DNS (domain name server)
5) Page Requires Password entry or cookie acceptance

Reply With Quote
  #8  
Old April 2nd, 2003, 06:09 PM
theBear's Avatar
theBear theBear is offline
Contributing User
SEO Chat Novice (500 - 999 posts)
 
Join Date: Mar 2003
Location: Maine USA
Posts: 524 theBear User rank is Private First Class (20 - 50 Reputation Level)theBear User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 25 m 38 sec
Reputation Power: 6
Send a message via AIM to theBear
Quote:
Originally posted by "stuijts"

Error 700 ist driving me crazy

Their first explanation:

Quote:
Technical situations that block indexing include, but are not limited to:
1) Indexing disallowed or Access Forbidden by server, meta robot tag or robot.txt file
2) Server sided redirect: url location change
3) Host Server busy or non-responsive
4) Problems with DNS (domain name server)
5) Page Requires Password entry or cookie acceptance


Do you have a log entry of the bot hiting your server?

I had no trouble hiting your site.

Cheers,

Reply With Quote
  #9  
Old April 2nd, 2003, 06:21 PM
stuijts's Avatar
stuijts stuijts is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Location: Kerken, Germany
Posts: 115 stuijts User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 m 10 sec
Reputation Power: 6
Nope.
Apart from the ones I copied above.

Reply With Quote
  #10  
Old April 16th, 2003, 12:09 PM
piel's Avatar
piel piel is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Posts: 328 piel User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 3 h 51 m 38 sec
Reputation Power: 6
does anyone even care about AV anymore? It seems to me like 1/2 their results are just garbage anyway
__________________
Got Yoga?

Reply With Quote
  #11  
Old April 16th, 2003, 12:15 PM
stuijts's Avatar
stuijts stuijts is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Location: Kerken, Germany
Posts: 115 stuijts User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 1 m 10 sec
Reputation Power: 6
I don't know. I quite like their combined suggestions - and I expected that they would become more important after their relaunch.

BUT: my last 4 weeks with AV really annoyed me, and although Scooter visits me often - I'm still not listed. And reg. my paid listing: they only got me once, now I receive 1201-Errors: the site is already listed, but couldn't be updated.

If they realy want to make a chance, in my opinion they have to do a lot of work to keep up with their competition.

Reply With Quote
  #12  
Old April 16th, 2003, 08:13 PM
Phoenix's Avatar
Phoenix Phoenix is offline
Contributing User
SEO Chat Beginner (1000 - 1499 posts)
 
Join Date: Jan 2003
Location: Texas!
Posts: 1,135 Phoenix User rank is Corporal (100 - 500 Reputation Level)Phoenix User rank is Corporal (100 - 500 Reputation Level)Phoenix User rank is Corporal (100 - 500 Reputation Level)Phoenix User rank is Corporal (100 - 500 Reputation Level) 
Time spent in forums: 3 Days 9 h 4 m 3 sec
Reputation Power: 8
Send a message via AIM to Phoenix
Birthe,

I do agree, in my book AV is toast. No meat just fluff. Aren't impressed with search engine results either despite recent updates. On every occasion I have paid to submit to Altavista, it hasn't resulted more than a few hits a month. Averaging it out, its about 10.00 a click.
I declined to submit to AV for my education site, which is crawled by a good majority of the major engines out there. Figured it wasn't worth the money.
BUT, I am excited to see the developments of GRUB. I have read that they plan to implement a toolbar so that its a combined community shared crawling. Might give some engines a run for there money at the combined power of the crawling. But I already think of somethings to prevert the use of this toolbar. It comes down to time and money.

Good luck with Av crawling your site.
Haven't even heard of 700 errors. Sorry I couldn't be more assistance.

Ben

Reply With Quote
  #13  
Old April 16th, 2003, 09:06 PM
theBear's Avatar
theBear theBear is offline
Contributing User
SEO Chat Novice (500 - 999 posts)
 
Join Date: Mar 2003
Location: Maine USA
Posts: 524 theBear User rank is Private First Class (20 - 50 Reputation Level)theBear User rank is Private First Class (20 - 50 Reputation Level) 
Time spent in forums: 25 m 38 sec
Reputation Power: 6
Send a message via AIM to theBear
Grub is going to face some major problems.

Scope of problem type problems just like every other SE.

Cheers,

Reply With Quote
  #14  
Old April 17th, 2003, 09:40 AM
piel's Avatar
piel piel is offline
Contributing User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Mar 2003
Posts: 328 piel User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: 3 h 51 m 38 sec
Reputation Power: 6
Quote:
Originally posted by "stuijts"

If they realy want to make a chance, in my opinion they have to do a lot of work to keep up with their competition.


keyword: a LOT...AV is not even a 2nd tier engine IMHO anymore. Who uses them really other than maybe for the Babblefish? Actually this brings up a question: what is the breakdown of internet users' use of search engines? meaning google holds what market share, vs the others?

Reply With Quote
  #15  
Old April 22nd, 2003, 10:39 AM
Victor Victor is offline
Registered User
SEO Chat Newbie (0 - 499 posts)
 
Join Date: Apr 2003
Posts: 8 Victor User rank is Just a Lowly Private (1 - 20 Reputation Level) 
Time spent in forums: < 1 sec
Reputation Power: 0
I don't waste my time with AltaVista.

Most of their SERP are irrelevant and who uses them anyways?

With any luck Overture will whip them back into shape but I would not count on this anytime soon.

Reply With Quote
Reply

Viewing: SEO Chat ForumsSearch EnginesSearch Engines - Classic > STUPID Altavista


Thread Tools  Search this Thread 
Search this Thread:

Advanced Search
Display Modes  Rate This Thread