#1
  1. Around...The
    SEO Chat Discoverer (100 - 499 posts)

    Join Date
    Jun 2004
    Location
    Amsterdam
    Posts
    456
    Rep Power
    16

    Check multiple pages for penalization


    Would like to check 7000+ pages to see if they are penalized/de-indexed by Google.

    There is the possibility to check the PR of those pages by the 100 and disrecard the ones with PR, but that would probably still leave 1000+ pages to check otherwise.

    Any ideas?
  2. #2
  3. Contributing User
    SEO Chat Adventurer (500 - 999 posts)

    Join Date
    Jul 2005
    Location
    Canada
    Posts
    762
    Rep Power
    23
    What are you looking for?

    Of course site:domain.com will tell you how many pages are indexed. So you could figure out how many are not indexed. But not being indexed is not necessarily a sign of being penalized. Pages are dropped for many other reasons like duplicate titles, lack of content, too little content, not relevant ...

    You could also check PR but not all pages will have rank assigned depending on your link structure ... and they have to be indexed (see above).
  4. #3
  5. Around...The
    SEO Chat Discoverer (100 - 499 posts)

    Join Date
    Jun 2004
    Location
    Amsterdam
    Posts
    456
    Rep Power
    16
    Well basically I don't want to link out to any page that could be considered penalized/banned/delisted/in-bad-neighborhood, regardless of the reason why.

    My assumptions are that if a page has PR and/or is indexed it is ok to link to. Therefore the first check is if those pages have PR and further analyze those without PR.

    For those page without PR I would like to test if they are indexed. Is there a possibility to check multiple pages to see if they are indexed. Like a info:http://www.domain.com/page.html, but then a bulk check.

    I know this isn't a fool prove method, but it is the best I can come up...
  6. #4
  7. Contributing User
    SEO Chat Adventurer (500 - 999 posts)

    Join Date
    Jul 2005
    Location
    Canada
    Posts
    762
    Rep Power
    23
    I'm sorry ... are these pages on your site?

    Or are you trying to validate link partners?
  8. #5
  9. Around...The
    SEO Chat Discoverer (100 - 499 posts)

    Join Date
    Jun 2004
    Location
    Amsterdam
    Posts
    456
    Rep Power
    16
    Should have been more clear, sorry as well...

    Am trying to validate link partners.
  10. #6
  11. Contributing User
    SEO Chat Good Citizen (1000 - 1499 posts)

    Join Date
    Apr 2005
    Posts
    1,473
    Rep Power
    19
    Hmmm... Why not you try http://www.bad-neighborhood.com ?
  12. #7
  13. Around...The
    SEO Chat Discoverer (100 - 499 posts)

    Join Date
    Jun 2004
    Location
    Amsterdam
    Posts
    456
    Rep Power
    16
    Sounds like an interesting idea, but it seems that whenever the word sex or casino is mentioned anywhere it is considered bad. Even a brand new site with a brand new page with information about a educational sex book is considered bad. Just as a regular link to Las Vegas casino's.

    Doesn't seem like it is giving the right information, but thanks for the thought!
  14. #8
  15. Mr. Goober Guy ;)
    SEO Chat Good Citizen (1000 - 1499 posts)

    Join Date
    Aug 2004
    Location
    Tampa, Florida
    Posts
    1,320
    Rep Power
    24
    You can sign up for http://www.uptimebot.com/ - once signed up, you can use the Page Rank tool to display all urls with no PR. Sounds to me though you're looking for bad neighbors. If your link section doesn't display PR with the listing - which I have one that doesn't - I manually review each site one at a time. Usually I end up cleaning out 4 or 5 a month that went...south and it is definitely an all day affair. Having PR display with the listing helps because you only need to review sites with no PR, cutting the work down significantly into minutes rather than hours.

    Good luck buddy.
    Cheerios!

    New to SEO? See the FAQ!

    My Disclaimer:
    Don't Listen To Me - I know nothing!
  16. #9
  17. Ditzy
    SEO Chat Adventurer (500 - 999 posts)

    Join Date
    Aug 2004
    Posts
    743
    Rep Power
    33
    Originally Posted by sufyaaan
    Hmmm... Why not you try http://www.bad-neighborhood.com ?
    Thanks for that tip, new to me. I'm checking them out right now but having trouble getting past their Too Blue background.

    Yes, definetly thanks. Neat function, found three bad links out of 1400 on my 'jinxed' site but now I think I've gone blind.
  18. #10
  19. Ditzy
    SEO Chat Adventurer (500 - 999 posts)

    Join Date
    Aug 2004
    Posts
    743
    Rep Power
    33
    Originally Posted by tfbpa
    Sounds like an interesting idea, but it seems that whenever the word sex or casino is mentioned anywhere it is considered bad. Even a brand new site with a brand new page with information about a educational sex book is considered bad. Just as a regular link to Las Vegas casino's.

    Doesn't seem like it is giving the right information, but thanks for the thought!
    Hmmm, you may be right about that. It found 3 links on my site, 1 of them was duped, which means 2. PR for those 2 was 4 and 5. I guess I'm still jinxed.
  20. #11
  21. Contributing User
    SEO Chat Adventurer (500 - 999 posts)

    Join Date
    Jul 2005
    Location
    Canada
    Posts
    762
    Rep Power
    23
    We are talking about a bot that does this over at this thread:

    http://forums.seochat.com/showthread.php?t=43476&page=15&pp=15

    Even with an API key you are limited to 1000 queries per day ... from past experience that would take at least 4 hours to check all 7000 links.

Similar Threads

  1. too much content too quickly?
    By deano6410 in forum Google Optimization
    Replies: 5
    Last Post: Aug 11th, 2005, 08:21 PM
  2. Dynamic pages dissapeared from google
    By donkeyderby in forum Google Optimization
    Replies: 14
    Last Post: May 17th, 2005, 09:55 AM
  3. 500 keyword rich informative pages
    By algo in forum Search Engine Optimization
    Replies: 0
    Last Post: Jan 11th, 2005, 11:25 PM
  4. Replies: 4
    Last Post: Jan 12th, 2004, 04:08 PM

IMN logo majestic logo threadwatch logo seochat tools logo