#1
  1. No Profile Picture
    Registered User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Nov 2012
    Posts
    2
    Rep Power
    0

    How to remove very LARGE number of links from Google Index


    Hi guys,
    I work as seo consultant for large website. Main content of website are pages of objects(accomodation, restaurants etc).
    Because of bad SEO we end up with literally houndred of thousands of pages indexed. We repaired this in summer - all bad links have 301 redirects, and nobody links anymore to that content. After half a year nothing changed. Pages are still in google index. I googled and found 3 possible sollutions:

    1. use google removal tool: it was designed to remove only few pages, and according to google it should be used only for urgent removal. I can write a script that will automaticly add pages to google removal, but it is against their policies (if links was in one folder it would be possible to delete all in folder, but they are not).

    2. define rule to robots.txt
    Google knows wildcards in robots.txt, so i can define rules to remove this urls. Problem with this solutions is, that google will not crawl this pages, but they will remain in index. When or if ever google removes it from index, i didnt find answer for that.
    I put few of my bad pages to robots.txt month ago and they are still indexed.

    3. put links back on my site - google will find them, go through them, find 301 and deindex them. But I am not sure if this isnt harmfull for my site.

    I am hopeless with this. Do you know any better sollution?
  2. #2
  3. Philip@SearchBenefit.com
    SEO Chat Good Citizen (1000 - 1499 posts)

    Join Date
    Oct 2009
    Location
    Massachusetts, USA
    Posts
    1,388
    Rep Power
    1009
    I am sorry that no one has replied to this but your description of the problem is highly confusing, apparently not only to me.

    I understand that you have a bunch of pages indexed that you do not want indexed. If I understand correctly, you
    have removed the content of those pages and 301 redirected their URLs (is that what you mean by "bad links"?). So why is it a problem that those pages are still indexed? As I see it, the worst thing that can happen is that people will find you in search via those pages, click on the search result and be 301-redirected to some better page, no? It's hard to fathom how having *more* pages indexed can hurt you.

    If there is a specific reason (please explain it?) and if you *must* have those pages deindexed, one thing to try would be to generate a new XML sitemap and submit it via Webmaster's tools, then wait a while.
  4. #3
  5. No Profile Picture
    Contributing User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Dec 2010
    Posts
    82
    Rep Power
    34
    I had a similar problem with about 50k pages of low quality, near duplicate content being indexed that shouldn't have been. My fault, they found their way into my sitemap, I should have been more careful. To make it worse, they were short life pages, and as they died my 404's skyrocketed. It took me about 6 weeks to clear out the 404's in WMT. All the junk pages have been 301 redirected to the nearest similar page. I set robots to nofollow, noarchive, noindex for the pages. I of course corrected my sitemap. A couple of weeks after I did this, I used the removal tool to remove the directories that these pages fell into. The junk pages are out of the SERPS now, but WMT still shows them in the number of indexed pages.

    Strangely, even though WMT updates my number of indexed pages every Sunday, the number only changes every other week. So I am waiting 2 weeks to hopefully see some of these junk pages deindexed. The count only goes down by 3-5k each time. In the way that you are only allowed to clear out 1000 404 listings a day, I wonder if there is an invisible limit to the number or percentage of pages that can be deindexed within a certain time period.

    If there is a faster way to get them deindexed I would love to know it. It's taken 5 weeks to see a drop of 16k. I'd like to see another 30-40k gone.

    I have not seen much in the way of positive ranking results yet from my efforts. Search query keywords and postitions have started to improve, but impressions are still down and WMT shows only 5 clicks a day, never more. I realize it's probably a rounded estimate for that number, but it's always the same. It's like I'm only allowed x amount of visitors a day via G.
    Last edited by eeyipes; Dec 1st, 2012 at 03:08 PM.
  6. #4
  7. No Profile Picture
    Registered User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Nov 2012
    Posts
    2
    Rep Power
    0
    Originally Posted by PhilipSEO
    I am sorry that no one has replied to this but your description of the problem is highly confusing, apparently not only to me.

    I understand that you have a bunch of pages indexed that you do not want indexed. If I understand correctly, you
    have removed the content of those pages and 301 redirected their URLs (is that what you mean by "bad links"?). So why is it a problem that those pages are still indexed? As I see it, the worst thing that can happen is that people will find you in search via those pages, click on the search result and be 301-redirected to some better page, no? It's hard to fathom how having *more* pages indexed can hurt you.

    If there is a specific reason (please explain it?) and if you *must* have those pages deindexed, one thing to try would be to generate a new XML sitemap and submit it via Webmaster's tools, then wait a while.
    Actually you understand it quite well.
    1. I want them to be deindexed because they were nearly duplicate content, and they ara messing up my SERP. Bacause of this duplicates my pages rates 2-3x lower.
    2. Solution with sitemap is not working, at least for my page. I created a sitemap for that pages 3 months ago, but that didnt help. Pages from that sitemap was still indexed 2weeks ago.
    3. It seems that solution with robots.txt is working, 3weeks ago I defined wildcards for 100k of my pages, and they begin to disappear from google index.

Similar Threads

  1. Link Building 101
    By GaryTheScubaGuy in forum Link Development
    Replies: 172
    Last Post: Feb 10th, 2011, 03:25 PM
  2. Google's Algorithm... Why Google is Failing.
    By rjonesx in forum Google Optimization
    Replies: 34
    Last Post: Dec 23rd, 2010, 03:42 AM
  3. When Google got it wrong
    By Alex324 in forum Google Optimization
    Replies: 10
    Last Post: Mar 12th, 2006, 03:10 AM
  4. Googleguy speaks! Some good clarifications
    By thewormman in forum Google Optimization
    Replies: 30
    Last Post: Jun 4th, 2005, 02:01 AM
  5. Reality & Truth - Competative Keywords -v- Real Life Searches
    By -search-engines-web in forum Google Optimization
    Replies: 2
    Last Post: Dec 27th, 2003, 05:25 PM

IMN logo majestic logo threadwatch logo seochat tools logo