#1
  1. No Profile Picture
    Registered User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Aug 2007
    Posts
    21
    Rep Power
    0

    Exclamation URGENT: How to remove PDF file from Google Search Results??


    All,

    One of my client website is on test server and 80-100 pages of the site that mostly includes pdf files showing on Google search results. Client want me to remove the whole site from Google search. He don't want to leak the information before launch. Last month I have implemented below Google webmaster recommended techniques for removing site from Google search:-

    1. Implementation of robotst.txt with below text in file:-

    User-agent: * Disallow: /


    2. Implemented below code on all pages of site:-
    <meta name="robots" content="noarchive" />
    <meta name="robots" content="noindex, nofollow" />
    <meta name="robots" content="nosnippet" />


    3. I have placed entire website removal request in Google webmaster tool. But the request got denied because of below reason:-

    Your request has been denied because the webmaster of the site hasn't applied the appropriate robots.txt file or meta tags to block us from indexing or archiving this page.
    Please work with the webmaster of this site or select an alternate removal option from the webpage removal request tool.


    4. After that I have manually submitted the site pages including the pdf file on Google webmaster tool - "Individual URLs: web pages, images, or other files" option. Now most of the site pages are removed from Google search but pdf files removal request got denied because of below reason:-

    Your request has been denied because the webmaster of the site hasn't applied the appropriate robots.txt file or meta tags to block us from indexing or archiving this page.
    Please work with the webmaster of this site or select an alternate removal option from the webpage removal request tool.


    I want to know how I can remove these pdf files from Google search.

    Thanks for reading this thread and help.

    SEO DNA
  2. #2
  3. Contributing User
    SEO Chat Discoverer (100 - 499 posts)

    Join Date
    Jan 2007
    Posts
    347
    Rep Power
    27
    specifically disallow googlebot and place an password protect in the htaccess file.
  4. #3
  5. No Profile Picture
    mtw
    Contributing User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Apr 2009
    Posts
    32
    Rep Power
    11
    Yeah password protect the directory containing the pdfs - check your robots.txt through webmaster tools. Read more on robots.txt at robotstxt.org.
  6. #4
  7. No Profile Picture
    Registered User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Aug 2007
    Posts
    21
    Rep Power
    0

    Exclamation


    Originally Posted by 1fast72nova
    specifically disallow googlebot and place an password protect in the htaccess file.
    Originally Posted by mtw
    Yeah password protect the directory containing the pdfs - check your robots.txt through webmaster tools. Read more on robots.txt at robotstxt.org.
    Thanks for the reply.

    I think the command
    User-agent: * Disallow: / disallow all and every type of robots to enter the site.

    Protecting the PDF directory through password is a good idea but it will just block the visitors to open the pdf. The Google search result issue of these pdf pages will still exist.

    What's your point of view??
  8. #5
  9. No Profile Picture
    mtw
    Contributing User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Apr 2009
    Posts
    32
    Rep Power
    11
    Have you thought about creating a new directory - placing your pdfs in this directory and then putting a no follow on this directory using your robots.txt file?
  10. #6
  11. No Profile Picture
    mtw
    Contributing User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Apr 2009
    Posts
    32
    Rep Power
    11
    example:

    User-Agent: *
    Disallow: /new_pdf_directory
  12. #7
  13. No Profile Picture
    Contributing User
    SEO Chat Discoverer (100 - 499 posts)

    Join Date
    May 2009
    Location
    Manchester
    Posts
    123
    Rep Power
    11
    Originally Posted by SEO_DNA
    Thanks for the reply.

    I think the command
    User-agent: * Disallow: / disallow all and every type of robots to enter the site.

    Protecting the PDF directory through password is a good idea but it will just block the visitors to open the pdf. The Google search result issue of these pdf pages will still exist.

    What's your point of view??
    Does this help?

    http://www.antezeta.com/blog/avoid-search-engine-indexing
  14. #8
  15. No Profile Picture
    Registered User
    SEO Chat Explorer (0 - 99 posts)

    Join Date
    Aug 2007
    Posts
    21
    Rep Power
    0
    All,

    Thanks for the help. I had successfully blocked the PDF files through robots.txt file and it works good. But still some pdf files with same folder name coming on search results. I have to submit search removal request of these files manually on Google webmaster tool.

    Google robots didn't crawl my site after implemented all search removal codes on my site pages, but still some old archive pages are showing on Google search. Every time I have to submit search removal request of archive result pages manually on Google webmaster tool. Like yesterday when I had checked the status of my site on Google search then only 2 pages were showing. I had removed that 2 pages through Google webmaster tool. Today again 2 new pages are showing on search result.

    My question here is that "how I can check all Google old archive pages of my site, so that I will remove them one time??"

    Thanks for your help.
    SEO_DNA

Similar Threads

  1. CSS dropshadow technique used on H1 tags
    By DubbelDee in forum Search Engine Optimization
    Replies: 4
    Last Post: Mar 5th, 2007, 06:34 AM
  2. Need N/A SE Rank explained
    By Ramses357 in forum Google Optimization
    Replies: 21
    Last Post: Jul 12th, 2005, 08:06 PM
  3. Article: new Google feature called Google Suggest
    By dirtdog1960 in forum SEO Chat Articles
    Replies: 0
    Last Post: Dec 12th, 2004, 01:09 AM
  4. Link Popularity being Re-Defined and Revised
    By -search-engines-web in forum Google Optimization
    Replies: 17
    Last Post: Sep 2nd, 2004, 01:30 AM

IMN logo majestic logo threadwatch logo seochat tools logo