Ann Smarty

Tell Google How to Treat Your Content : Disallow, Nofollow, NoIndex

April 22nd, 2008 by Ann Smarty | 7 Comments

A lot has been said recently about different ways to keep your content hidden from search engines. Still there is much confusion regarding which method serves which aim. So I decided to make a quick easy-to-understand guideline of what should be done depending on what you need to achieve.

So mainly you might want a search engine to treat your content in 4 possible ways:

  1. crawl, index and rank your content;
  2. do not crawl but (possibly) return in search results (i.e. rank);
  3. do not index and consequently do not rank;
  4. do not follow the given link only from this very page.


While with the first option you do not generally need any directives; with 3 other aims the right techniques would be:

  • do not crawl => robots.txt directive (NOTE: the page might still be indexed and ranked when discovered via external links pointing to it);
  • do not index => ‘noindex’ directive attached to the page (NOTE: if you use both robots.txt disallow directive and noindex meta tag, the page may still be indexed: a search engine won’t crawl the page and hence will be unaware of the noindex directive; so in this case ‘more’ doesn’t mean ‘better’);
  • do not follow the link => rel=”nofollow” attribute attached to a link (NOTE: if there are other ‘dofollow’ external/internal links to the page, it will be crawled, indexed and ranked).

hide-content.jpg

Sebastian had also a great piece published at SEOMoz blog discussing crawler and indexer directives and how they are treated by different search engines.



Comments

7 responses so far ↓

  • doug m on Apr 22, 2008 at 11:39 am

    this is a great article. now how would I stop links from showing up that have no content, IE: certain php pages that don’t related to my posts

  • Louis Liem on Apr 22, 2008 at 12:31 pm

    so, correct me if i’m wrong..

    noindex means not displaying the page on SERP and nofollow means not following the link and in google’s case, not giving any PR link juice?

  • Name changed to protect the Sphinner on Apr 22, 2008 at 2:33 pm

    I’ve long wondered whether to add “noindex,follow” to the front page of my blog to avoid a duplicate content penalty.

    The problem is that I have links from social networking profiles pointing there and so its gaining (a little) rank.

    I read that search engines understand blogs and don’t impose this penalty - but do the SEOs out there have a definitive answer?

  • chris on Apr 22, 2008 at 9:20 pm

    good job!

  • kokotaro on Apr 22, 2008 at 9:30 pm

    thanks, now i can understand completely the crawl, index and rank.
    but i always feel confused that what’s the differences between “external nofollow” and “nofollow”.
    i really want to know the answer. thanks again.

  • Matt on Apr 25, 2008 at 7:20 pm

    Nofollow does not prevent engines from following the link. You are mistaken. The link is still followed. All it does is prevent passing of link juice.

  • Ann Smarty on Apr 26, 2008 at 4:18 am

    @Matt - no, actually a link with Nofollow attribute won’t be followed. You can run a simple test by creating a new page at your site and nofollow all links to it. The page won’t be crawled and indexed until you create a “dofollow’ link to it.

    Here is also quite a straightforward citation by Matt Cutts if you don’t trust me:

    “NoFollow as an individual link attribute means don’t follow this particular link…”

    http://www.stonetemple.com/articles/interview-matt-cutts.shtml

Leave a Comment