nofollow
From Wikipedia, the free encyclopedia
nofollow is an HTML attribute value used to instruct some search engines that a hyperlink should not influence the link target's ranking in the search engine's index. It is intended to reduce the effectiveness of certain types of search engine spam, thereby improving the quality of search engine results and preventing spamdexing from occurring in the first place.
Contents
[edit] Concept and specification
The concept for the specification of the attribute value nofollow was designed by Google’s head of webspam team Matt Cutts and Jason Shellen from Blogger.com in 2005.[1]
The specification for nofollow[2] and IETF RFC 3667 & RFC 3668. The authors intend to submit this specification to a standards body with a liberal copyright/licensing policy such as the GMPG, IETF, and/or W3C.[1] is copyrighted 2005-2007 by the authors and subject to a royalty free patent policy, e.g. per the W3C Patent Policy 20040205,
[edit] What nofollow is not for
The nofollow attribute value is not meant for blocking access to content or preventing content to be indexed by search engines. The proper methods for blocking search engine spiders to access content on a website or for preventing them to include the content of a page in their index are the Robots Exclusion Standard (robots.txt) for blocking access and on page Meta Elements that are designed to specify on an individual page level what a search engine spider should or should not do with the content of the crawled page.
[edit] Introduction and support
Google announced in early 2005 that hyperlinks with rel="nofollow" attribute[3] would not influence the link target's PageRank. In addition, the Yahoo and MSN search engines also respect this tag.[4]
How the attribute is being interpreted differs between the search engines. While some take it literally and do not follow the link to the page being linked to, others still "follow" the link to find new web pages for indexing. In the latter case rel="nofollow" actually tells a search engine "Don't score this link" rather than "Don't follow this link." This differs from the meaning of nofollow as used within a robots meta tag, which does tell a search engine: "Do not follow any of the hyperlinks in the body of this document.".
[edit] Interpretation by the individual search engines
While all engines that support the attribute exclude links that use the attribute from their ranking calculation, the details about the exact interpretation of the attribute vary from search engine to search engine.[5][6]
[edit] Usage by weblog software
Most weblog software marks reader-submitted links this way by default (with no option to disable it without code modification). A more sophisticated server software could spare the nofollow for links submitted by trusted users like those registered for a long time, on a whitelist, or with a high karma. Some server software adds rel="nofollow" to pages that have been recently edited but omits it from stable pages, under the theory that stable pages will have had offending links removed by human editors.
The widely used blogging platform WordPress versions 1.5 and above automatically assign the nofollow attribute to all user-submitted links (comment data, commenter URI, etc).[8] However, there are several free plugins available that automatically remove the nofollow attribute value.[9]
[edit] Usage on other websites
MediaWiki software, which powers Wikipedia, was equipped with nofollow support soon after initial announcement in 2005. The option was enabled on most international Wikipedias. One of the prominent exceptions was the English language version. Initially, after a discussion, it was decided not to use rel="nofollow" in articles and to use a URL blacklist instead. In this way, English Wikipedia contributed to the scores of the pages it linked to, and expected editors to link to relevant pages.
In May 2006, a patch to MediaWiki software allowed to enable nofollow selectively in namespaces. This functionality was used on pages that are not considered to be part of the actual encyclopedia, such as discussion pages and resources for editors.[10] Following increasing spam problems and a within-Foundation order from Jimmy Wales, rel="nofollow" was added to article-space links in January 2007;[11].[12] However, the various interwiki templates and shortcuts that link to other Wikimedia Foundation projects and many external wikis such as Wikia are not affected by this policy.
Other websites like Slashdot, with high user participation, use improvised nofollow implementations like adding rel="nofollow" only for potentially misbehaving users. Potential spammers posing as users can be determined through various heuristics like age of registered account and other factors. Slashdot also uses the poster's karma as a determinant in attaching a nofollow tag to user submitted links.
A number of social bookmarking and photo sharing websites also use the rel="nofollow" tag for their outgoing links, including: Digg.com, Furl, Propeller.com (formerly Netscape.com), Yahoo! My Web 2.0, YouTube, and Technorati Favs.[13]
[edit] Repurpose for paid links
While the effectiveness of the nofollow attribute to prevent comment spam is in doubt and raises other issues instead,[14] search engines have moved ahead and attempted to repurpose the attribute for something different. Google began suggesting the use of nofollow also as a machine-readable disclosure for paid links, so that these links do not get credit in search engines results.[15]
The growth of the link buying economy, where companies' entire business models are based on paid links that affect search engine rankings,[16] caused the debate about the use of nofollow in combination with paid links to move into the center of attention of the search engines, who started to take active steps against link buyers and sellers.[17] This triggered a very strong response by the web master community.[18]
[edit] Nofollow to control internal PageRank flow
Search engine optimization professionals started using the nofollow attribute to control the flow of PageRank within a website. This is an entirely different use than it was intended originally. Nofollow was designed to control the flow of PageRank from one website to another. However, SEOs realized that a nofollow used for an internal link should just work as well as nofollow used for external links. It makes sense, for example, to use nofollow for internal links to pages that are not relevant for search engines and only for visitors who are already on the website. This includes pages, such as "About Us", "Contact Us", "Terms of Use" or "Privacy Policy". Matt Cutts, one of the initial designers of the nofollow attribute, encouraged this use of the attribute.[19]
[edit] Criticism
Some weblog authors object to the use of rel="nofollow", arguing, for example,[20][21] that
rel="nofollow" Link spammers will continue to place links for clicking (by surfers), even if those links are ignored by search engines. Google is advocating the use of rel="nofollow" in order to reduce the effect of heavy inter-blog linking on page ranking. According to the statistics shown at the Akismet's antispam service [1], this attribute has not stopped the comment spam and has not discouraged spammers. Lack of accreditation may discourage informational comments by professionals wishing to establish their PageRank reputation.
