|
|
Hiding your web pages from search enginesBackgroundSearch engines and some other organisations use 'robot' or 'web crawler' software to find and index web pages and their contents. Sometimes you may want to hide a web page from a search engine, for example if you are testing a new site before launching it. There are several ways to achieve this. Hiding your web pagesTo achieve the highest level of secrecy for a digital file, simply do not put it on a web server or Internet-connected machine. This is clearly impractical for documents that you want to share with others. Several tools can restrict access to shared documents:
This article provides details about the first two options. robots.txt fileIf you have your own web server, you can create a robots.txt file that tells search engines which pages on the server they should ignore. See the Web Server Administrator's Robots Exclusion Protocol Guide for information about how to do this. The robots exclusion protocol is supported by many of the bigger search engines, but not all of them. For information about a particular search engine's attitude to the protocol, look in the company's privacy policy. robots meta tagThe <meta name="robots"> tag can help hide your web page from web crawlers, spiders and robots. For complete protection, use this tag: Using this metadata tag... Using this tag... EXTRA TIP: Make sure there are no links from other web pages to the page you want to hide. Search engine spiders/robots can (and will) follow those links to your page. Remember also that messages sent to discussion lists (for example, web-forum@unimelb) should not include the secret URL, because these discussion lists are often archived on a web site. Related ResourcesAll these articles were current on 11 September 2002:
|
|
Contact the University : Disclaimer & Copyright : Privacy : Accessibility |
|
Date Created: 14 Apr 2005 |
The University of Melbourne ABN: 84 002 705 224 |