11

Indexing the web

Abstract:

Some libraries catalogue websites as if they were books. Other initiatives to manually index (parts of) the web are web directories and startpages. Web directories offer a structured way to retrieve websites. Startpages, a typically Dutch phenomenon, list all their links on one page. Social bookmarks are also a kind of manual web indexing: many people save relevant links on a bookmark site and in doing so help to build a manual index. The most important role, however, is played by the automatic web indexing mechanisms that search engines use. Google has tackled the ranking problem that all search engines face: its PageRank mechanism sorts the retrieved web pages according to the links they receive from other sites. Some search engines ...
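
The link-based ranking idea can be illustrated with a small sketch. This is a simplified textbook version of PageRank computed by repeated redistribution of rank along links, not Google's production algorithm. The function names `pagerank` and `rank_results`, the damping factor of 0.85, the fixed iteration count, and the toy link graph are all assumptions made for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Compute simplified PageRank scores.

    `links` maps each page to the list of pages it links to.
    The damping factor and iteration count are illustrative defaults.
    """
    # Collect every page that appears as a source or as a link target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    pages = sorted(pages)
    n = len(pages)

    # Start with rank spread evenly over all pages.
    rank = {page: 1.0 / n for page in pages}

    for _ in range(iterations):
        # Rank held by pages without outgoing links is shared by all pages.
        dangling = sum(rank[page] for page in pages if not links.get(page))
        new_rank = {
            page: (1.0 - damping) / n + damping * dangling / n
            for page in pages
        }
        # Each page passes an equal share of its rank to every page it links to.
        for page, targets in links.items():
            if targets:
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
        rank = new_rank

    return rank


def rank_results(retrieved, links):
    """Sort retrieved pages by descending PageRank score."""
    scores = pagerank(links)
    return sorted(retrieved, key=lambda page: scores[page], reverse=True)


if __name__ == "__main__":
    # Hypothetical toy web: three pages link to "c", and "c" links to "a".
    toy_links = {"a": ["c"], "b": ["c"], "c": ["a"], "d": ["c"]}
    print(rank_results(["a", "b", "c"], toy_links))
```

In this toy graph the page that receives the most links ends up first in the sorted results, which is the behaviour the abstract describes. A real search engine would combine such a link score with many other signals.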
