Free-text searching

All of the approaches described here can now commonly include asearch engine. These products analyse the text content of documents, and allow system users to quickly find relevant documents from words or phrases entered at a system prompt.

A search engine may have a number of techniques for locating documents that contain specified words or phrases. The most basic technique employs an inverted word index, which is a simple list of all the words used in a collection, sorted alphabetically, with links back to the documents containing these words.

XML is based on plain text formats, so is naturally easy to index using a search engine.

Increasingly, search engines are becoming XML-sensitive, and are able to identify text that ...

Get XML Companion, The, Third Edition now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.