Automatic indexing of text material can be very basic, or it can involve some advanced techniques. It normally begins with lexical analysis and it can imply the use of stop word lists, stemming techniques, the extraction of meaningful word combinations or statistical term weighting. Sometimes word combinations are linked to controlled vocabularies or classifications. For two decades now the Text REtrieval Conferences (TREC) have been the laboratory for specialists in this field.
automatic text indexing
We are all familiar with automatic indexing of texts because web search engines offer us the possibility to search for ...