Entity-Centric Analysis: A Deeper Understanding of the Data
Throughout this chapter, itâs been implied that analytic approaches that exhibit a deeper understanding of the data can be dramatically more powerful than approaches that simply treat each token as an opaque symbol. But what does âa deeper understandingâ of the data really mean? One interpretation is being able to detect the entities in documents and using those entities as the basis of analysis, as opposed to document-centric analysis involving keyword searches or interpreting a search input as a particular type of entity and customizing results accordingly. Although you may not have thought about it in those terms, this is precisely what emerging technologies such as WolframAlpha do at the presentation layer. For example, a search for âtim oâreillyâ in WolframAlpha returns results that imply an understanding that the entity being searched for is a person; you donât just get back a list of documents containing the keywords (see Figure 8-3). Regardless of the internal technique thatâs used to accomplish this end, the resulting user experience is dramatically more powerful because the results conform to a format that more closely satisfies the userâs expectations.
Figure 8-3. Sample results for a âtim oâreillyâ query with WolframAlpha
Although itâs beyond the scope of this chapter to ponder the various ...
Get Mining the Social Web now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.