You are previewing Next Generation Search Engines.
O'Reilly logo
Next Generation Search Engines

Book Description

Recent technological progress in computer science, Web technologies, and the constantly evolving information available on the Internet has drastically changed the landscape of search and access to information. Current search engines employ advanced techniques involving machine learning, social networks, and semantic analysis.
Next Generation Search Engines: Advanced Models for Information Retrieval is intended for scientists and decision-makers who wish to gain working knowledge about search in order to evaluate available solutions and to dialogue with software and data providers. The book aims to provide readers with a better idea of the new trends in applied research.

Table of Contents

  1. Cover
  2. Title Page
  3. Copyright Page
  4. Editorial Advisory Board and List of Reviewers
    1. Editorial Advisory Board
  5. Preface
    1. NEEDS AND REQUIREMENTS FOR INFORMATION RETRIEVAL
    2. OBJECTIVES OF THE BOOK
    3. TARGET AUDIENCE
    4. A BRIEF OVERVIEW OF THE ORGANIZATION OF THE BOOK
  6. Section 1: Indexation
    1. Chapter 1: Indexing the World Wide Web
      1. ABSTRACT
      2. INTRODUCTION
      3. ORGANIZING THE WEB
      4. LAYING OUT THE INDEX
      5. SCALING THE SYSTEM
      6. EXTRACTING FEATURES FOR RANKING
      7. OPEN SOURCE SEARCH ENGINES
      8. FUTURE RESEARCH DIRECTIONS
      9. CONCLUSION
    2. Chapter 2: Decentralized Search and the Clustering Paradox in Large Scale Information Networks
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. DISTRIBUTED AND P2P INFORMATION RETRIEVAL
      5. CLUSTERING AND NETWORK CLUSTERING FOR INFORMATION RETRIEVAL
      6. A DECENTRALIZED VIEW FOR INFORMATION RETRIEVAL
      7. CLUSTERING PARADOX AND DECENTRALIZED SEARCH
      8. FUTURE DIRECTIONS
      9. CONCLUSION
    3. Chapter 3: Metadata for Search Engines
      1. ABSTRACT
      2. INTRODUCTION
      3. THE E-SCIENCE ISSUE: SEARCHING THROUGH METADATA
      4. CASE STUDY 1: HIGH ENERGY PHYSICS
      5. CASE STUDY 2: EARTH SCIENCES
      6. CASE STUDY 3: LIFE SCIENCES
      7. CONCLUSION
    4. Chapter 4: Crosslingual Access to Photo Databases
      1. ABSTRACT
      2. INTRODUCTION
      3. PROBLEMS OF IMAGE DESCRIPTION BY AUTHORS
      4. SEMANTIC AMBIGUITIES
      5. USER BEHAVIOUR IN QUERYING
      6. CROSS-LANGUAGE QUERYING
      7. GROWING DIFFICULTY WITH THE INCREASE OF THE LANGUAGE NUMBER
      8. SEMANTIC DISAMBIGUATION BY IMAGE PROCESSING
      9. EVALUATION
      10. CONCLUSION
    5. Chapter 5: Fuzzy Ontologies Building Platform for Semantic Web
      1. ABSTRACT
      2. 1. INTRODUCTION
      3. 2. MOTIVATIONS AND RELATED WORK
      4. RELATED WORK
      5. 3. FUZZY ONTOLOGIES
      6. 4. FUZZY ONTOLOGIES BUILDING METHODOLOGY: FUZZY ONTO METHODOLOGY
      7. 5. THE FUZZY ONTOLOGIES BUILDING PLATFORM ARCHITECTURE SPECIFICATION
      8. FUZZY ONTOLOGIES MODELER FRAMEWORK
      9. FUZZY ONTOLOGIES GENERATION CODE FRAMEWORK
      10. 6. CONCLUSION
  7. Section 2: Data Mining for Information Retrieval
    1. Chapter 6: Searching and Mining with Semantic Categories
      1. ABSTRACT
      2. INTRODUCTION
      3. SEMANTIC SEARCH ENGINE OR QUESTION-ANSWERING SYSTEM?
      4. BACKGROUND
      5. SEMANTIC AND DISCOURSE INDEXATION AS A REVERSE ENGINEERING
      6. SEMANTIC ANNOTATION WITH CONTEXTUAL EXPLORATION
      7. SEMANTIC INDEXATION
      8. BROWSING AND MINING WITHIN SEMANTIC ANNOTATIONS
      9. FUTURE RESEARCH DIRECTIONS
      10. CONCLUSION
    2. Chapter 7: Semantic Models in Information Retrieval
      1. ABSTRACT
      2. INTRODUCTION: USING SEMANTICS IN INFORMATION RETRIEVAL
      3. CHAPTER ORGANIZATION AND STRUCTURE
      4. 1. IR BASICS
      5. 2. STATE OF THE ART IN PROBABILISTIC IR
      6. 3. SEMANTIC MODELING
      7. 4. EXPLANATORY MODELING VS. FUNCTIONAL MODELING
      8. 5. COMPUTER IMPLEMENTATION OF SEMANTIC ACQUISITION
      9. 6. IMPLEMENTATION CHOICES: REVISITING BM25 AND PLSA
      10. 7. MODEL INTEGRATING SEMANTIC AND INDEXING
      11. 9. EXPERIMENTATION
      12. 10. CONCLUSION
    3. Chapter 8: The Use of Text Mining Techniques in Electronic Discovery for Legal Matters
      1. ABSTRACT
      2. INTRODUCTION
      3. A BRIEF HISTORY OF LEGAL DISCOVERY
      4. USING TEXT MINING AND INFORMATION RETRIEVAL IN EDISCOVERY
      5. CHALLENGES IN LEGAL ELECTRONIC DISCOVERY
      6. RECOMMENDATIONS
      7. FUTURE RESEARCH DIRECTIONS
      8. CONCLUSION
    4. Chapter 9: Intelligent Semantic Search Engines for Opinion and Sentiment Mining
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. DOXA OPINION SEARCH ENGINE PROTOTYPE
      5. OVERVIEW OF THE SYSTEM
      6. LINGUISTIC APPROACH
      7. SEARCH ENGINES AND TEXT OLAP
      8. FUTURE RESEARCH DIRECTIONS
      9. CONCLUSION
      10. APPENDIX
  8. Section 3: Interface
    1. Chapter 10: Human-Centred Web Search
      1. ABSTRACT
      2. INTRODUCTION
      3. SEARCHER BEHAVIOUR
      4. SUPPORTING SEARCHER NEEDS
      5. EVALUATING NOVEL WEB SEARCH INTERFACES
      6. FUTURE RESEARCH DIRECTIONS
      7. CONCLUSION
    2. Chapter 11: Extensions of Web Browsers useful to Knowledge Workers
      1. ABSTRACT
      2. INTRODUCTION
      3. 1. SEARCHING FOR INFORMATION
      4. 2. COLLECTING AND MANAGING SOURCES AND DOCUMENTS
      5. 3. SUMMARY TABLES
      6. CONCLUSION
    3. Chapter 12: Next Generation Search Engine for the Result Clustering Technology
      1. ABSTRACT
      2. INTRODUCTION
      3. RELATED WORK
      4. SYSTEM ARCHITECTURE
      5. METASEARCH RANKING
      6. LABEL CONSTRUCTION FOR THE FIRST ROUND
      7. LABEL CONSTRUCTION FOR THE SECOND ROUND
      8. BUILD A HIERARCHICAL TREE STRUCTURE
      9. EXPERIMENT ANALYSIS
      10. CONCLUSION AND FUTURE WORK
    4. Chapter 13: Using Association Rules for Query Reformulation
      1. ABSTRACT
      2. INTRODUCTION
      3. MAXIMAL ASSOCIATION RULES
      4. REFORMULATING A QUERY FOR A SEARCH ENGINE
      5. IDENTIFICATION OF MAXIMAL ASSOCIATION RULES IN SIMILARITY CLASSES
      6. EXPERIMENTS
      7. CONCLUSION
    5. Chapter 14: Question Answering
      1. ABSTRACT
      2. INTRODUCTION
      3. OVERVIEW AND BACKGROUND
      4. STATE-OF-THE-ART QUESTION ANSWERING
      5. QUESTION ANSWERING IN THE SEMANTIC WEB ENVIRONMENT
      6. ISSUES, CONTROVERSIES AND RECOMMENDATIONS
      7. FUTURE RESEARCH DIRECTIONS
      8. CONCLUSION
      9. APPENDIX
    6. Chapter 15: Finding Answers to Questions, in Text Collections or Web, in Open Domain or Specialty Domains
      1. ABSTRACT
      2. INTRODUCTION
      3. QA IN OPEN DOMAIN
      4. QA IN SPECIALTY DOMAIN
      5. QA AND THE WEB
      6. CONCLUSION AND PERSPECTIVES
    7. Chapter 16: Context-Aware Mobile Search Engine
      1. ABSTRACT
      2. 1. INTRODUCTION
      3. 2. RELATED WORK
      4. 3. CASE STUDY
      5. 4. CONTEXT MODELING
      6. 5. SYSTEM ARCHITECTURE
      7. 6. SCENARIO EXECUTION
      8. 7. CONCLUSION
    8. Chapter 17: Spatio-Temporal Based Personalization for Mobile Search
      1. ABSTRACT
      2. INTRODUCTION
      3. MOBILE INFORMATION RETRIEVAL: BACKGROUND AND MOTIVATIONS
      4. EXPERIMENTAL EVALUATION
      5. CONCLUSION AND FUTURE WORK
  9. Section 4: Evaluation
    1. Chapter 18: Studying Web Search Engines from a User Perspective
      1. ABSTRACT
      2. INTRODUCTION
      3. 1. RECENT CHALLENGES IN SEARCH ENGINE EVOLUTION
      4. 2. KEY CONCEPTS RELATED TO WEB INFORMATION SEARCHING
      5. 3. MODELS FOR WEB SEARCHING ANALYSIS
      6. 4. FUTURE DIRECTIONS
      7. 5. CONCLUSION
    2. Chapter 19: Artificial Intelligence Enabled Search Engines (AIESE) and the Implications
      1. ABSTRACT
      2. INTRODUCTION
      3. TECHNOLOGY AND SOCIAL CHANGE
      4. MOORE’S LAW AND ARTIFICIAL INTELLIGENCE (AI)
      5. TECHNOLOGICAL SINGULARITY
      6. EPISTEMOLOGICAL DIMENSIONS OF SEARCH ENGINES (SEs)
      7. SEs AND DANGER OF CENSURE
      8. OLIGOPOLISTIC STRUCTURE OF THE SE MARKET
      9. CONCLUSION
      10. APPENDIX
    3. Chapter 20: A Framework for Evaluating the Retrieval Effectiveness of Search Engines
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. LITERATURE REVIEW
      5. A FRAMEWORK FOR EVALUATING THE RETRIEVAL EFFECTIVENESS OF SEARCH ENGINES
      6. CONCLUSIONS AND FUTURE RESEARCH DIRECTIONS
  10. Compilation of References
  11. About the Contributors
  12. Index