You are previewing Innovative Techniques and Applications of Entity Resolution.
O'Reilly logo
Innovative Techniques and Applications of Entity Resolution

Book Description

Entity resolution is an essential tool in processing and analyzing data in order to draw precise conclusions from the information being presented. Further research in entity resolution is necessary to help promote information quality and improved data reporting in multidisciplinary fields requiring accurate data representation. Innovative Techniques and Applications of Entity Resolution draws upon interdisciplinary research on tools, techniques, and applications of entity resolution. This research work provides a detailed analysis of entity resolution applied to various types of data as well as appropriate techniques and applications and is appropriately designed for students, researchers, information professionals, and system developers.

Table of Contents

  1. Cover
  2. Title Page
  3. Copyright Page
  4. Book Series
  5. Foreword
  6. Preface
  7. Acknowledgment
  8. Section 1: Principles of Entity Resolution
    1. Chapter 1: Overview of Entity Resolution
      1. ABSTRACT
      2. BASIC CONCEPTS OF ENTITY RESOLUTION
      3. CENTRAL ISSUES OF ENTITY RESOLUTION
      4. RESEARCH CHALLENGES OF ENTITY RESOLUTION
      5. APPLICATIONS FOR ENTITY RESOLUTION
      6. OVERVIEW FOR ENTITY RESOLUTION
      7. PRACTICAL ENTITY RESOLUTION
      8. FUTURE RESEARCH DIRECTIONS
      9. CONCLUSION
      10. REFERENCES
      11. ADDITIONAL READING
      12. KEY TERMS AND DEFINITIONS
    2. Chapter 2: Measures of Entity Resolution Result
      1. ABSTRACT
      2. INTROODUCTION
      3. BACKGROUND
      4. FUTURE RESEARCH DIRECTIONS
      5. CONCLUSION
      6. REFERENCES
      7. ADDITIONAL READING
      8. KEY TERMS AND DEFINITIONS
  9. Section 2: Entity Resolution on Various Types of Data
    1. Chapter 3: Entity Resolution on Names
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. FUTURE RESEARCH DIRECTIONS
      5. CONCLUSION
      6. REFERENCES
      7. ADDITIONAL READING
      8. KEY TERMS AND DEFINITIONS
    2. Chapter 4: Context-Based Entity Resolution
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. MAIN FOCUS OF THE CHAPTER
      5. LEVERAGING CED FOR ENTITY RESOLUTION
      6. EXPERIMENTAL EVALUATION
      7. FUTURE RESEARCH DIRECTIONS
      8. CONCLUSION
      9. REFERENCES
      10. ADDITIONAL READING
      11. KEY TERMS AND DEFINITIONS
    3. Chapter 5: Entity Resolution on Single Relation
      1. ABSTRACT
      2. INTRODUCTION
      3. RECORD SIMILARITY COMPUTATION
      4. SYMBOLS NOTATION
      5. MATCHING DEPENDENCIES AND KEYS
      6. DEDUCING MATCHING DEPENDENCIES
      7. REFERENCES
      8. ADDITIONAL READING
      9. KEY TERMS AND DEFINITIONS
    4. Chapter 6: Entity Resolution on Multiple Relations
      1. ABSTRACT
      2. INTRODUCTION
      3. CONCLUSION
      4. REFERENCES
      5. ADDITIONAL READING
      6. KEY TERMS AND DEFINITIONS
    5. Chapter 7: XML Object Identification
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. MAIN FOCUS OF THE CHAPTER
      5. XML PAIRWISE ENTITY RESOLUTION
      6. FUTURE RESEARCH DIRECTIONS AND CONCLUSION
      7. REFERENCES
      8. ADDITIONAL READING
      9. KEY TERMS AND DEFINITIONS
    6. Chapter 8: Entity Resolution on Graph Data Set
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. DISTANCE DEFINITION OF GRAPH
      5. ALGORITHM MATCH
      6. PAIR-WISE ENTITY RESOLUTION ON GRAPHS
      7. NH-INDEX STRUCTURE
      8. QUERY PROCESSING
      9. KERNEL FUNCTION
      10. WAVELET ALIGNMENT KERNEL
      11. WAVELET GRAPH MATCHING KERNEL
      12. FUTURE RESEARCH DIRECTIONS
      13. CONCLUSION
      14. REFERENCES
      15. ADDITIONAL READING
      16. KEY TERMS AND DEFINITIONS
    7. Chapter 9: Entity Resolution on Complex Network
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. OTHER RELATED RESEARCH
      5. FUTURE RESEARCH DIRECTIONS
      6. CONCLUSION
      7. REFERENCES
      8. ADDITIONAL READING
      9. KEY TERMS AND DEFINITIONS
    8. Chapter 10: Entity Resolution on Cloud
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. MAPREDUCE PARADIGM
      5. FORMALIZATION OF ENTITY RESOLUTION PROBLEM
      6. AN ENTITY RESOLUTION METHOD BASED ON MAPREDUCE
      7. FRAMEWORK OF THE CLUSTERING ALGORITHM
      8. GENERATE TOTAL FREQUENCIES F(C) WITH ε-SINGLE DIRECTIONAL NEIGHBORHOOD
      9. GENERATE WAVE OF STRINGS
      10. COMPUTING SIMILARITY VALUES
      11. THEORETICAL ANALYSIS
      12. EXPERIMENTAL RESULTS
      13. EXPERIMENTAL RESULTS
      14. STRATEGIES TO IMPROVE PERFORMANCE
      15. FUTURE RESEARCH DIRECTIONS
      16. CONCLUSION
      17. REFERENCES
      18. ADDITIONAL READING
      19. KEY TERMS AND DEFINITIONS
  10. Section 3: Database Techniques and Entity Resolution
    1. Chapter 11: Basic Data Operators for Entity Resolution
      1. ABSTRACT
      2. INTRODUCTION
      3. SIMILARITY SEARCH
      4. CENTER
      5. MERGECENTER
      6. FUTURE RESEARCH DIRECTIONS
      7. CONCLUSION
      8. REFERENCES
      9. ADDITIONAL READING
      10. KEY TERMS AND DEFINITIONS
    2. Chapter 12: Data Cleaning Based on Entity Resolution
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. DIFFERENT TRUTH DISCOVERY APPROACHES
      5. SUMMARY
      6. REFERENCES
      7. ADDITIONAL READING
      8. KEY TERMS AND DEFINITIONS
    3. Chapter 13: Query Processing Based on Entity Resolution
      1. ABSTRACT
      2. INTRODUCTION
      3. ENTITY-BASED DATA MODEL
      4. OPERATORS OF THE DATA MODEL
      5. QUERY PROCESSING IN ENTITY-BASED DATABASES
      6. THRESHOLD SIMILARITY JOIN SIZE ESTIMATION
      7. MULTI-SIMILARITY JOIN ORDER SELECTION IN ENTITY DATABASE
      8. THE SIZE ESTIMATION OF ENTITY SIMILARITY JOIN RESULT
      9. CONCLUSION
      10. REFERENCES
      11. ADDITIONAL READING
      12. KEY TERMS AND DEFINITIONS
  11. Section 4: Applications for Entity Resolution
    1. Chapter 14: Duplicate Record Detection for Data Integration
      1. ABSTRACT
      2. INTRODUCTION
      3. PROBLEM DEFINITION
      4. DUPLICATE RECORDS DETECTION
      5. THE NAÏVE DUPLICATE RECORDS DETECTION METHOD
      6. CONCLUSION
      7. REFERENCES
      8. ADDITIONAL READING
      9. KEY TERMS AND DEFINITIONS
    2. Chapter 15: Entity Resolution in Bibliography Information Management
      1. ABSTRACT
      2. INTRODUCTION
      3. THE ENTITY RESOLUTION FRAMEWORK: EIF
      4. THE AUTHOR RESOLUTION ALGORITHM BASED ON EIF (AI-EIF)
      5. EXPERIMENTS
      6. CONCLUSION
      7. REFERENCES
      8. ADDITIONAL READING
      9. KEY TERMS AND DEFINITIONS
    3. Chapter 16: Product Entity Resolution in E-Commerce
      1. ABSTRACT
      2. INTRODUCTION
      3. RELATED WORK
      4. SYSTEM DESIGN
      5. EXPERIMENT
      6. CONCLUSION
      7. REFERENCES
      8. ADDITIONAL READING
      9. KEY TERMS AND DEFINITIONS
    4. Chapter 17: Entity Resolution in Healthcare
      1. ABSTRACT
      2. INTRODUCTION
      3. BACKGROUND
      4. CONCLUSION
      5. REFERENCES
      6. ADDITIONAL READING
      7. KEY TERMS AND DEFINITIONS
  12. Compilation of References
  13. About the Contributors