You are previewing Developing Quality Metadata.
O'Reilly logo
Developing Quality Metadata

Book Description

With the explosion of new audio and video content on the Web, it’s more important than ever to use accurate and comprehensive metadata to get the most out of that content. Developing Quality Metadata is an advanced user guide that will help you improve your metadata by making it accurate and coherent with your own solutions. This book is designed to get you thinking about solving problems in a proactive and productive way by including practical descriptions of powerful programming tools and user techniques using several programming languages. For example, you can use shell scripting as part of the graphic arts and media production process, or you can use a popular spreadsheet application to drive your workflow. The concepts explored in this book are framed within the context of a multimedia professional working on the Web or in broadcasting, but they are relevant to anyone responsible for a growing library of content, be it audio-visual, text, or financial.

*Solutions to build your own tools instead of buying off-the-shelf software solutions
*Real-world examples and case studies explore the usefulness of the tools
*Author Cliff Wootton has been building tools and applications for producing and delivering multimedia content for the last 25 years

Table of Contents

  1. Cover
  2. Halftitle
  3. Title
  4. Copyright
  5. Dedication
  6. Contents
  7. Acknowledgments
  8. PART 1. Introduction
    1. 1. Framing the Problem
    2. 2. Metadata
    3. 3. Object Modeling Your Data
    4. 4. Transfer and Conversion
    5. 5. Dealing With Raw Data
    6. 6. Character Mapping and Code Sets
    7. 7. Data Fields
    8. 8. Fields, Records, and Tables
    9. 9. Times, Dates, Schedules, and Calendars
    10. 10. Names, Addresses, and Contacts
    11. 11. Spatial Data and Maps
    12. 12. Paint Me a Picture
    13. 13. Roll Tape!
    14. 14. Rights Issues
    15. 15. Integrating with Enterprise Systems
    16. 16. Data Exchange Formats
    17. 17. XML-Based Tools and Processes
    18. 18. Interfaces and APIs
    19. 19. Scripting Layers
    20. 20. UNIX Command Line Tools
    21. 21. Power Tools
    22. 22. Automation with Shell Scripts
    23. 23. Automation with AppleScript
    24. 24. Script Automation in Windows
    25. 25. Compiled and Interpreted Languages
    26. 26. GUI Tools and Processes
    27. 27. Building Tools
    28. 28. Keep It Moving
    29. 29. Publishing Systems
    30. 30. Adding Intelligence and Metrics
    31. 31. Lateral Thinking
    32. 32. The Bottom Line
  9. PART 2. Tutorials
    1. Tutorials 1: Calling Shell Commands from AppleScript
    2. Tutorials 2: Calling AppleScript from Shells
    3. Tutorials 3: Calling Visual Basic from AppleScript
    4. Tutorials 4: Calling Visual Basic from UNIX
    5. Tutorials 5: Calling UNIX Shell Commands from C
    6. Tutorials 6: Calling Java from C Language
    7. Tutorials 7: Calling C from Java
    8. Tutorials 8: What Your Web Server Log Can Tell You
    9. Tutorials 9: Monitoring Your Operating System Logs
    10. Tutorials 10: Measuring and Monitoring Disk Usage
    11. Tutorial 11: Wrapping FTP Transfers in a Script
    12. Tutorial 12: Wrapping gzip in a Shell Script
    13. Tutorial 13: Many-to-Many Relationships
    14. Tutorial 14: Phonetic Searches
    15. Tutorial 15: Fuzzy Searching and Sorting
    16. Tutorial 16: Finding Buffer Truncation Points
    17. Tutorial 17: Cleaning Unstructured Data
    18. Tutorial 18: Sorting Out Address Data
    19. Tutorial 19: Time Data Cleaning
    20. Tutorial 20: Removing Duplicates
    21. Tutorial 21: Converting TSV to XML
    22. Tutorial 22: Removing Macros from Word Documents
    23. Tutorial 23: Removing all Hyperlinks from Word
    24. Tutorial 24: Recognizing U.S. ZIP Codes
    25. Tutorial 25: Recognizing UK Postal Codes
    26. Tutorial 26: Finding Variable Names in Source Code
    27. Tutorial 27: Finding Double-Quoted Strings
    28. Tutorial 28: Finding Single-Quoted Strings
    29. Tutorial 29: Finding Currency Values
    30. Tutorial 30: Finding Time Values
    31. Tutorial 31: Recovering Text from Corrupted Documents
    32. Tutorial 32: Extracting Text from PDF Files
    33. Tutorial 33: Mail and HTTP Header Removal
    34. Tutorial 34: ISO 8601 Date Format Output
    35. Tutorial 35: Relative Date Tool (theDate)
    36. Tutorial 36: ZIP/Postal Code-to-Location Mapping
    37. Tutorial 37: Shortest Distance Between Two Towns
    38. Tutorial 38: Dealing with Islands
    39. Tutorial 39: Calculate Centroid of Area
    40. Tutorial 40: Extracting Text from Illustrator
    41. Tutorial 41: Generating Candidate Keywords
    42. Tutorial 42: Extracting Metadata from Word Documents
    43. Tutorial 43: Extracting Metadata from Image Files
    44. Tutorial 44: Extract Metadata from a QuickTime Movie
    45. Tutorial 45: Discovering Formats with File Magic
    46. Tutorial 46: Extracting Hyperlinks from Word Documents
    47. Tutorial 47: Extracting URLs from Raw Text
    48. Tutorial 48: Testing URL Hyperlinks
    49. Tutorial 49: Dictionary Lookups via Dict.org
    50. Tutorial 50: Lookup the Online Dictionary from a Web Page
    51. Tutorial 51: Check for Editorial Integrity
    52. Tutorial 52: Publish a Spreadsheet SYLK File
    53. Tutorial 53: Publish a Word RTF Document
    54. Tutorial 54: Publish an Adobe SVG
    55. Tutorial 55: Converting XML to HTML with XSLT
    56. Tutorial 56: Making Excel Spreadsheets with AppleScript
    57. Tutorial 57: Making Word Documents with AppleScript
    58. Tutorial 58: Scripting Alpha Channels in Photoshop
    59. Tutorial 59:Searching and Editing Word Docs
    60. Tutorial 60: Creating a Script Wrapper for Microsoft Word
    61. Tutorial 61: Putting It on the Desktop
    62. Tutorial 62: Remote Renderers and Compilers
  10. A. Data Exchange Containers
  11. B. Metadata Standards
  12. C. A Simple Metadata Dictionary
  13. D. Code Sets
  14. E. Regular Expressions 101
  15. Glossary
  16. Bibliography
  17. Webliography
  18. Index