You are previewing Voice Application Development for Android.
O'Reilly logo
Voice Application Development for Android

Book Description

A practical guide to develop advanced and exciting voice applications for Android using open source software

  • A comprehensive guide containing all the best practices for voice application development for Android

  • Progress quickly from basic apps to more advanced topics

  • Written in an easy-to-follow style with detailed descriptions of the included code examples to help you learn quickly and efficiently

In Detail

Speech technology has been around for some time now. However, it has only more recently captured the imagination of the general public with the advent of personal assistants on mobile devices that you can talk to in your own language. The potential of voice apps is huge as a novel and natural way to use mobile devices.

Voice Application Development for Android is a practical, hands-on guide that provides you with a series of clear, step-by-step examples which will help you to build on the basic technologies and create more advanced and more engaging applications. With this book, you will learn how to create useful voice apps that you can deploy on your own Android device in no time at all.

This book introduces you to the technologies behind voice application development in a clear and intuitive way. You will learn how to use open source software to develop apps that talk and that recognize your speech. Building on this, you will progress to developing more complex apps that can perform useful tasks, and you will learn how to develop a simple voice-based personal assistant that you can customize to suit your own needs.

Table of Contents

  1. Voice Application Development for Android
    1. Table of Contents
    2. Voice Application Development for Android
    3. Credits
    4. Foreword
    5. About the Authors
    6. Acknowledgement
    7. About the Reviewers
      1. Support files, eBooks, discount offers and more
        1. Why Subscribe?
        2. Free Access for Packt account holders
    9. Preface
      1. What this book covers
      2. What you need for this book
      3. Who this book is for
      4. Conventions
      5. Reader feedback
      6. Customer support
        1. Downloading the example code
        2. Web page for the book
        3. Errata
        4. Piracy
        5. Questions
    10. 1. Speech on Android Devices
      1. Using speech on an Android device
        1. Speech-to-text
        2. Text-to-speech
        3. Voice Search
        4. Android Voice Actions
        5. Virtual Personal Assistants
      2. Designing and developing a speech app
      3. Why Google speech?
      4. What is needed to create a Virtual Personal Assistant?
      5. Summary
    11. 2. Text-to-Speech Synthesis
      1. Introducing text-to-speech synthesis
      2. The technology of text-to-speech synthesis
      3. Using pre-recorded speech instead of TTS
      4. Using Google text-to-speech synthesis
        1. Starting the TTS engine
      5. Developing applications with Google TTS
        1. TTSWithLib app – Reading user input
        2. TTSReadFile app – Reading a file out loud
      6. Summary
    12. 3. Speech Recognition
      1. The technology of speech recognition
      2. Using Google speech recognition
      3. Developing applications with the Google speech recognition API
        1. ASRWithIntent app
        2. ASRWithLib app
      4. Summary
    13. 4. Simple Voice Interactions
      1. Voice interactions
      2. VoiceSearch app
      3. VoiceLaunch app
      4. VoiceSearchConfirmation app
      5. Summary
    14. 5. Form-filling Dialogs
      1. Form-filling dialogs
      2. Implementing form-filling dialogs
      3. Threading
      4. XMLLib
      5. FormFillLib
        1. VXMLParser
        2. DialogInterpreter
      6. MusicBrain app
      7. Summary
    15. 6. Grammars for Dialog
      1. Grammars for speech recognition and natural language understanding
      2. NLU with hand-crafted grammars
      3. Statistical NLU
        1. NLULib
        2. Processing XML grammars
        3. Processing statistical grammars
      4. The GrammarTest app
      5. Summary
    16. 7. Multilingual and Multimodal Dialogs
      1. Multilinguality
      2. Multimodality
      3. Summary
    17. 8. Dialogs with Virtual Personal Assistants
      1. The technology of VPA
        1. Determining the user's intention
      2. Making an appropriate response
      3. Pandorabots
        1. AIML
          1. Using oob tag to add additional functions
      4. The VPALib library
      5. Creating a Pandorabot
      6. Sample VPAs – Jack, Derek, and Stacy
        1. Alternative approaches
      7. Summary
    18. 9. Taking it Further
      1. Developing a more advanced Virtual Personal Assistant
      2. Summary
    19. A. Afterword
    20. Index