Readings in Multimedia Computing and Networking by Hong Jiang Zhang, Kevin Jeffay

23

Toward Content-Based Audio Indexing and Retrieval and a New Speaker Discrimination Technique

Lonce Wyse and Stephen W. Smoliar, National University of Singapore

Several techniques for identifying segment transitions in an audio stream are discussed. Gross features are identified first and used to control more detailed, computationally expensive analysis downstream. Pitch is tracked using some basic streaming principles and then used as one cue to speaker transitions. A novel speaker discrimination technique is described that makes a segmentation decision when a continuously updated model of the current speaker suddenly ceases to account sufficiently for the input data.

23.1 INTRODUCTION

Despite the multimedia hype, video and audio ...
