Blitz Scribe: Speech Analysis for the Human Speech home Project Print

Blitz Scribe is a new approach to speech transcription driven by the demands of today's massive multimedia corpora. High-quality annotations are essential for indexing and analyzing many multimedia datasets; in particular, our study of language development for the Human Speech me Project depends on speech transcripts. Unfortunately, automatic speech transcription is inadequate for many natural speech recordings, and traditional approaches to manual transcription are extremely labor intensive and expensive. Blitz Scribe uses a semi-automatic approach, combining human and machine effort to dramatically improve transcription speed. Automatic methods identify and segment speech in dense, multitrack audio recordings, allowing us to build streamlined user interfaces maximizing human productivity. The first version of Blitz Scribe is already about 4-6 times faster than existing systems. It is exploring user-interface design, machine-learning and pattern-recognition techniques to build a human-machine collaborative system that will make massive transcription tasks feasible and affordable.

Institute/Organization : MIT Media Lab, Massachusetts Institute of Technology, USA

Research Team : Deb Roy (Director)

Status : Going on