|
||
Website deprecated and outdated. Click here for the new site. | ||
Blitzscribe2A platform independent tool for the rapid transcription of large numbers of speech segments.Source Code and RequirementsBlitzscribe2 can be obtained as part of the Java Speech Toolkit (JSTK, package de.fau.cs.jstk.app.blitzscribe) or as a self contained .jar archive (download below). It requires Java6 or better. UsageBlitzscribe2 is mainly designed for keyboard interactions, to avoid time loss due to mouse-keyboard switches.
Fields are: waveform (1), playback progress (2), text field for transcription (3) and list of available turns (and their transcription, if available) (4). Mouse interactions:
Keyboard interactions:
Use the Open/Save/Save as Buttons to load or save the transcription files. Blitzscribe2 generates a protocol file (yourfile.trl~) that contains a journal of interactions, including timing. This allows to reconstruct the transcription process and learn about transcription time. Audio File FormatBlitzscribe is (for now) limited to 16kHz, 16bit, mono WAV RIFF data (with header), but can be easily modified to read any supported format of the JSTK (raw, speex, alaw, ...). Transcription File FormatThe transcription file format (extension .trl) is plain ASCII and basically a file list. Each line contains the filename (either absolute or relative to the directory of the trl file) and, if available, the transcription after a whitespace. If the file name ends on _SOMENUMBER_SOMENUMBER.wav, Blitzscribe expects these to be time marks in milliseconds, and it displays a number and duration of the turn instead of the filename. Example1: (with partial transcription) 20090427-Hornegger-IMIP01_0001480_0003230.wav so welcome to the Example2: file1.wav Protocol File FormatThe protocol file is named as the transcription file with a trailing '~' character. Each line is formatted as <UNIXTIMESTAMP_IN_MSEC> <FILE> <TRANSCRIPTION& SupportFeel free to post issues at http://code.google.com/p/jstk/issues/list or to the JSTK mailing list jstk(at)speech.informatik.uni-erlangen.de
|