Difference between revisions of "Speech database"

From Phonlab
Jump to navigationJump to search
Line 4: Line 4:
 
Create text transcripts of the words spoken in a sound file. Most plain text editors will be fine for this.
 
Create text transcripts of the words spoken in a sound file. Most plain text editors will be fine for this.
   
One potential complication is that plain text on windows may be saved "end of line" characters that are not compatible with <make_text_grids>, the perl utility that parses transcripts and passes them to <pyalign.py>. You can use the unix utility _d2u_ to change the line endings (this is a separate download into your instance of the BPM).
+
One potential complication is that plain text on windows may be saved "end of line" characters that are not compatible with ''make_text_grids'', the perl utility that parses transcripts and passes them to ''pyalign.py''. You can use the unix utility ''d2u'' to change the line endings (this is a separate download into your instance of the BPM).
  +
  +
If there is a portion of the audio file that you don't want to transcribe, and thus won't include in the tagged database, you can add a "skip region" line to the transcript. For example, if there is a 300ms long beep at the beginning of the audio file you can skip the beep by
  +
  +
# 0,0.3
  +
word, word, word
  +
# 1,1.7
  +
  +
  +
==Align transcript to wave==
  +
The perl script

Revision as of 15:40, 26 May 2017

The procedure outlined here uses utilities in the Berkeley Phonetic Machine, to produce time aligned TextGrids for recorded utterances in English.

Transcript

Create text transcripts of the words spoken in a sound file. Most plain text editors will be fine for this.

One potential complication is that plain text on windows may be saved "end of line" characters that are not compatible with make_text_grids, the perl utility that parses transcripts and passes them to pyalign.py. You can use the unix utility d2u to change the line endings (this is a separate download into your instance of the BPM).

If there is a portion of the audio file that you don't want to transcribe, and thus won't include in the tagged database, you can add a "skip region" line to the transcript. For example, if there is a 300ms long beep at the beginning of the audio file you can skip the beep by

 # 0,0.3
word, word, word
 # 1,1.7


Align transcript to wave

The perl script