Difference between revisions of "Speech database"

Revision as of 15:26, 26 May 2017

The procedure outlined here uses utilities in the Berkeley Phonetic Machine, to produce time aligned TextGrids for recorded utterances in English.

1. create text transcripts of the words spoken in a sound file. Most plain text editors will be fine for this.

One potential complication is that plain text on windows may be saved "end of line" characters that are not compatible with <make_text_grids> - the perl utility that parses transcripts and passes them to <pyalign.py>. You can use the unix utility _d2u_ to change the line endings (this is a separate download into your instance of the BPM).

Revision as of 15:22, 26 May 2017 (view source) Keithjohnson (talk \| contribs) (Created page with "The procedure outlined here uses utilities in the Berkeley Phonetic Machine, to produce time aligned TextGrids for recorded utterances in English. 1. create text transcripts of …")		Revision as of 15:26, 26 May 2017 (view source) Keithjohnson (talk \| contribs) Newer edit →
Line 2:		Line 2:

	1. create text transcripts of the words spoken in a sound file. Most plain text editors will be fine for this.		1. create text transcripts of the words spoken in a sound file. Most plain text editors will be fine for this.
		+
−	~~# one bug~~ is that plain text on windows ~~has~~ "end of line" characters that are not compatible with <make_text_grids> - the perl utility that parses transcripts and passes them to <pyalign.py>	+	One potential complication is that plain text on windows may be saved "end of line" characters that are not compatible with <make_text_grids> - the perl utility that parses transcripts and passes them to <pyalign.py>. You can use the unix utility _d2u_ to change the line endings (this is a separate download into your instance of the BPM).

Difference between revisions of "Speech database"

Revision as of 15:26, 26 May 2017

Navigation menu

Page actions

Page actions

Personal tools

Navigation

Search

Tools