AUTOMATED CHECKS
check_dictionary
Checks that each word in the transcription files is also present
in the dictionary.
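
As a rough illustration of this kind of lookup, the sketch below reports
transcript words that are absent from a word list. It is not the tool's
actual code. The function name find_missing_words, the whitespace
tokenization, and the case-insensitive comparison are all assumptions.

```python
def find_missing_words(transcript_lines, dictionary_words):
    """Return the sorted list of transcript words absent from the dictionary.

    Assumes words are whitespace-separated and compared case-insensitively.
    """
    known = {w.lower() for w in dictionary_words}
    missing = set()
    for line in transcript_lines:
        for word in line.split():
            if word.lower() not in known:
                missing.add(word.lower())
    return sorted(missing)


if __name__ == "__main__":
    print(find_missing_words(["hello world", "hello there"], ["hello", "world"]))
```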
check_energy
Reports the overall signal-to-noise ratio and the noise floor of each
input conversation. It also lists each utterance in that conversation
whose boundary lies in a region where the energy exceeds the noise
floor by more than the specified deviation.
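
The boundary test described above might be sketched as follows. This is
only one plausible reading, not the tool's method. The frame-based energy
measure, the noise floor estimate (mean of the quietest frames), and all
function and parameter names are assumptions. The signal-to-noise ratio
itself is left out because its definition is not given here.

```python
import math


def frame_energies_db(samples, frame_len):
    """Mean-square energy in dB for consecutive non-overlapping frames."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        power = sum(s * s for s in frame) / frame_len
        # Small offset avoids log10(0) on digital silence.
        energies.append(10.0 * math.log10(power + 1e-12))
    return energies


def noise_floor_db(energies_db, fraction=0.1):
    """Assumed estimate: mean energy of the quietest `fraction` of frames."""
    count = max(1, int(len(energies_db) * fraction))
    quiet = sorted(energies_db)[:count]
    return sum(quiet) / len(quiet)


def flag_boundaries(energies_db, boundary_frames, floor_db, deviation_db):
    """Return boundary frame indices whose energy exceeds floor + deviation."""
    return [b for b in boundary_frames
            if energies_db[b] > floor_db + deviation_db]


if __name__ == "__main__":
    samples = [0.001] * 40 + [0.5] * 40
    energies = frame_energies_db(samples, frame_len=10)
    floor = noise_floor_db(energies)
    print(flag_boundaries(energies, [0, 3, 4, 7], floor, deviation_db=10.0))
```

A boundary flagged this way would suggest that the utterance was cut
inside speech rather than in a quiet region.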
check_silence
Determines the length of the silence-only utterances in a
transcription file and flags those that are too short.
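
A minimal sketch of such a check is shown below. It assumes utterances
are available as (start, end, text) tuples with times in seconds. The
silence marker "[silence]" and the min_duration threshold are
hypothetical, since neither the transcription format nor the cutoff is
stated here.

```python
SILENCE_LABEL = "[silence]"  # hypothetical marker for silence-only utterances


def short_silences(utterances, min_duration):
    """Return (start, end) for silence-only utterances shorter than min_duration."""
    flagged = []
    for start, end, text in utterances:
        if text.strip() == SILENCE_LABEL and (end - start) < min_duration:
            flagged.append((start, end))
    return flagged


if __name__ == "__main__":
    utterances = [
        (0.0, 0.05, "[silence]"),
        (0.05, 1.5, "hello"),
        (1.5, 2.5, "[silence]"),
    ]
    print(short_silences(utterances, min_duration=0.1))
```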
check_bounds
Ensures that the start time of every utterance or word
is equal to the end time of the previous utterance or word. It also
makes sure that the end time of the last utterance or word is
equal to the size of the file up to six significant digits and
that the start time of the first utterance is zero.
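
The three conditions above can be sketched as follows. This is an
illustration under assumptions, not the tool's code: segments are taken
to be (start, end) pairs in seconds, the file size is taken to be a
duration in the same unit, and the names find_bound_errors and sig6 are
hypothetical.

```python
def sig6(value):
    """Round a number to six significant digits."""
    return float(f"{value:.6g}")


def find_bound_errors(segments, file_duration):
    """Return a list of problems found in a sequence of (start, end) segments."""
    if not segments:
        return ["no segments"]
    problems = []
    if segments[0][0] != 0.0:
        problems.append("first segment does not start at zero")
    for i in range(1, len(segments)):
        # Each segment must begin exactly where the previous one ended.
        if segments[i][0] != segments[i - 1][1]:
            problems.append(f"gap or overlap before segment {i}")
    if sig6(segments[-1][1]) != sig6(file_duration):
        problems.append("last segment does not end at the file length")
    return problems


if __name__ == "__main__":
    print(find_bound_errors([(0.0, 1.25), (1.25, 3.0)], 3.0000001))
    print(find_bound_errors([(0.1, 1.0), (1.2, 2.0)], 2.5))
```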
utterance_hist
Determines the length of the non-silence-only utterances
in all transcription files and builds a distribution, reporting
statistics such as the mean duration, variance, standard deviation,
and the number of utterances within 1, 2, 3, 4, and 5 standard
deviations of the mean.
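
The statistics named above could be computed along these lines. The
sketch assumes the durations are already extracted as a list of seconds
and uses the population variance, which is a guess; the function name
duration_summary is hypothetical.

```python
import statistics


def duration_summary(durations):
    """Summarize utterance durations: mean, variance, std, and counts
    of utterances within 1 to 5 standard deviations of the mean."""
    mean = statistics.fmean(durations)
    variance = statistics.pvariance(durations, mu=mean)  # population variance (assumed)
    std = variance ** 0.5
    within = {
        k: sum(1 for d in durations if abs(d - mean) <= k * std)
        for k in range(1, 6)
    }
    return {"mean": mean, "variance": variance, "std": std, "within": within}


if __name__ == "__main__":
    summary = duration_summary([1.0, 2.0, 3.0, 4.0, 5.0])
    for key, value in summary.items():
        print(key, value)
```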
check_speech_rate
Determines the speech rate of every non-silence utterance
and reports statistics for this information. It also flags utterances
with abnormally high or low speech rates.
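
One way to picture this check is sketched below. The unit (words per
second), the outlier rule (more than max_dev standard deviations from
the mean), and the function names are all assumptions; the criterion the
tool really uses for "abnormal" is not stated here.

```python
import statistics


def speech_rates(utterances):
    """Words per second for each (start, end, text) utterance of positive length."""
    return [
        len(text.split()) / (end - start)
        for start, end, text in utterances
        if end > start
    ]


def flag_abnormal(rates, max_dev=2.0):
    """Return indices of rates more than max_dev standard deviations from the mean."""
    mean = statistics.fmean(rates)
    std = statistics.pstdev(rates)
    if std == 0:
        return []
    return [i for i, r in enumerate(rates) if abs(r - mean) > max_dev * std]


if __name__ == "__main__":
    rates = [3.0] * 9 + [12.0]
    print(flag_abnormal(rates))
```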