The following statistics are being compiled at ISIP: cd_location.text => contains the CD location of each conversation in the Swithcboard Corpus *conv_all.text => conversation_sources.text => defines the source for the audio data for each conversation in the Switchboard Corpus. *speaker_stats.text => contains data for each speaker in the Switchboard Corpus * implies that these files are in progress =============================================================================== The following statistics are part of the LDC release and/or the WS97 SWB project. missing_mrk_files.text => contains a list of those conversations which do not have corresponding transcription (.mrk and .txt) in the LDC release WS97_conv_stats.text => contains data for each conversation in the WS97 corpus WS97_caller_stats.text => contains data for each speaker in the WS97 Corpus WS97_speaker_stats.text => contains summaries for each speaker in the WS97 Corpus WS97_conv_hist.text => contains detailed statistics about the utterances for each conversation in the WS97 Corpus WS97_topic_stats.text => definition of each topic number used in the WS97 Corpus