athena.data.datasets.speech_set_kaldiio¶
audio dataset
Module Contents¶
Classes¶
SpeechDatasetKaldiIOBuilder |
SpeechDatasetKaldiIOBuilder |
-
class
athena.data.datasets.speech_set_kaldiio.SpeechDatasetKaldiIOBuilder(config=None)¶ Bases:
athena.data.datasets.base.BaseDatasetBuilderSpeechDatasetKaldiIOBuilder
-
default_config¶
-
num_class¶ return the max_index of the vocabulary
-
speaker_list¶ return the speaker list
-
audio_featurizer_func¶ return the audio_featurizer function
-
sample_type¶
-
sample_shape¶
-
sample_signature¶
-
reload_config(self, config)¶ reload the config
-
preprocess_data(self, file_dir, apply_sort_filter=True)¶ generate a list of tuples (feat_key, speaker).
-
load_scps(self, file_dir)¶ load kaldi-format feats.scp and utt2spk (optional)
-
__getitem__(self, index)¶
-
__len__(self)¶ return the number of data samples
-
filter_sample_by_input_length(self)¶ filter samples by input length
The length of filterd samples will be in [min_length, max_length)
Returns: a filtered list of tuples (wav_filename, wav_len, speaker) Return type: entries
-
compute_cmvn_if_necessary(self, is_necessary=True)¶ compute cmvn file
-