online mix noise audio data in training step #2622

Open
wants to merge 32 commits into base: master

Conversation

mychiux413
Contributor

Mixing noise into the training files before runtime can make the data monotonous, but mixing noise at runtime can perform very badly if we read a noise audio file to augment each training row (e.g. on an HDD, mixing one audio file takes almost 100 times longer than freq_time_mask does).

To reduce the online mixing time, I use another tf.data.Dataset to cache the noise audio arrays, then mix them into the training data.
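A minimal sketch of the idea (illustrative only; the helper name create_noise_dataset and the exact pipeline details are assumptions, not the code in this PR): build a second tf.data.Dataset over the noise files, decode them once, cache the arrays, and cycle through them so each training row can draw one noise window.

import tensorflow as tf

def create_noise_dataset(noise_wav_paths, cache_path=''):
    # Decode each noise wav once; cache() with an empty path keeps the
    # decoded arrays in memory, a file path caches them on disk.
    def decode(path):
        samples, _ = tf.audio.decode_wav(tf.io.read_file(path), desired_channels=1)
        return tf.squeeze(samples, axis=-1)

    return (tf.data.Dataset.from_tensor_slices(noise_wav_paths)
            .map(decode, num_parallel_calls=tf.data.experimental.AUTOTUNE)
            .cache(cache_path)
            .shuffle(buffer_size=64)
            .repeat())

# An iterator over this dataset is then consumed inside the feeding
# pipeline, one noise window per training sample.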

usage:

python -u DeepSpeech.py --noshow_progressbar \
  --train_files data/ldc93s1/ldc93s1.csv \
  --test_files data/ldc93s1/ldc93s1.csv \
  --train_batch_size 1 \
  --test_batch_size 1 \
  --n_hidden 200 \
  --epochs 200 \
  --checkpoint_dir <checkpoint_dir> \
  --audio_aug_mix_noise_walk_dirs <directory1-contains-wav-files>,<directory2-contains-wav-files>
  • Just specify the noise file directory; the process will automatically walk the whole directory recursively and collect the .wav files (but it doesn't check the sample rate).
  • This program assumes the volume of every noise audio file has already been maximized, to save the time of calculating each speech/noise volume balance; it simply attenuates the speech audio by a value between 0 and -10 dB and the noise audio by a value between -25 and -50 dB.
  • The augmentation time can be as fast as freq_time_mask.
  • --audio_aug_mix_noise_walk_dirs can take multiple directories, comma separated.

To manually adjust the volume suppression:

python -u DeepSpeech.py \
...
--audio_aug_mix_noise_max_noise_db -25 \
--audio_aug_mix_noise_min_noise_db -50 \
--audio_aug_mix_noise_max_audio_db 0 \
--audio_aug_mix_noise_min_audio_db -10 \
...
  • If your noise files are pure non-speech noise, my experience suggests --audio_aug_mix_noise_max_noise_db -15 and --audio_aug_mix_noise_min_noise_db -25 (a sketch after this list shows how these dB values translate into gain factors).
  • If your noise files come from speakers, like a cocktail party, my experience suggests --audio_aug_mix_noise_max_noise_db -30 and --audio_aug_mix_noise_min_noise_db -50; otherwise the noise has a chance of covering the main speaker's voice.
  • If you want to cache the audio arrays on local disk, set --audio_aug_mix_noise_cache <your cache path>; otherwise they are cached in memory.
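For reference, here is a rough sketch of how such dB offsets become multiplicative gains; it mirrors the pow(10, db / 10) conversion visible in the augment_noise snippet quoted later in this thread, while the random draw itself is an assumption about the implementation.

import tensorflow as tf

def random_gain(min_db, max_db):
    # Draw a dB value in [min_db, max_db] and convert it to a linear ratio.
    db = tf.random.uniform([], minval=min_db, maxval=max_db)
    return tf.math.pow(10.0, db / 10.0)

audio_ratio = random_gain(-10.0, 0.0)    # speech scaled by 0 to -10 dB
noise_ratio = random_gain(-50.0, -25.0)  # noise scaled by -25 to -50 dB
# mixed = audio * audio_ratio + noise * noise_ratio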

@community-tc-integration

No Taskcluster jobs started for this pull request
The `allowPullRequests` configuration for this repository (in `.taskcluster.yml` on the
default branch) does not allow starting tasks for this pull request.

@DanBmh
Contributor

DanBmh commented Feb 5, 2020

I tested it with the Freesound Dataset Kaggle 2019, which has about 103 h of noise data.
Everything worked as intended. Only I didn't see a great difference in my training results (using the Voxforge DE dataset). Maybe it's too small.

@alokprasad

I tested it with the Freesound Dataset Kaggle 2019, which has about 103 h of noise data.
Everything worked as intended. Only I didn't see a great difference in my training results (using the Voxforge DE dataset). Maybe it's too small.

Did you mean the noise dataset is small, or the Voxforge dataset is small, comparatively?
One suggestion: if you feel the noise dataset is small, you can use rnnoise's dataset (https://people.xiph.org/~jm/demo/rnnoise/rnnoise_contributions.tar.gz)

@DanBmh
Contributor

DanBmh commented Feb 6, 2020

I did mean the Voxforge dataset. It has only around 32 h of speech data.

I think the rnnoise dataset is smaller than the Freesound one (6 vs 22 GB; I did not find the length in hours).

Also, the noise files of rnnoise are in .raw format, while Freesound already uses .wav format. So you need to convert them to wav somehow first.

Contributor

@DanBmh left a comment

Also think about replacing the cache() call with prefetch(tf.data.experimental.AUTOTUNE). For me it reduced the memory usage by about 64 GB without an impact on training speed.
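For illustration, the suggested change is a one-line swap in the noise pipeline (the variable name noise_set is an assumption):

# Before: cache() keeps every decoded noise array resident for the whole run
noise_set = noise_set.cache()

# After: prefetch only keeps a small look-ahead buffer in memory
noise_set = noise_set.prefetch(tf.data.experimental.AUTOTUNE)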

… for memory cost [MOD] deprecate FLAGS.audio_aug_mix_noise_cache
@mychiux413
Contributor Author

To use the rnnoise datasets, we should normalize the volume and convert the frame rate to 16000 manually; many of the rnnoise audio files are nearly silent without volume normalization.
This mix-noise process assumes the volume of every single noise file has been maximized, so it doesn't calculate dBFS to balance the speech/noise volume during processing.

@alokprasad

@mychiux413 Any idea how this can be done? Should it be an online process?

@mychiux413
Contributor Author

@mychiux413 Any idea how this can be done? Should it be an online process?

You should prepare the normalized noise files yourself before training starts.

There is no standard way to normalize volume; I can only offer an example, which you can optimize yourself. Don't forget to listen to the output audio to make sure everything sounds right.

notice:

  1. I use pydub in the example; before pip install pydub, you should install ffmpeg via sudo apt-get install ffmpeg.
  2. The raw data I downloaded from rnnoise is .raw, for which the frame rate, sample size, and channels must be specified manually.
  3. Some rnnoise files are almost 5 minutes long, which is unnecessary for online mixing, so the example splits them into chunks of around 30 seconds.
  4. The script targets a Python 3.7 environment (typing supported).

usage:

python <python_file.py> --from_dir <directory include rnnoise data> --to_dir <directory to output normalized data>
from __future__ import absolute_import, division, print_function
from pydub import AudioSegment
from multiprocessing import Pool
from functools import partial
import math
import argparse
import sys
import os


def detect_silence(sound: AudioSegment, silence_threshold=-50.0,
                   chunk_size=10) -> (int, int):
    start_trim = 0  # ms
    sound_size = len(sound)
    assert chunk_size > 0  # to avoid infinite loop
    while sound[start_trim:(
            start_trim +
            chunk_size)].dBFS < silence_threshold and start_trim < sound_size:
        start_trim += chunk_size

    end_trim = sound_size
    while sound[(end_trim - chunk_size):end_trim].dBFS < silence_threshold \
            and end_trim > 0:
        end_trim -= chunk_size

    start_trim = min(sound_size, start_trim)
    end_trim = max(0, end_trim)

    return min([start_trim, end_trim]), max([start_trim, end_trim])


def trim_silence_audio(sound: AudioSegment,
                       silence_threshold=-50.0,
                       chunk_size=10) -> AudioSegment:
    start_trim, end_trim = detect_silence(sound, silence_threshold, chunk_size)
    return sound[start_trim:end_trim]


def convert(filename: str, src_dir: str, dst_dirpath: str, dirpath: str,
            normalize: bool, trim_silence: bool, min_duration_seconds: float,
            max_duration_seconds: float):
    if not filename.endswith(('.wav', '.raw')):
        return
    filepath = os.path.join(dirpath, filename)
    if filename.endswith('.wav'):
        sound: AudioSegment = AudioSegment.from_file(filepath)
    else:
        try:
            sound: AudioSegment = AudioSegment.from_raw(filepath,
                                                        sample_width=2,
                                                        frame_rate=44100,
                                                        channels=1)
        except Exception as err:
            print('[retry] {}'.format(err))
            try:
                sound: AudioSegment = AudioSegment.from_raw(filepath,
                                                            sample_width=2,
                                                            frame_rate=48000,
                                                            channels=1)
            except Exception as err:
                print('bypass audio {}, got error: {}'.format(filepath, err))
                return
        try:
            sound = sound.set_frame_rate(16000)
        except Exception as err:
            print('[bypass] {}'.format(err))
            return

    n_splits: int = max(
        1, math.floor(sound.duration_seconds / max_duration_seconds))
    chunk_duration_ms = math.ceil(len(sound) / n_splits)
    chunks = []
    for i in range(n_splits):
        end_ms = min((i + 1) * chunk_duration_ms, len(sound))
        chunk = sound[(i * chunk_duration_ms):end_ms]
        chunks.append(chunk)
    for i, chunk in enumerate(chunks):
        dst_path = os.path.join(dst_dirpath, str(i) + '_' + filename)
        if dst_path.endswith('.raw'):
            dst_path = dst_path[:-4] + '.wav'
        if os.path.exists(dst_path):
            print('audio exists: {}'.format(dst_path))
            return
        if normalize:
            chunk = chunk.normalize()
            if chunk.dBFS < -30.0:
                chunk = chunk.compress_dynamic_range().normalize()
            if chunk.dBFS < -30.0:
                chunk = chunk.compress_dynamic_range().normalize()
        if trim_silence:
            chunk = trim_silence_audio(chunk)

        if chunk.duration_seconds < min_duration_seconds:
            return
        chunk.export(dst_path, format='wav')


def main(src_dir: str,
         dst_dir: str,
         min_duration_seconds: float,
         max_duration_seconds: float,
         normalize=True,
         trim_silence=True):
    assert os.path.exists(src_dir)
    if not os.path.exists(dst_dir):
        os.makedirs(dst_dir, exist_ok=False)
    src_dir = os.path.abspath(src_dir)
    dst_dir = os.path.abspath(dst_dir)

    # n_data = 0
    for dirpath, _, filenames in os.walk(src_dir):
        dirpath = os.path.abspath(dirpath)
        dst_dirpath = os.path.join(dst_dir,
                                   dirpath.replace(src_dir, '').lstrip('/'))
        print('converting dirpath: {} -> {}'.format(dirpath, dst_dirpath))
        if not os.path.exists(dst_dirpath):
            os.makedirs(dst_dirpath, exist_ok=False)

        convert_func = partial(convert,
                               src_dir=src_dir,
                               dst_dirpath=dst_dirpath,
                               dirpath=dirpath,
                               normalize=normalize,
                               trim_silence=trim_silence,
                               min_duration_seconds=min_duration_seconds,
                               max_duration_seconds=max_duration_seconds)
        # Use the pool as a context manager so worker processes are cleaned up
        # after each directory instead of leaking across loop iterations.
        with Pool() as p:
            p.map(convert_func, filenames)


if __name__ == "__main__":
    PARSER = argparse.ArgumentParser(description='Optimize noise files')
    PARSER.add_argument('--from_dir',
                        help='Convert wav from directory',
                        type=str)
    PARSER.add_argument('--to_dir', help='save wav to directory', type=str)
    PARSER.add_argument('--min_sec',
                        help='min duration seconds of saved file',
                        type=float,
                        default=1.0)
    PARSER.add_argument('--max_sec',
                        help='max duration seconds of saved file',
                        type=float,
                        default=30.0)
    PARSER.add_argument('--normalize',
                        action='store_true',
                        help='Normalize volume, default is true',
                        default=True)
    PARSER.add_argument('--trim',
                        action='store_true',
                        help='Trim silence, default is true',
                        default=True)
    PARAMS = PARSER.parse_args()

    main(PARAMS.from_dir, PARAMS.to_dir, PARAMS.min_sec, PARAMS.max_sec,
         PARAMS.normalize, PARAMS.trim)

@DanBmh
Copy link
Contributor

DanBmh commented Feb 17, 2020

There is no standard way to normalize volume; I can only offer an example, which you can optimize yourself. Don't forget to listen to the output audio to make sure everything sounds right.

Could you add this script to your pull request?

I added a progressbar and a summary to it, feel free to copy it back. The updated code is here: https://github.com/DanBmh/deepspeech-german/blob/master/data/normalize_noise_audio.py

@mychiux413
Copy link
Contributor Author

I added bin/normalize_noise_audio.py, and did some modifications:

  1. Removed typing for environment compatibility
  2. Fixed pylint errors, and added a warning message for ImportError of tqdm & pydub, because they are not standard packages in requirements.txt
  3. Replaced seconds_to_hours() with util/feeding.py::secs_to_hours()

Usage:

python bin/normalize_noise_audio.py --from_dir <directory include noise data> --to_dir <directory to output normalized data>

@alokprasad

@mychiux413 Is there any way we can dump the mixed files and see how effective the mixing of noise into the speech files is? Just to make sure the mixing is proper.

@mychiux413
Contributor Author

@alokprasad You're right. In fact, all the augmented audio should be reviewable in the pipeline, even augmentations on the spectrogram like pitch/tempo/mask, or we would have no basis for tuning the proper parameters.
But in TensorFlow's pipeline it's not as simple as offline augmentation: we should dump the audio data into TensorBoard via tf.summary.audio. I'm still studying this method, and also trying to figure out how much refactoring it would require.
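A minimal sketch of that approach (assuming TF 1.x graph mode and a 1-D mixed_audio tensor; not code from this PR):

# Attach an audio summary to the augmented sample so it can be reviewed
# in TensorBoard next to the training curves.
audio_summary = tf.summary.audio('augmented_audio',
                                 tf.expand_dims(mixed_audio, 0),  # [1, samples]
                                 sample_rate=16000,
                                 max_outputs=3)
# The summary op still has to be fetched in the session run and written by
# the existing FileWriter, which is where the refactoring cost comes from.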

@alokprasad

alokprasad commented Feb 20, 2020

@mychiux413
I also tried to save the audio using tf.print's output_stream option in the following function:

"def augment_noise"
    noise_ratio = tf.math.pow(10.0, choosen_noise_db / 10)
    mixed_audio = tf.multiply(audio, audio_ratio) + tf.multiply(mixed_noise, noise_ratio)
    #save to wav file              
    final_pcm = contrib_audio.encode_wav(mixed_audio,16000)
    tf.print(final_pcm,output_stream="file:///tmp/test.wav",summarize=-1)
    return mixed_audio
    #return tf.multiply(audio, audio_ratio) + tf.multiply(mixed_noise, noise_ratio)

But I am facing two problems:

  1. I am not able to change the output_stream parameter dynamically, so that multiple wav files could be saved.
  2. The file size keeps growing, so we have to stop training with Ctrl+C after a few steps.

Anyway, when I listen to the audio, I don't think the noise is being mixed into the speech at all.
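A possible workaround for problem 1, sketched here as an assumption rather than code from this PR: build the filename as a tensor and write each example with tf.io.write_file, so every mixed sample lands in its own wav file (step_counter stands for any per-example integer tensor, e.g. the global step).

# Assumes mixed_audio already has shape [samples, 1] as expected by encode_wav.
wav_bytes = tf.audio.encode_wav(mixed_audio, 16000)
filename = tf.strings.join(
    ['/tmp/mixed_', tf.strings.as_string(step_counter), '.wav'])
write_op = tf.io.write_file(filename, wav_bytes)
# write_op must be run (or added as a control dependency) in graph mode,
# otherwise nothing is written.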

@mychiux413
Contributor Author

@alokprasad I tried tf.print and listened to the audio; it really is augmented. Maybe my default parameters are too conservative (because some noise data are "speech noise", and I don't know what they would cause if too loud). Also, the process will not augment every single audio time step, but just randomly augments an interval for each audio, and many intervals in the noise files are actually silence.
Don't forget to delete test.wav before each execution, or you will always hear the same output.
Try an extreme example: --audio_aug_mix_noise_max_noise_db=5, --audio_aug_mix_noise_min_noise_db=10, to make sure the noise really is there.

Here is another tip: you can also try --audio_aug_mix_noise_max_audio_db=10, which can simulate an over-boosted microphone effect.

@alokprasad

@mychiux413 "the process will not augment every single audio time step, but just randomly augments an interval for each audio": I think this might not produce a good result; I think every interval should be mixed with noise (i.e. the complete file should be mixed with noise).

In fact, it would be good if the same audio were fed to the network twice:

  1. mixed with noise
  2. without noise.

I have added an extra "noise_flag" column to the transcript .csv file, whose value is 0 or 1.
E.g. the csv file will contain the following:

wav_filename,wav_filesize,transcript,noise_flag
test1.wav,3423,"where are you?",1
test1.wav,3423,"where are you?",0

1 means mix noise and 0 means do not mix noise.

Relevant code changes:

if train_phase and noise_iterator:
    audio = tf.cond(noise_flag > 0,
        lambda: augment_noise(
            audio,
            noise_iterator.get_next(),
            change_audio_db_max=FLAGS.audio_aug_mix_noise_max_audio_db,
            change_audio_db_min=FLAGS.audio_aug_mix_noise_min_audio_db,
            change_noise_db_max=FLAGS.audio_aug_mix_noise_max_noise_db,
            change_noise_db_min=FLAGS.audio_aug_mix_noise_min_noise_db,
        ),
        lambda: audio)

@DanBmh
Contributor

DanBmh commented Mar 25, 2020

Yes, it makes sense, and I will try it, but there would be twice as many arguments as in the previous version. How about specifying the number of sub-speakers for each speech sample? Would that be helpful for your experiments?

Do you mean augmenting with not only one but multiple background speech or noise files at once? If you don't think it's too complicated, this is an interesting idea. It would make the background noises even more realistic. In this case I would suggest making the number not fixed, but random with an upper bound, to simulate different environments.
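A small NumPy sketch of that idea (offline and purely illustrative; the helper and its arguments are assumptions, not part of this PR): overlay a random number of noise windows, up to an upper bound, each with its own random gain.

import numpy as np

def mix_random_noises(audio, noise_windows, max_sources=3, rng=np.random):
    # Overlay 1..max_sources noise windows onto the speech signal.
    mixed = audio.astype(np.float32)
    for _ in range(rng.randint(1, max_sources + 1)):
        noise = noise_windows[rng.randint(len(noise_windows))]
        noise = np.resize(noise, audio.shape)  # crude length matching
        gain_db = rng.uniform(-50.0, -25.0)
        mixed = mixed + noise.astype(np.float32) * (10.0 ** (gain_db / 10.0))
    return mixed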

@dabinat
Collaborator

dabinat commented Mar 28, 2020

Here’s a question: is it necessary to run augmentation on every epoch? It seems like augmentation is probably more valuable as the model nears convergence. I wonder if you could balance out the performance hit by not augmenting the first x epochs, when the model still has a high WER.

Daniel added 4 commits March 29, 2020 12:49
…setest

# Conflicts:
#	DeepSpeech.py
#	evaluate.py
#	util/feeding.py
#	util/flags.py
@mychiux413
Contributor Author

Here’s a question: is it necessary to run augmentation on every epoch? It seems like augmentation is probably more valuable as the model nears convergence. I wonder if you could balance out the performance hit by not augmenting the first x epochs, when the model still has a high WER.

Here are my recent experiment results below (still continuing...). I trained 20 epochs for every model with different parameters.

  • noise files: rnnoise, pointsources noise
  • train dataset: librivox clean-100.csv, clean-300.csv, other-500.csv
  • test dataset: test-clean.csv
  • the loss records are from the final step (epoch = 19)
  • in addition, I also mixed zh-TW speech into librivox and tested the WER.
| Name | min_audio_dbfs | max_audio_dbfs | min_snr_db | max_snr_db | limit_audio_peak_dbfs | limit_noise_peak_dbfs | train loss | dev loss | test loss | test wer | test loss (mix TW speech) | test wer (mix TW speech) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Baseline (No Augmentation) | | | | | | | 27.685342 | 24.046401 | 23.756416 | 0.137232 | 121.442734 | 0.454246 |
| Default mix noise | 0 | -35 | 3 | 30 | 7 | 3 | 69.323678 | 21.669104 | 21.383959 | 0.112958 | 60.703743 | 0.270337 |
| speech non over boosted | 0 | -35 | 3 | 30 | 0 | 3 | 64.432057 | 21.491052 | 21.344168 | 0.11471 | 60.352631 | 0.261519 |
| noise non over boosted | 0 | -35 | 3 | 30 | 7 | 0 | 66.458655 | 21.09868 | 21.09868 | 0.111596 | 62.270283 | 0.269928 |
| Wide speech volume | 0 | -45 | 3 | 30 | 7 | 3 | 67.366901 | 21.060449 | 20.68895 | 0.116559 | 59.696766 | 0.2673 |

The result shows:

  1. Whatever the noise parameters are, the test WER is always better than that of the "No Aug" model.
  2. The robustness to noise (column test wer (mix TW speech)) improves greatly with mixed-noise training.
  3. Don't be misled by the training loss when mixing with noise, because the space of data covered is larger than without augmentation.
  4. Looking at the noise-mix training, the parameters may involve some trade-offs: if we want to handle cocktail-party speech better, we might lose some accuracy on the clean test. In my opinion, skipping the first x epochs to emphasize the clean environment should be equivalent to increasing the max SNR, so the noise test would then be worse.

So my conclusion is:

  • Tune the noise parameters according to your target application environment, which should be equivalent to tuning "skip the first x epochs".
  • Of course I will also try your idea if I have free resources later.

@alokprasad

@mychiux413 How are you generating the test samples? Is it natural voice with a noisy background, or have you mixed clean speech with noise and then used that as the test wav?

@mychiux413
Contributor Author

@alokprasad Mixed clean speech with noise, using the new feature --test_augmentation_files; the test dataset is always librivox-test-clean.csv.

…g, add option to mix multi noise into one audio [MOD] change FLAGS name, gla iterations is optional
@tilmankamp
Contributor

@mychiux413 Master changed quite a bit since you opened this in December. Could you rebase (and squash) it?

@mychiux413
Contributor Author

@tilmankamp Maybe I should wait for it; the latest master did so much refactoring that ./bin/run-ldc93s1.sh doesn't even work, and furthermore I haven't fully understood the new project structure.

@DanBmh
Contributor

DanBmh commented Apr 9, 2020

What's the reason for the last commit (no-sort merge)?

Daniel added 4 commits April 12, 2020 20:01
# Conflicts:
#	DeepSpeech.py
#	evaluate.py
#	training/deepspeech_training/util/feeding.py
@DanBmh
Contributor

DanBmh commented Apr 17, 2020

@tilmankamp Maybe I should wait for it; the latest master did so much refactoring that ./bin/run-ldc93s1.sh doesn't even work, and furthermore I haven't fully understood the new project structure.

@mychiux413 Sent you a pull request.

@reuben
Contributor

reuben commented Apr 17, 2020

What's the reason for the last commit (no-sort merge)?

I think @carlfm01 just did an incorrect push at some point. @mychiux413 should be able to just force-push over it.

Daniel and others added 3 commits April 23, 2020 10:47
Revert "Merge branch 'no-sort' into more-augment-options"

This reverts commit 7792226, reversing
changes made to f7d1279.
Merge current master for rebase to v0.7
@DanBmh
Contributor

DanBmh commented May 19, 2020

Am I right that this is now outdated with @tilmankamp's merged pull request #2897?

The overlay augmentation docs describe the same mixing features of noise and speech files.

@tilmankamp
Contributor

@DanBmh Unfortunately yes. Due to the massive amount of data that we plan to use for overlaying, things had to be integrated more tightly with the sample-reading facilities in util/sample_collections.py. Also, some of the augmentations would have been hard to realize on the TensorFlow side of things. Sorry for this decision!

@DanBmh
Contributor

DanBmh commented May 19, 2020

@DanBmh Unfortunately yes. Due to the massive amount of data that we plan to use for overlaying, things had to be integrated more tightly with the sample-reading facilities in util/sample_collections.py. Also, some of the augmentations would have been hard to realize on the TensorFlow side of things. Sorry for this decision!

Maybe you could have informed us earlier, but I'm glad this feature is now in the master branch :) And you also added some other interesting augmentations.

Do you already plan to use the noise augmentation for the next checkpoint release?

@JRMeyer
Contributor

JRMeyer commented Sep 22, 2020

@lissyx @reuben -- it seems like this PR can be closed

@DanBmh
Contributor

DanBmh commented Sep 22, 2020

Just wanted to note that this PR still has an important feature which is missing in @tilmankamp's overlay implementation: The possibility to run tests with noise mixing.

@lissyx
Collaborator

lissyx commented Sep 23, 2020

Just wanted to note that this PR still has an important feature which is missing in @tilmankamp's overlay implementation: The possibility to run tests with noise mixing.

This needs rebasing anyway, but if someone wants to do it and address the issues, it's welcome
