You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+6-3Lines changed: 6 additions & 3 deletions
Original file line number
Diff line number
Diff line change
@@ -38,6 +38,12 @@ A curated list of awesome Voiceprint Recognition papers.
38
38
## 2018
39
39
*[GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION](https://arxiv.org/abs/1710.10467v3)
40
40
41
+
# Data Augmentation
42
+
*[SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition](https://arxiv.org/abs/1904.08779v3) | [code](https://github.com/DemisEom/SpecAugment)
43
+
*[SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification](https://arxiv.org/abs/2103.16858)
44
+
*[MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition](https://arxiv.org/abs/2102.12664)
45
+
*[Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition](https://arxiv.org/abs/2102.09106)
46
+
41
47
# Others
42
48
*[Multichannel CRNN for Speaker Counting: an Analysis of Performance](https://arxiv.org/abs/2101.01977)
43
49
*[End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN](https://arxiv.org/abs/2101.05056)
@@ -46,6 +52,3 @@ A curated list of awesome Voiceprint Recognition papers.
46
52
*[GISE-51: A scalable isolated sound events dataset](https://arxiv.org/abs/2103.12306)
47
53
*[Guided Training: A Simple Method for Single-channel Speaker Separation](https://arxiv.org/abs/2103.14330)
48
54
*[SubSpectral Normalization for Neural Audio Data Processing](https://arxiv.org/abs/2103.13620)
49
-
*[SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification](https://arxiv.org/abs/2103.16858)
50
-
*[MixSpeech: Data Augmentation for Low-resource Automatic Speech Recognition](https://arxiv.org/abs/2102.12664)
51
-
*[Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition](https://arxiv.org/abs/2102.09106)
0 commit comments