GigaSpeech 2: 30,000-Hour Southeast Asian Multilingual Speech Recognition Open-Source Dataset

Release date: 2024/09/23
Data description:

Data access:

https://huggingface.co/datasets/speechcolab/gigaspeech2

GitHub:

https://github.com/SpeechColab/GigaSpeech2

Paper:

https://arxiv.org/pdf/2406.11546

Quality description:
30,000 hours of transcribed multilingual Southeast Asian audio