INSPIRE: Instruction-based Multi-Task Speech and Audio Processing Benchmark

alibaba/INSPIRE

Introduction

INSPIRE is an INstruction-based multi-task SPeech and audIo pRocessing bEnchmark. It is built to benchmark speech foundation models and includes a dataset and models. INSPIRE covers cross-modal tasks, including speech-to-text, text-to-speech, speech-to-speech, and audio-to-text, spanning recognition, understanding, and generation.

Dataset

  • INSPIRE dataset (coming soon)

Models

  • (coming soon)

License

This project is licensed under [The MIT License](https://opensource.org/licenses/MIT). INSPIRE also contains various third-party components and some code modified from other repos under other open source licenses.
