spoken_task_oriented_parsing

Commits

List of commits on branch main.
Unverified
06489401e2b64128e5d1c8063552c2c68ed66752

add submission details & updated timeline

committed 2 years ago
Verified
4a31f9d5ce27c8d95fc4af34cdf54c542b46f8de

Merge pull request #6 from trangmle045/patch-3

AkshatSh committed 2 years ago
Verified
7b8bc695a6c2aa523b8107f84d13c0795019206c

Merge pull request #3 from suyounkimfb/patch-2

AkshatSh committed 2 years ago
Verified
dc2661e461075ba51bdeb7505ac802006af680ff

Merge pull request #4 from suyounkimfb/patch-3

AkshatSh committed 2 years ago
Verified
1db35ab4fc0de2327e349794a847113823232f85

Merge pull request #5 from trangmle045/patch-2

AkshatSh committed 2 years ago
Verified
c7fb3cead500e98f38d163cd5a828287aa090ed0

Update contact_us.md

trangmle045 committed 2 years ago

README

STOP Dataset

End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single model. It promises to improve the performance of assistant systems by leveraging acoustic information that is lost in the intermediate textual representation and by preventing cascading errors from Automatic Speech Recognition (ASR). Further, having one unified model has efficiency advantages when deploying assistant systems on-device. However, the limited number of public audio datasets with semantic parse labels hinders research progress in this area. In this paper, we release the Spoken Task-Oriented semantic Parsing (STOP) dataset, the largest and most complex publicly available SLU dataset. Additionally, we define low-resource splits to establish a benchmark for improving SLU when limited labeled data is available. Furthermore, in addition to the human-recorded audio, we are releasing a TTS-generated version to benchmark the performance of low-resource domain adaptation for end-to-end SLU systems. Initial experiments show end-to-end SLU models performing slightly worse than their cascaded counterparts, which we hope will encourage future work in this direction.

Updates

Questions

  • Please post your questions on our GitHub issues page, and we will get back to you!

Citation

Please use the following citation:

@inproceedings{stop2022,
  author    = {Paden Tomasello and Akshat Shrivastava and Daniel Lazar and Po-Chun Hsu and Duc Le and Adithya Sagar and Ali Elkahky and Jade Copet and Wei-Ning Hsu and Yossef Mordechay and Robin Algayres and Tu Anh Nguyen and Emmanuel Dupoux and Luke Zettlemoyer and Abdelrahman Mohamed},
  title     = {{STOP: A dataset for Spoken Task Oriented Semantic Parsing}},
  booktitle = {CoRR},
  eprinttype = {arXiv},
}
