GitXplorerGitXplorer
f

spoken_task_oriented_parsing

public
8 stars
2 forks
5 issues

Commits

List of commits on branch main.
Verified
51e877fef0141119d2db39c7212e9558d93415ca

Update call_for_participation.md

ttrangmle045 committed 2 years ago
Unverified
f117db2ed3b37c456e81ec84a678c14b3bcc5ef0

Merge branch 'main' of github.com:facebookresearch/spoken_task_oriented_parsing

committed 2 years ago
Unverified
ba0d0e99d4a0c635220c848351a9af7c0f90179b

added sasha info

committed 2 years ago
Verified
46c4f4fc79491e62a5f952232882a3ca6585bd49

Update call_for_participation.md

ssuyounkimfb committed 2 years ago
Verified
9cd592bf5cd65e1851434193c98ada99639eaa0b

Update contact_us.md

ssuyounkimfb committed 2 years ago
Verified
5b60779881cec7168532403cee347eae07a0c89f

Update contact_us.md

pprintfoo committed 2 years ago

README

The README file for this repository.

STOP Dataset

End-to-end spoken language understanding (SLU) predicts intent directly from audio using a single model. It promises to improve the performance of assistant systems by leveraging acoustic information lost in the intermediate textual representation and preventing cascading errors from Automatic Speech Recognition (ASR). Further, having one unified model has efficiency advantages when deploying assistant systems on-device. However, the limited number of public audio datasets with semantic parse labels hinders the research progress in this area. In this paper, we release the Spoken Task-Oriented semantic Parsing (STOP) dataset, the largest and most complex SLU dataset to be publicly available. Additionally, we define low-resource splits to establish a benchmark for improving SLU when limited labeled data is available. Furthermore, in addition to the human-recorded audio, we are releasing a TTS-generated version to benchmark the performance for low-resource domain adaptation of end-to-end SLU systems. Initial experimentation show end-to-end SLU models performing slightly worse than their cascaded counterparts, which we hope encourages future work in this direction.

Updates

Questions

  • Please post on our github issues, and we will get back to you!

Citation

Please use the following citation:

@inproceedings{stop2022,
  author    = {Paden Tomasello and Akshat Shrivastava and Daniel Lazar and Po-Chun Hsu and Duc Le and Adithya Sagar and Ali Elkahky and Jade Copet and Wei-Ning Hsu and Yossef Mordechay and Robin Algayres and Tu Anh Nguyen and Emmanuel Dupoux and Luke Zettlemoyer and Abdelrahman Mohamed},
  title     = {{STOP: A dataset for Spoken Task Oriented Semantic Parsing}},
  booktitle   = {CoRR},
  eprinttype = {arXiv},
}

LICENSE