Result Number | Material Type | Add to My Shelf Action | Record Details and Options |
---|---|---|---|
1 |
Material Type: Artigo
|
![]() |
Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech RecognitionShinohara, Yusuke ; Watanabe, ShinjiarXiv.org, 2022-11Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
2 |
Material Type: Artigo
|
![]() |
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task GeneralizationPeng, Puyuan ; Yan, Brian ; Watanabe, Shinji ; Harwath, DavidarXiv.org, 2023-08Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
3 |
Material Type: Artigo
|
![]() |
SpeechLMScore: Evaluating speech generation using speech language modelMaiti, Soumi ; Peng, Yifan ; Saeki, Takaaki ; Watanabe, ShinjiarXiv.org, 2022-12Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
4 |
Material Type: Artigo
|
![]() |
ESPnet-ONNX: Bridging a Gap Between Research and ProductionSomeki, Masao ; Higuchi, Yosuke ; Hayashi, Tomoki ; Watanabe, ShinjiarXiv.org, 2022-11Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
5 |
Material Type: Artigo
|
![]() |
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal DerivativesTachibana, Hideyuki ; Go, Mocho ; Inahara, Muneyoshi ; Katayama, Yotaro ; Watanabe, YotaroarXiv.org, 2022-10Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
6 |
Material Type: Artigo
|
![]() |
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and UnderstandingPeng, Yifan ; Dalmia, Siddharth ; Lane, Ian ; Watanabe, ShinjiarXiv.org, 2022-07Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
7 |
Material Type: Artigo
|
![]() |
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASRTsunoo, Emiru ; Narisetty, Chaitanya ; Hentschel, Michael ; Kashiwagi, Yosuke ; Watanabe, ShinjiarXiv.org, 2022-01Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
8 |
Material Type: Artigo
|
![]() |
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation ConditionsZhang, Wangyou ; Shi, Jing ; Li, Chenda ; Watanabe, Shinji ; Qian, YanminarXiv.org, 2021-10Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
9 |
Material Type: Artigo
|
![]() |
Efficient Sequence Transduction by Jointly Predicting Tokens and DurationsXu, Hainan ; Jia, Fei ; Majumdar, Somshubra ; Huang, He ; Watanabe, Shinji ; Ginsburg, BorisarXiv.org, 2023-05Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |
10 |
Material Type: Artigo
|
![]() |
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text PretrainingSaeki, Takaaki ; Maiti, Soumi ; Li, Xinjian ; Watanabe, Shinji ; Takamichi, Shinnosuke ; Saruwatari, HiroshiarXiv.org, 2023-05Ithaca: Cornell University Library, arXiv.orgTexto completo disponível |