skip to main content
Resultados 1 2 3 4 5 next page
Refinado por: Nome da Publicação: Arxiv remover assunto: Speech Recognition remover Arxiv.Org remover
Result Number Material Type Add to My Shelf Action Record Details and Options
1
Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition
Material Type:
Artigo
Adicionar ao Meu Espaço

Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition

Shinohara, Yusuke ; Watanabe, Shinji

arXiv.org, 2022-11

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

2
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization
Material Type:
Artigo
Adicionar ao Meu Espaço

Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization

Peng, Puyuan ; Yan, Brian ; Watanabe, Shinji ; Harwath, David

arXiv.org, 2023-08

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

3
SpeechLMScore: Evaluating speech generation using speech language model
Material Type:
Artigo
Adicionar ao Meu Espaço

SpeechLMScore: Evaluating speech generation using speech language model

Maiti, Soumi ; Peng, Yifan ; Saeki, Takaaki ; Watanabe, Shinji

arXiv.org, 2022-12

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

4
ESPnet-ONNX: Bridging a Gap Between Research and Production
Material Type:
Artigo
Adicionar ao Meu Espaço

ESPnet-ONNX: Bridging a Gap Between Research and Production

Someki, Masao ; Higuchi, Yosuke ; Hayashi, Tomoki ; Watanabe, Shinji

arXiv.org, 2022-11

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

5
Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives
Material Type:
Artigo
Adicionar ao Meu Espaço

Quasi-Taylor Samplers for Diffusion Generative Models based on Ideal Derivatives

Tachibana, Hideyuki ; Go, Mocho ; Inahara, Muneyoshi ; Katayama, Yotaro ; Watanabe, Yotaro

arXiv.org, 2022-10

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

6
Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding
Material Type:
Artigo
Adicionar ao Meu Espaço

Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding

Peng, Yifan ; Dalmia, Siddharth ; Lane, Ian ; Watanabe, Shinji

arXiv.org, 2022-07

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

7
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR
Material Type:
Artigo
Adicionar ao Meu Espaço

Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR

Tsunoo, Emiru ; Narisetty, Chaitanya ; Hentschel, Michael ; Kashiwagi, Yosuke ; Watanabe, Shinji

arXiv.org, 2022-01

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

8
Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions
Material Type:
Artigo
Adicionar ao Meu Espaço

Closing the Gap Between Time-Domain Multi-Channel Speech Enhancement on Real and Simulation Conditions

Zhang, Wangyou ; Shi, Jing ; Li, Chenda ; Watanabe, Shinji ; Qian, Yanmin

arXiv.org, 2021-10

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

9
Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Material Type:
Artigo
Adicionar ao Meu Espaço

Efficient Sequence Transduction by Jointly Predicting Tokens and Durations

Xu, Hainan ; Jia, Fei ; Majumdar, Somshubra ; Huang, He ; Watanabe, Shinji ; Ginsburg, Boris

arXiv.org, 2023-05

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

10
Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining
Material Type:
Artigo
Adicionar ao Meu Espaço

Learning to Speak from Text: Zero-Shot Multilingual Text-to-Speech with Unsupervised Text Pretraining

Saeki, Takaaki ; Maiti, Soumi ; Li, Xinjian ; Watanabe, Shinji ; Takamichi, Shinnosuke ; Saruwatari, Hiroshi

arXiv.org, 2023-05

Ithaca: Cornell University Library, arXiv.org

Texto completo disponível

Resultados 1 2 3 4 5 next page

Personalize Seus Resultados

  1. Editar

Refine Search Results

Expandir Meus Resultados

  1.   

Data de Publicação 

De até
  1. Antes de2017  (1)
  2. 2017Até2017  (2)
  3. 2018Até2018  (5)
  4. 2019Até2020  (9)
  5. Após 2020  (35)
  6. Mais opções open sub menu

Buscando em bases de dados remotas. Favor aguardar.