Author of the publication

Textless Direct Speech-to-Speech Translation with Discrete Speech Representation.

, , and . ICASSP, page 1-5. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization., , , , , , , , , and 1 other author(s). CoRR, (2020)Gesture generation with low-dimensional embeddings., and . AAMAS, page 781-788. IFAAMAS/ACM, (2014)Recognizing Long-Form Speech Using Streaming End-to-End Models., , , , , and . ASRU, page 920-927. IEEE, (2019)Minimum Word Error Rate Training for Attention-Based Sequence-to-Sequence Models., , , , , , and . ICASSP, page 4839-4843. IEEE, (2018)Learning online alignments with continuous rewards policy gradient., , , and . ICASSP, page 2801-2805. IEEE, (2017)An efficient scan algorithm for block-based connected component labeling., and . MED, page 1008-1013. IEEE, (2014)Speech Recognition for Medical Conversations., , , , , , , , , and 4 other author(s). INTERSPEECH, page 2972-2976. ISCA, (2018)SLM: Bridge the Thin Gap Between Speech and Text Foundation Models., , , , , , , , , and 6 other author(s). ASRU, page 1-8. IEEE, (2023)Block-Based Connected-Component Labeling Algorithm Using Binary Decision Trees., , and . Sensors, 15 (9): 23763-23787 (2015)Bridging the Gap Between Streaming and Non-Streaming ASR Systems by Distilling Ensembles of CTC and RNN-T Models., , , , , and . Interspeech, page 1807-1811. ISCA, (2021)