Author of the publication

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

, , , and . https://ai.facebook.com/blog/ai-self-supervised-learning-data2vec/, (2022)cite arxiv:2212.07525.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Disentangling by Partitioning: A Representation Learning Framework for Multimodal Sensory Data., and . CoRR, (2018)EXPRESSO: A Benchmark and Analysis of Discrete Expressive Speech Resynthesis., , , , , , , , , and 3 other author(s). CoRR, (2023)Text-Free Image-to-Speech Synthesis Using Learned Segmental Units., , , and . CoRR, (2020)Differentiable Weighted Finite-State Transducers., , , and . CoRR, (2020)STOP: A dataset for Spoken Task Oriented Semantic Parsing., , , , , , , , , and 5 other author(s). CoRR, (2022)Direct speech-to-speech translation with discrete units., , , , , , , , , and 1 other author(s). CoRR, (2021)Textless Speech-to-Speech Translation on Real Data., , , , , , , , , and . CoRR, (2021)DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning., , , , and . CoRR, (2023)Simple and Effective Unsupervised Speech Translation., , , , , , , and . CoRR, (2022)Hierarchical Generative Modeling for Controllable Speech Synthesis., , , , , , , , , and 2 other author(s). CoRR, (2018)