Author of the publication


Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset.

I. Palmer, A. Rouditchenko, A. Barbu, B. Katz, and J. Glass. Interspeech, page 3650-3654. ISCA, (2021)


Other publications of authors with the same name

C2KD: Cross-Lingual Cross-Modal Knowledge Distillation for Multilingual Text-Video Retrieval. A. Rouditchenko, Y. Chuang, N. Shvetsova, S. Thomas, R. Feris, B. Kingsbury, L. Karlinsky, D. Harwath, H. Kuehne, and J. Glass. CoRR, (2022)

What, when, and where? - Self-Supervised Spatio-Temporal Grounding in Untrimmed Multi-Action Videos from Narrated Instructions. B. Chen, N. Shvetsova, A. Rouditchenko, D. Kondermann, S. Thomas, S. Chang, R. Feris, J. Glass, and H. Kuehne. CoRR, (2023)

Routing with Self-Attention for Multimodal Capsule Networks. K. Duarte, B. Chen, N. Shvetsova, A. Rouditchenko, S. Thomas, A. Liu, D. Harwath, J. Glass, H. Kuehne, and M. Shah. CoRR, (2021)

Self-Supervised Segmentation and Source Separation on Videos. A. Rouditchenko, H. Zhao, C. Gan, J. McDermott, and A. Torralba. CVPR Workshops, page 0. Computer Vision Foundation / IEEE, (2019)

Self-supervised Audio-visual Co-segmentation. A. Rouditchenko, H. Zhao, C. Gan, J. McDermott, and A. Torralba. ICASSP, page 2357-2361. IEEE, (2019)

Label-efficient audio classification through multitask learning and self-supervision. T. Lee, T. Gong, S. Padhy, A. Rouditchenko, and A. Ndirango. CoRR, (2019)

Cascaded Multilingual Audio-Visual Learning from Videos. A. Rouditchenko, A. Boggust, D. Harwath, S. Thomas, H. Kuehne, B. Chen, R. Panda, R. Feris, B. Kingsbury, M. Picheny and 1 other author(s). Interspeech, page 3006-3010. ISCA, (2021)

Contrastive Audio-Visual Masked Autoencoder. Y. Gong, A. Rouditchenko, A. Liu, D. Harwath, L. Karlinsky, H. Kuehne, and J. Glass. CoRR, (2022)

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos. B. Chen, A. Rouditchenko, K. Duarte, H. Kuehne, S. Thomas, A. Boggust, R. Panda, B. Kingsbury, R. Feris, D. Harwath and 3 other author(s). ICCV, page 7992-8001. IEEE, (2021)

Everything at Once - Multi-modal Fusion Transformer for Video Retrieval. N. Shvetsova, B. Chen, A. Rouditchenko, S. Thomas, B. Kingsbury, R. Feris, D. Harwath, J. Glass, and H. Kuehne. CVPR, page 19988-19997. IEEE, (2022)

BibSonomy

Disambiguation of "Rouditchenko, Andrew"
