Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Visual Grounding in Video for Unsupervised Word Translation.

G. Sigurdsson, J. Alayrac, A. Nematzadeh, L. Smaira, M. Malinowski, J. Carreira, P. Blunsom, and A. Zisserman. CVPR, page 10847-10856. Computer Vision Foundation / IEEE, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Jean-Baptiste Alayrac

Jean-Baptiste Felten

Jean-Baptiste Bordes

Jean-Baptiste Sibarita

Jean-Baptiste Filippi

Other publications of authors with the same name

Multimodal Self-Supervised Learning of General Audio Representations.L. Wang, P. Luc, A. Recasens, J. Alayrac, and A. van den Oord. CoRR, (2021)Controllable Attention for Structured Layered Video Decomposition.J. Alayrac, J. Carreira, R. Arandjelovic, and A. Zisserman. ICCV, page 5733-5742. IEEE, (2019)Three ways to improve feature alignment for open vocabulary detection.R. Arandjelovic, A. Andonian, A. Mensch, O. Hénaff, J. Alayrac, and A. Zisserman. CoRR, (2023)Zorro: the masked multimodal transformer.A. Recasens, J. Lin, J. Carreira, A. Jaegle, L. Wang, J. Alayrac, P. Luc, A. Miech, L. Smaira, R. Hemsley and 1 other author(s). CoRR, (2023)Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers.L. Hendricks, J. Mellor, R. Schneider, J. Alayrac, and A. Nematzadeh. CoRR, (2021)End-to-End Learning of Visual Representations from Uncurated Instructional Videos.A. Miech, J. Alayrac, L. Smaira, I. Laptev, J. Sivic, and A. Zisserman. CoRR, (2019)Perceiver IO: A General Architecture for Structured Inputs & Outputs.A. Jaegle, S. Borgeaud, J. Alayrac, C. Doersch, C. Ionescu, D. Ding, S. Koppula, D. Zoran, A. Brock, E. Shelhamer and 5 other author(s). CoRR, (2021)Gemini: A Family of Highly Capable Multimodal Models.R. Anil, S. Borgeaud, Y. Wu, J. Alayrac, J. Yu, R. Soricut, J. Schalkwyk, A. Dai, A. Hauth, K. Millican and 42 other author(s). CoRR, (2023)End-to-End Learning of Visual Representations From Uncurated Instructional Videos.A. Miech, J. Alayrac, L. Smaira, I. Laptev, J. Sivic, and A. Zisserman. CVPR, page 9876-9886. Computer Vision Foundation / IEEE, (2020)Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers.A. Miech, J. Alayrac, I. Laptev, J. Sivic, and A. Zisserman. CVPR, page 9826-9836. Computer Vision Foundation / IEEE, (2021)

BibSonomy

Disambiguation of "Alayrac, Jean-Baptiste"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Visual Grounding in Video for Unsupervised Word Translation.

Please choose a person to relate this publication to

Jean-Baptiste Alayrac

Jean-Baptiste Felten

Jean-Baptiste Bordes

Jean-Baptiste Sibarita

Jean-Baptiste Filippi

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Alayrac, Jean-Baptiste"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Visual Grounding in Video for Unsupervised Word Translation.

Please choose a person to relate this publication to

Jean-Baptiste Alayrac

Jean-Baptiste Felten

Jean-Baptiste Bordes

Jean-Baptiste Sibarita

Jean-Baptiste Filippi

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Visual Grounding in Video for Unsupervised Word Translation.