Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Learning Audio-Video Modalities from Image Captions.

A. Nagrani, P. Seo, B. Seybold, A. Hauth, S. Manen, C. Sun, and C. Schmid. ECCV (14), volume 13674 of Lecture Notes in Computer Science, page 407-426. Springer, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Eunyoung Seo

Ean-Jeong Seo

Paek Pyung Seon

Bong-Seock Seo

Ki-Chang Seong

Other publications of authors with the same name

Look Before You Speak: Visually Contextualized Utterances.P. Seo, A. Nagrani, and C. Schmid. CVPR, page 16877-16887. Computer Vision Foundation / IEEE, (2021)Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning.A. Yang, A. Nagrani, P. Seo, A. Miech, J. Pont-Tuset, I. Laptev, J. Sivic, and C. Schmid. CVPR, page 10714-10726. IEEE, (2023)Regularizing Neural Networks via Stochastic Branch Layers.W. Park, P. Seo, B. Han, and M. Cho. ACML, volume 101 of Proceedings of Machine Learning Research, page 678-693. PMLR, (2019)Reinforcing an Image Caption Generator Using Off-Line Human Feedback.P. Seo, P. Sharma, T. Levinboim, B. Han, and R. Soricut. CoRR, (2019)Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction.H. Noh, P. Seo, and B. Han. CoRR, (2015)MarioQA: Answering Questions by Watching Gameplay Videos.J. Mun, P. Seo, I. Jung, and B. Han. CoRR, (2016)Learning Correlation Structures for Vision Transformers.M. Kim, P. Seo, C. Schmid, and M. Cho. CoRR, (2024)AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR.P. Seo, A. Nagrani, and C. Schmid. CVPR, page 22922-22931. IEEE, (2023)Learning for Single-Shot Confidence Calibration in Deep Neural Networks Through Stochastic Inferences.S. Seo, P. Seo, and B. Han. CVPR, page 9030-9038. Computer Vision Foundation / IEEE, (2019)Reinforcing an Image Caption Generator Using Off-Line Human Feedback.P. Seo, P. Sharma, T. Levinboim, B. Han, and R. Soricut. AAAI, page 2693-2700. AAAI Press, (2020)

BibSonomy

Disambiguation of "Seo, Paul Hongsuck"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Learning Audio-Video Modalities from Image Captions.

Please choose a person to relate this publication to

Eunyoung Seo

Ean-Jeong Seo

Paek Pyung Seon

Bong-Seock Seo

Ki-Chang Seong

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Seo, Paul Hongsuck"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Learning Audio-Video Modalities from Image Captions.

Please choose a person to relate this publication to

Eunyoung Seo

Ean-Jeong Seo

Paek Pyung Seon

Bong-Seock Seo

Ki-Chang Seong

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Learning Audio-Video Modalities from Image Captions.