Author of the publication

UNITER: UNiversal Image-TExt Representation Learning.

, , , , , , , and . ECCV (30), volume 12375 of Lecture Notes in Computer Science, page 104-120. Springer, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A quantitative quality control method of big data in cancer patients using artificial neural network., , , , , , and . CCIS, page 499-504. IEEE, (2014)MCMG simulator: A unified simulation framework for CPU and graphic GPU., , , and . J. Comput. Syst. Sci., 81 (1): 57-71 (2015)Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression., , , , , , , , , and 7 other author(s). CoRR, (2023)AVID: Any-Length Video Inpainting with Diffusion Model., , , , , , , , and . CoRR, (2023)CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval., , , , , , and . KDD, page 4433-4442. ACM, (2022)Improving branch divergence performance on GPGPU with a new PDOM stack and multi-level warp scheduling., , , and . J. Syst. Archit., 60 (5): 420-430 (2014)Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation., , , , , , and . CVPR, page 10681-10692. IEEE, (2023)BachGAN: High-Resolution Image Synthesis From Salient Object Layout., , , , , and . CVPR, page 8362-8371. Computer Vision Foundation / IEEE, (2020)VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation., , , , , , , , , and 5 other author(s). NeurIPS Datasets and Benchmarks, (2021)Question Answering, Grounding, and Generation for Vision and Language.. University of North Carolina, Chapel Hill, USA, (2019)base-search.net (ftcarolinadr:cdr.lib.unc.edu:7h149v557).