Author of the publication

Violin: A Large-Scale Dataset for Video-and-Language Inference.

, , , , , , and . CVPR, page 10897-10907. Computer Vision Foundation / IEEE, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A quantitative quality control method of big data in cancer patients using artificial neural network., , , , , , and . CCIS, page 499-504. IEEE, (2014)MCMG simulator: A unified simulation framework for CPU and graphic GPU., , , and . J. Comput. Syst. Sci., 81 (1): 57-71 (2015)AVID: Any-Length Video Inpainting with Diffusion Model., , , , , , , , and . CoRR, (2023)Text-to-Sticker: Style Tailoring Latent Diffusion Models for Human Expression., , , , , , , , , and 7 other author(s). CoRR, (2023)BachGAN: High-Resolution Image Synthesis From Salient Object Layout., , , , , and . CVPR, page 8362-8371. Computer Vision Foundation / IEEE, (2020)Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation., , , , , , and . CVPR, page 10681-10692. IEEE, (2023)CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval., , , , , , and . KDD, page 4433-4442. ACM, (2022)Question Answering, Grounding, and Generation for Vision and Language.. University of North Carolina, Chapel Hill, USA, (2019)base-search.net (ftcarolinadr:cdr.lib.unc.edu:7h149v557).CiT: Curation in Training for Effective Vision-Language Data., , , , , , , and . ICCV, page 15134-15143. IEEE, (2023)Improving branch divergence performance on GPGPU with a new PDOM stack and multi-level warp scheduling., , , and . J. Syst. Archit., 60 (5): 420-430 (2014)