Author of the publication

A Survivor in the Era of Large-Scale Pretraining: An Empirical Study of One-Stage Referring Expression Comprehension.

, , , , and . IEEE Trans. Multim., (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Real-time Global Inference Network for One-stage Referring Expression Comprehension., , , , , , , and . CoRR, (2019)HSM-QA: Question Answering System Based on Hierarchical Semantic Matching., , , , and . IEEE Access, (2023)Knowledge-Driven Generative Adversarial Network for Text-to-Image Synthesis., , , , , , and . IEEE Trans. Multim., (2022)Towards Efficient Visual Adaption via Structural Re-parameterization., , , , , , and . CoRR, (2023)Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models., , , , , and . CoRR, (2024)Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting., , , , , , and . CoRR, (2023)Towards Language-Guided Visual Recognition via Dynamic Convolutions., , , , , and . Int. J. Comput. Vis., 132 (1): 1-19 (January 2024)Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network., , , , and . AAAI, page 2528-2536. AAAI Press, (2023)Dynamic Capsule Attention for Visual Question Answering., , , , and . AAAI, page 9324-9331. AAAI Press, (2019)DIFNet: Boosting Visual Information Flow for Image Captioning., , , , , , , and . CVPR, page 17999-18008. IEEE, (2022)