Author of the publication

CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation.

, , , , , , and . IEEE Trans. Multim., (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst., , , , , , , and . CoRR, (2023)CSDNet: Contrastive Similarity Distillation Network for Multi-lingual Image-Text Retrieval., , , , , and . ICIG (3), volume 14357 of Lecture Notes in Computer Science, page 385-395. Springer, (2023)Keypoint Context Aggregation for Human Pose Estimation., , , and . ICIG (2), volume 12889 of Lecture Notes in Computer Science, page 386-396. Springer, (2021)Normalized and Geometry-Aware Self-Attention Network for Image Captioning., , , , , and . CVPR, page 10324-10333. Computer Vision Foundation / IEEE, (2020)Knowledge Condensation and Reasoning for Knowledge-based VQA., , , , , , , , , and 1 other author(s). CoRR, (2024)Modeling Local and Global Contexts for Image Captioning., , , and . ICME, page 1-6. IEEE, (2020)EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE., , , , , , and . AAAI, page 1110-1119. AAAI Press, (2024)MAMO: Fine-Grained Vision-Language Representations Learning with Masked Multimodal Modeling., , , , , and . SIGIR, page 1528-1538. ACM, (2023)CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation., , , , , , and . IEEE Trans. Multim., (2024)AutoCaption: Image Captioning with Neural Architecture Search., , , and . CoRR, (2020)