
Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

Other publications of persons with the same name

Leveraging Multi-Token Entities in Document-Level Named Entity Recognition. AAAI, pp. 7961-7968. AAAI Press (2020)
Accommodating Audio Modality in CLIP for Multimodal Processing. AAAI, pp. 9641-9649. AAAI Press (2023)
ICECAP: Information Concentrated Entity-aware Image Captioning. ACM Multimedia, pp. 4217-4225. ACM (2020)
MPMQA: Multimodal Question Answering on Product Manuals. AAAI, pp. 13958-13966. AAAI Press (2023)
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. CoRR (2023)
UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model. EMNLP (Findings), pp. 2841-2858. Association for Computational Linguistics (2023)
Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval. ACM Multimedia, pp. 4460-4470. ACM (2023)
WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training. CoRR (2021)
InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation. ACL (1), pp. 3171-3185. Association for Computational Linguistics (2023)
mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding. CoRR (2024)