From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

ICECAP: Information Concentrated Entity-aware Image Captioning.

A. Hu, S. Chen, и Q. Jin. ACM Multimedia, стр. 4217-4225. ACM, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Doris Anwender

Jun Hu

Chunchun Hu

Yuhui Hu

Zheng Hu

Другие публикации лиц с тем же именем

Leveraging Multi-Token Entities in Document-Level Named Entity Recognition.A. Hu, Z. Dou, J. Nie, и J. Wen. AAAI, стр. 7961-7968. AAAI Press, (2020)Accommodating Audio Modality in CLIP for Multimodal Processing.L. Ruan, A. Hu, Y. Song, L. Zhang, S. Zheng, и Q. Jin. AAAI, стр. 9641-9649. AAAI Press, (2023)ICECAP: Information Concentrated Entity-aware Image Captioning.A. Hu, S. Chen, и Q. Jin. ACM Multimedia, стр. 4217-4225. ACM, (2020)MPMQA: Multimodal Question Answering on Product Manuals.L. Zhang, A. Hu, J. Zhang, S. Hu, и Q. Jin. AAAI, стр. 13958-13966. AAAI Press, (2023)mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model.A. Hu, Y. Shi, H. Xu, J. Ye, Q. Ye, M. Yan, C. Li, Q. Qian, J. Zhang, и F. Huang. CoRR, (2023)UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model.J. Ye, A. Hu, H. Xu, Q. Ye, M. Yan, G. Xu, C. Li, J. Tian, Q. Qian, J. Zhang и 4 other автор(ы). EMNLP (Findings), стр. 2841-2858. Association for Computational Linguistics, (2023)Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval.Y. Shi, H. Liu, H. Xu, Z. Ma, Q. Ye, A. Hu, M. Yan, J. Zhang, F. Huang, C. Yuan и 3 other автор(ы). ACM Multimedia, стр. 4460-4470. ACM, (2023)WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training.Y. Huo, M. Zhang, G. Liu, H. Lu, Y. Gao, G. Yang, J. Wen, H. Zhang, B. Xu, W. Zheng и 25 other автор(ы). CoRR, (2021)InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation.A. Hu, S. Chen, L. Zhang, и Q. Jin. ACL (1), стр. 3171-3185. Association for Computational Linguistics, (2023)mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding.A. Hu, H. Xu, J. Ye, M. Yan, L. Zhang, B. Zhang, C. Li, J. Zhang, Q. Jin, F. Huang и 1 other автор(ы). CoRR, (2024)

Что такое BibSonomy?: С чего начать; Кнопки для браузера; Помощь
Разработчикам: Обзор; API-документация

Контакт и защита личных данных: о нас; Cookies; Сообщить о проблеме; BibSonomy Вики

Интеграция: PUMA; Расширение для TYPO3; Плагин для; Клиент Java REST; Поддерживаемые источники; далее

О BibSonomy: Команда; Блог; Список рассылки
Социальные сети: Наш Twitter