Author of the publication

Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding.

ACL (1), pages 13386-13401. Association for Computational Linguistics, (2023)

Other publications of authors with the same name

Blow-Up Phenomena for Porous Medium Equation with Nonlinear Flux on the Boundary. J. Appl. Math., (2013)
Wukong-Reader: Multi-modal Pre-training for Fine-grained Visual Document Understanding. CoRR, (2022)
ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network. CVPR, pages 9806-9815. Computer Vision Foundation / IEEE, (2020)
Polarimetric radar-based rainfall estimation through adaptive learning with multi-source data from the NOAA meteorological assimilation data ingest system. IGARSS, pages 7290-7292. IEEE, (2022)
Reading Scene Text by Fusing Visual Attention with Semantic Representations. ICMR, pages 210-218. ACM, (2021)
VirtuWander: Enhancing Multi-modal Interaction for Virtual Tour Guidance through Large Language Models. CHI, pages 612:1-612:20. ACM, (2024)
Global Lagrange stability for neutral type neural networks with mixed time-varying delays. Int. J. Machine Learning & Cybernetics, 9 (4): 599-609 (2018)
Visual and semantic ensemble for scene text recognition with gated dual mutual attention. Int. J. Multim. Inf. Retr., 11 (4): 669-680 (2022)
Mining competitive relationships by learning across heterogeneous networks. CIKM, pages 1432-1441. ACM, (2012)
One-shot Key Information Extraction from Document with Deep Partial Graph Matching. CoRR, (2021)