Author of the publication

TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model.

, , , , , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

PororoGAN: An Improved Story Visualization Model on Pororo-SV Dataset., , and . CSAI, page 155-159. ACM, (2019)Feature Enhancement with Text-Specific Region Contrast for Scene Text Detection., , , , , , , and . PRCV (7), volume 14431 of Lecture Notes in Computer Science, page 3-14. Springer, (2023)TextBlock: Towards Scene Text Spotting without Fine-grained Detection., , , , , , , , and . ACM Multimedia, page 5892-5902. ACM, (2022)A Cost-Efficient Framework for Scene Text Detection in the Wild., , , and . PRICAI (1), volume 13031 of Lecture Notes in Computer Science, page 139-153. Springer, (2021)Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering., , , , , , and . ICME, page 1-6. IEEE, (2022)TextBlockV2: Towards Precise-Detection-Free Scene Text Spotting with Pre-trained Language Model., , , , , , and . CoRR, (2024)Beyond OCR + VQA: Towards end-to-end reading and reasoning for robust and accurate textvqa., , , , , , , and . Pattern Recognit., (June 2023)Filling in the Blank: Rationale-Augmented Prompt Tuning for TextVQA., , , , , , and . ACM Multimedia, page 1261-1272. ACM, (2023)Beyond OCR + VQA: Involving OCR into the Flow for Robust and Accurate TextVQA., , , and . ACM Multimedia, page 376-385. ACM, (2021)