Author of the publication

VTQAGen: BART-based Generative Model For Visual Text Question Answering.

, , , , , and . ACM Multimedia, page 9456-9461. ACM, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Meta learning based audio tagging., , , , , , and . DCASE, page 193-196. (2018)FINT: Field-Aware Interaction Neural Network for Click-Through Rate Prediction., , , , and . ICASSP, page 3913-3917. IEEE, (2022)Multiple Temporal Fusion based Weakly-supervised Pre-training Techniques for Video Categorization., , , , , and . ACM Multimedia, page 7089-7093. ACM, (2022)Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast., , , , , , and . IJCAI, page 3787-3794. ijcai.org, (2022)Large-Scale Whale Call Classification Using Deep Convolutional Neural Network Architectures., , , and . ICSPCC, page 1-5. IEEE, (2018)Cheap-Fake Detection with LLM Using Prompt Engineering., , , , , and . ICME Workshops, page 105-109. IEEE, (2023)Multimodal Deep Learning for Social Media Popularity Prediction With Attention Mechanism., , , , , and . ACM Multimedia, page 4580-4584. ACM, (2020)Ultrasound-Based Silent Speech Interface using Sequential Convolutional Auto-encoder., , and . ACM Multimedia, page 2194-2195. ACM, (2019)NiCad+: Speeding the Detecting Process of NiCad., , , , , and . SOSE, page 103-110. IEEE, (2020)Adapter-Based Incremental Learning for Face Forgery Detection., , , , , and . ICASSP, page 4690-4694. IEEE, (2024)