Author of the publication

MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering.

, , , and . EMNLP (Findings), volume EMNLP 2020 of Findings of ACL, page 4648-4660. Association for Computational Linguistics, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning a Multi-concept Video Retrieval Model with Multiple Latent Variables., , and . ISM, page 615-620. IEEE Computer Society, (2016)Context-Aware Analysis of Group Submissions for Group Anomaly Detection and Performance Prediction., and . AAAI, page 15938-15946. AAAI Press, (2023)Visual Text Correction., and . ECCV (13), volume 11217 of Lecture Notes in Computer Science, page 159-175. Springer, (2018)Video Generation from Text Employing Latent Path Construction for Temporal Modeling., and . CoRR, (2021)WoundNet: A Domain-Adaptable Few-Shot Classification Framework for Wound Healing Assessment., , , , , , , , , and 1 other author(s). ISBI, page 1-5. IEEE, (2023)UCF-CRCV at TRECVID 2015: Semantic Indexing., , , and . TRECVID, National Institute of Standards and Technology (NIST), (2015)MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering., , , and . EMNLP (Findings), volume EMNLP 2020 of Findings of ACL, page 4648-4660. Association for Computational Linguistics, (2020)Video Generation from Text Employing Latent Path Construction for Temporal Modeling., and . ICPR, page 5010-5016. IEEE, (2022)UCF-CRCV at TRECVID 2014: Semantic Indexing., , , , , , , , and . TRECVID, National Institute of Standards and Technology (NIST), (2014)Deep Photo Cropper And Enhancer., , , and . ICIP, page 993-997. IEEE, (2020)