Author of the publication

ProVLA: Compositional Image Search with Progressive Vision-Language Alignment and Multimodal Fusion.

ICCV (Workshops), page 2764-2769. IEEE, (2023)

Other publications of authors with the same name

Reinforcement Learning Framework to Identify Cause of Diseases - Predicting Asthma Attack Case. IEEE BigData, page 4829-4838. IEEE, (2019)

A Framework of Input Devices to Support Designing Composite Wearable Computers. HCI (2), volume 12182 of Lecture Notes in Computer Science, page 401-427. Springer, (2020)

VidLA: Video-Language Alignment at Scale. CoRR, (2024)

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs. CoRR, (2024)

Multi-modal Alignment using Representation Codebook. CVPR, page 15630-15639. IEEE, (2022)

A Logic-based Explanation Generation Framework for Classical and Hybrid Planning Problems (Extended Abstract). IJCAI, page 6985-6989. ijcai.org, (2023)

VLUE: A New Benchmark and Multi-task Knowledge Transfer Learning for Vietnamese Natural Language Understanding. NAACL-HLT (Findings), page 211-222. Association for Computational Linguistics, (2024)

Bringing Multimodality to Amazon Visual Search System. KDD, page 6390-6399. ACM, (2024)

A new approach using the Viterbi algorithm in stereo correspondence problem. SMC (3), page 3016-3021. IEEE, (2004)