Author of the publication

Transformer Decoders with MultiModal Regularization for Cross-Modal Food Retrieval.

, , , and . CVPR Workshops, page 4566-4577. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion., , , and . CoRR, (2021)FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models., , , , , and . CoRR, (2024)DiffEdit: Diffusion-based semantic image editing with mask guidance., , , and . ICLR, OpenReview.net, (2023)DiffEdit: Diffusion-based semantic image editing with mask guidance., , , and . CoRR, (2022)DyTox: Transformers for Continual Learning with DYnamic TOken eXpansion., , , and . CVPR, page 9275-9285. IEEE, (2022)Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment., , and . BMVC, page 353. BMVA Press, (2022)Zero-shot spatial layout conditioning for text-to-image diffusion models., , , , and . ICCV, page 2174-2183. IEEE, (2023)Gradpaint: Gradient-Guided Inpainting with Diffusion Models., , and . CoRR, (2023)Embedding Arithmetic for Text-driven Image Transformation., , , and . CoRR, (2021)Sub-meter resolution canopy height maps using self-supervised learning and a vision transformer trained on Aerial and GEDI Lidar., , , , , , , , , and 6 other author(s). CoRR, (2023)