Author of the publication

Exploiting Unlabeled Data with Vision and Language Models for Object Detection.

, , , , , , , and . ECCV (9), volume 13669 of Lecture Notes in Computer Science, page 159-175. Springer, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!, , , , , and . CoRR, (2023)Supervised dictionary learning for action localization., and . FG, page 1-8. IEEE Computer Society, (2013)DeepSetNet: Predicting Sets with Deep Neural Networks., , , , , and . ICCV, page 5257-5266. IEEE Computer Society, (2017)Large Scale Multimodal Classification Using an Ensemble of Transformer Models and Co-Attention., and . CoRR, (2020)Unsupervised CNN for Single View Depth Estimation: Geometry to the Rescue., , and . CoRR, (2016)Generating Enhanced Negatives for Training Language-Based Object Detectors., , , , , , and . CoRR, (2024)STRIVE: Scene Text Replacement In Videos., , , , , , and . ICCV, page 14529-14538. IEEE, (2021)Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!, , , , , and . CVPR, page 15005-15015. IEEE, (2023)Learning codebook weights for action detection., and . CVPR Workshops, page 27-32. IEEE Computer Society, (2012)Smart Mining for Deep Metric Learning., , , , and . ICCV, page 2840-2848. IEEE Computer Society, (2017)