Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding.

D. Chen, R. Hu, X. Chen, M. Nießner, and A. Chang. ICCV, page 18063-18073. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Jun Hu

Zaijun Hu

Hu Wang

Bing Hu

Ruguo Hu

Other publications of authors with the same name

FLAVA: A Foundational Language And Vision Alignment Model.A. Singh, R. Hu, V. Goswami, G. Couairon, W. Galuba, M. Rohrbach, and D. Kiela. CoRR, (2021)Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image.R. Hu, and D. Pathak. CoRR, (2020)Scaling Language-Image Pre-Training via Masking.Y. Li, H. Fan, R. Hu, C. Feichtenhofer, and K. He. CVPR, page 23390-23400. IEEE, (2023)FLAVA: A Foundational Language And Vision Alignment Model.A. Singh, R. Hu, V. Goswami, G. Couairon, W. Galuba, M. Rohrbach, and D. Kiela. CVPR, page 15617-15629. IEEE, (2022)Modeling Relationships in Referential Expressions with Compositional Modular Networks.R. Hu, M. Rohrbach, J. Andreas, T. Darrell, and K. Saenko. CVPR, page 4418-4427. IEEE Computer Society, (2017)Natural Language Object Retrieval.R. Hu, H. Xu, M. Rohrbach, J. Feng, K. Saenko, and T. Darrell. CoRR, (2015)TextCaps: A Dataset for Image Captioning with Reading Comprehension.O. Sidorov, R. Hu, M. Rohrbach, and A. Singh. ECCV (2), volume 12347 of Lecture Notes in Computer Science, page 742-758. Springer, (2020)UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding.D. Chen, R. Hu, X. Chen, M. Nießner, and A. Chang. ICCV, page 18063-18073. IEEE, (2023)Worldsheet: Wrapping the World in a 3D Sheet for View Synthesis from a Single Image.R. Hu, N. Ravi, A. Berg, and D. Pathak. ICCV, page 12508-12517. IEEE, (2021)Are You Looking? Grounding to Multiple Modalities in Vision-and-Language Navigation.R. Hu, D. Fried, A. Rohrbach, D. Klein, T. Darrell, and K. Saenko. ACL (1), page 6551-6557. Association for Computational Linguistics, (2019)

BibSonomy

Disambiguation of "Hu, Ronghang"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding.

Please choose a person to relate this publication to

Jun Hu

Zaijun Hu

Hu Wang

Bing Hu

Ruguo Hu

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Hu, Ronghang"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding.

Please choose a person to relate this publication to

Jun Hu

Zaijun Hu

Hu Wang

Bing Hu

Ruguo Hu

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding.