Author of the publication

HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models.

CoRR, (2023)

Other publications of authors with the same name

VINet: Visual and Inertial-based Terrain Classification and Adaptive Navigation over Unknown Terrain. CoRR, (2022)
GA-Nav: Efficient Terrain Segmentation for Robot Navigation in Unstructured Outdoor Environments. IEEE Robotics Autom. Lett., 7 (3): 8138-8145 (2022)
FAR: Fourier Aerial Video Recognition. ECCV (37), volume 13697 of Lecture Notes in Computer Science, page 657-676. Springer, (2022)
M3DETR: Multi-representation, Multi-scale, Mutual-relation 3D Object Detection with Transformers. WACV, page 2293-2303. IEEE, (2022)
AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales. CoRR, (2024)
Prompt Learning for Action Recognition. CoRR, (2023)
VINet: Visual and Inertial-based Terrain Classification and Adaptive Navigation over Unknown Terrain. ICRA, page 4106-4112. IEEE, (2023)
CrossLoc3D: Aerial-Ground Cross-Source 3D Place Recognition. ICCV, page 11301-11310. IEEE, (2023)
TerraPN: Unstructured Terrain Navigation using Online Self-Supervised Learning. IROS, page 7197-7204. IEEE, (2022)
Large Language Models and Causal Inference in Collaboration: A Comprehensive Survey. CoRR, (2024)