Author of the publication

LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following.

EMNLP, page 1203-1217. Association for Computational Linguistics, (2023)


Other publications of authors with the same name

TAN: Temporal Aggregation Network for Dense Multi-Label Action Recognition. WACV, page 151-160. IEEE, (2019)

Image is First-order Norm+Linear Autoregressive. CoRR, (2023)

CLIP-TD: CLIP Targeted Distillation for Vision-Language Tasks. CoRR, (2022)

DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search. CoRR, (2020)

BEVT: BERT Pretraining of Video Transformers. CoRR, (2021)

Should All Proposals be Treated Equally in Object Detection? CoRR, (2022)

OmniTracker: Unifying Object Tracking by Tracking-with-Detection. CoRR, (2023)

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning. CoRR, (2022)

MicroNet: Improving Image Recognition with Extremely Low FLOPs. ICCV, page 458-467. IEEE, (2021)

CvT: Introducing Convolutions to Vision Transformers. ICCV, page 22-31. IEEE, (2021)