Author of the publication

Cross-modal Non-linear Guided Attention and Temporal Coherence in Multi-modal Deep Video Models.

ACM Multimedia, pages 313-321. ACM, (2020)

Other publications of authors with the same name

Modeling Feature Representations for Affective Speech Using Generative Adversarial Networks. IEEE Trans. Affect. Comput., 13 (2): 1098-1110 (2022)

Exploiting Temporal Coherence for Multi-modal Video Categorization. CoRR, (2020)

Uplink Transmission in MU Multi-Cell Massive MIMO-FBMC Systems over Ricean Fading. VTC Fall, pages 1-6. IEEE, (2021)

Multi-Sensor Fusion Framework using Discriminative Autoencoders. EUSIPCO, pages 1351-1355. IEEE, (2021)

Leveraging Local Temporal Information for Multimodal Scene Classification. CoRR, (2021)

Multi-Modal Learning for Speech Emotion Recognition: An Analysis and Comparison of ASR Outputs with Ground Truth Transcription. INTERSPEECH, pages 3302-3306. ISCA, (2019)

Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training. CoRR, (2021)

Cross-modal Learning for Multi-modal Video Categorization. CoRR, (2020)

Semi-Supervised and Transfer Learning Approaches for Low Resource Sentiment Classification. ICASSP, pages 5109-5113. IEEE, (2018)

Smoothing Model Predictions Using Adversarial Training Procedures for Speech Based Emotion Recognition. ICASSP, pages 4934-4938. IEEE, (2018)