Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

Other publications of authors with the same name

PicoAudio: Enabling Precise Timestamp and Frequency Controllability of Audio Events in Text-to-audio Generation. CoRR, (2024)
Enhancing Audio Generation Diversity with Visual Information. ICASSP, pages 866-870. IEEE, (2024)
Investigating Passive Filter Pruning for Efficient CNN-Transformer Audio Captioning. MLSP, pages 1-6. IEEE, (2024)
Navigating Audio-Visual Event Detection Across Mismatched Modalities. ICASSP, pages 1975-1979. IEEE, (2022)
Towards Weakly Supervised Text-to-Audio Grounding. IEEE Trans. Multim., (2024)
PicoAudio: Enabling Precise Temporal Controllability in Text-to-Audio Generation. ICASSP, pages 1-5. IEEE, (2025)
DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning. ICASSP, pages 1-5. IEEE, (2025)
Diversity-Controllable and Accurate Audio Captioning Based on Neural Condition. ICASSP, pages 971-975. IEEE, (2022)
Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning. ICASSP, pages 905-909. IEEE, (2021)
A Lightweight Framework for Online Voice Activity Detection in the Wild. Interspeech, pages 371-375. ISCA, (2021)