Author of the publication

Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation.

ICASSP, page 1-5. IEEE, (2023)


Other publications of authors with the same name

An 112-Ch Neural Signal Acquisition SoC With Full-Channel Read-Out and Processing Accelerators. IEEE Trans. Very Large Scale Integr. Syst., 32 (8): 1461-1471 (August 2024)

Adaptive Accompaniment with ReaLchords. ICML, OpenReview.net, (2024)

The Chamber Ensemble Generator: Limitless High-Quality MIR Data via Generative Modeling. CoRR, (2022)

Learning Singing From Speech. CoRR, (2019)

Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation. ICASSP, page 1-5. IEEE, (2023)

SUNMASK: Mask Enhanced Control in Step Unrolled Denoising Autoencoders. EvoMUSART@EvoStar, volume 13988 of Lecture Notes in Computer Science, page 148-163. Springer, (2023)

3M-AI: A Multi-task and Multi-core Virtualization Framework for Multi-FPGA AI Systems in the Cloud. FPGA, page 228. ACM, (2021)

A Frequency-Division Transceiver for Long-Range Neural Signal Recording From Multiple Subjects. IEEE J. Solid State Circuits, 59 (3): 923-934 (March 2024)

Audio Captioning Based on Transformer and Pre-Trained CNN. DCASE, page 21-25. (2020)

Peking Opera Synthesis via Duration Informed Attention Network. INTERSPEECH, page 1226-1230. ISCA, (2020)