Author of the publication

MART: Learning Hierarchical Music Audio Representations with Part-Whole Transformer.

, , , , , , , , and . WWW (Companion Volume), page 967-970. ACM, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

HiGene: A high-performance platform for genomic data analysis., , , , and . BIBM, page 576-583. IEEE Computer Society, (2016)Recognizing Dance Motions with Segmental SVD., , , and . ICPR, page 1537-1540. IEEE Computer Society, (2010)CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction., , , , , , and . ISCSLP, page 81-85. IEEE, (2022)Generalized Model-Based Human Motion Recognition with Body Partition Index Maps., , , and . Comput. Graph. Forum, 31 (1): 202-215 (2012)CCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation detection and diagnosis., , , , , , , , and . CoRR, (2021)Unified Mandarin TTS Front-end Based on Distilled BERT Model., , and . CoRR, (2020)USED: Universal Speaker Extraction and Diarization., , , , , , , , and . CoRR, (2023)EditSinger: Zero-Shot Text-Based Singing Voice Editing System with Diverse Prosody Modeling., , , and . IJCAI, page 4503-4509. ijcai.org, (2022)DisCover: Disentangled Music Representation Learning for Cover Song Identification., , , , , , , , , and . SIGIR, page 453-463. ACM, (2023)CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis., , , , , , , , , and . INTERSPEECH, page 4352-4356. ISCA, (2022)