From post

Bytecover: Cover Song Identification Via Multi-Loss Training

, , , , и . Proceedings of the International Conference on Acoustics, Speech and Signal Processing, стр. 551--555. IEEE, (июня 2021)
DOI: 10.1109/ICASSP39728.2021.9414128

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Improving RNN transducer with normalized jointer network., , , , , , , и . CoRR, (2020)Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer., , , , , и . ASRU, стр. 1-8. IEEE, (2023)A Chapter-Wise Understanding System for Text-To-Speech in Chinese Novels., , , , , и . ICASSP, стр. 6069-6073. IEEE, (2021)PPG-Based Singing Voice Conversion with Adversarial Representation Learning., , , , , , и . ICASSP, стр. 7073-7077. IEEE, (2021)Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition., , и . ICASSP, стр. 4829-4832. IEEE, (2012)A Unified Sequence-to-Sequence Front-End Model for Mandarin Text-to-Speech Synthesis., , , , , , и . ICASSP, стр. 6689-6693. IEEE, (2020)Connecting Speech Encoder and Large Language Model for ASR., , , , , , , , и . CoRR, (2023)SALMONN: Towards Generic Hearing Abilities for Large Language Models., , , , , , , , и . CoRR, (2023)BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance., , , , , , , , и . CoRR, (2022)Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation., , , , , , , , , и . CoRR, (2023)