Author of the publication

Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation.

, , , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Bayesian Speech and Language Processing, and . Cambridge University Press, (2015)High-accuracy user identification using EEG biometrics., , , , , , and . EMBC, page 854-858. IEEE, (2016)Structural Bayesian Linear Regression for Hidden Markov Models., , and . J. Signal Process. Syst., 74 (3): 341-358 (2014)Speech Recognition Based on Student's t-Distribution Derived from Total Bayesian Framework., and . IEICE Trans. Inf. Syst., 89-D (3): 970-980 (2006)Language independent end-to-end architecture for joint language identification and speech recognition., , and . ASRU, page 265-271. IEEE, (2017)Effectiveness of discriminative training and feature transformation for reverberated and noisy speech., , and . ICASSP, page 6935-6939. IEEE, (2013)Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks., , , and . ICASSP, page 708-712. IEEE, (2015)Bag Of ARCS: New representation of speech segment features based on finite state machines., , , , and . ICASSP, page 4201-4204. IEEE, (2012)End-to-end Speech Recognition With Word-Based Rnn Language Models., , and . SLT, page 389-396. IEEE, (2018)Application of topic tracking model to language model adaptation and meeting analysis., , , , and . SLT, page 378-383. IEEE, (2010)