Author of the publication

Cross-Modal Transformer-Based Neural Correction Models for Automatic Speech Recognition.

T. Tanaka, R. Masumura, M. Ihori, A. Takashima, T. Moriya, T. Ashihara, S. Orihashi, and N. Makishima. Interspeech, page 4059-4063. ISCA, (2021)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

Naoki Yoshida

Naoki Ueda

Naoki Fukuzawa

Naoki Tamura

Naoki Saito

Other publications of authors with the same name

Independent Deeply Learned Matrix Analysis for Determined Audio Source Separation. N. Makishima, S. Mogami, N. Takamune, D. Kitamura, H. Sumino, S. Takamichi, H. Saruwatari, and N. Ono. IEEE ACM Trans. Audio Speech Lang. Process., 27 (10): 1601-1615 (2019)

OnDA-DETR: Online Domain Adaptation for Detection Transformers with Self-Training Framework. S. Suzuki, T. Yamane, N. Makishima, K. Suzuki, A. Ando, and R. Masumura. ICIP, page 1780-1785. IEEE, (2023)

MAPGN: Masked Pointer-Generator Network for Sequence-to-Sequence Pre-Training. M. Ihori, N. Makishima, T. Tanaka, A. Takashima, S. Orihashi, and R. Masumura. ICASSP, page 7563-7567. IEEE, (2021)

Independent deeply learned matrix analysis with automatic selection of stable microphone-wise update and fast sourcewise update of demixing matrix. N. Makishima, Y. Mitsui, N. Takamune, D. Kitamura, H. Saruwatari, Y. Takahashi, and K. Kondo. Signal Process., (2021)

Enrollment-Less Training for Personalized Voice Activity Detection. N. Makishima, M. Ihori, T. Tanaka, A. Takashima, S. Orihashi, and R. Masumura. Interspeech, page 346-350. ISCA, (2021)

Memory Attentive Fusion: External Language Model Integration for Transformer-based Sequence-to-Sequence Model. M. Ihori, R. Masumura, N. Makishima, T. Tanaka, A. Takashima, and S. Orihashi. INLG, page 1-6. Association for Computational Linguistics, (2020)

Hierarchical Knowledge Distillation for Dialogue Sequence Labeling. S. Orihashi, Y. Yamazaki, N. Makishima, M. Ihori, A. Takashima, T. Tanaka, and R. Masumura. ASRU, page 433-440. IEEE, (2021)

Unified Autoregressive Modeling for Joint End-to-End Multi-Talker Overlapped Speech Recognition and Speaker Attribute Estimation. R. Masumura, D. Okamura, N. Makishima, M. Ihori, A. Takashima, T. Tanaka, and S. Orihashi. Interspeech, page 2591-2595. ISCA, (2021)

Text-to-Text Pre-Training with Paraphrasing for Improving Transformer-Based Image Captioning. R. Masumura, N. Makishima, M. Ihori, A. Takashima, T. Tanaka, and S. Orihashi. EUSIPCO, page 516-520. IEEE, (2023)

Multi-region CNN-Transformer for Micro-gesture Recognition in Face and Upper Body. K. Suzuki, S. Suzuki, R. Masumura, A. Ando, and N. Makishima. MMAsia, page 89:1-89:5. ACM, (2023)

BibSonomy

Disambiguation of "Makishima, Naoki"