Author of the publication


ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages.

J. Ye, S. Li, G. Li, C. Huang, S. Gao, Y. Wu, Q. Zhang, T. Gui, and X. Huang. ACL (1), page 2181-2211. Association for Computational Linguistics, (2024)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Josef Huang

Pa Huang

Haishi Huang

Feiqing Huang

Zhida Huang

Other publications of authors with the same name

Beyond Boundaries: Learning a Universal Entity Taxonomy across Datasets and Languages for Open Named Entity Recognition. Y. Yang, W. Zhao, C. Huang, J. Ye, X. Wang, H. Zheng, Y. Nan, Y. Wang, X. Xu, K. Huang and 4 other author(s). CoRR, (2024)

EasyJailbreak: A Unified Framework for Jailbreaking Large Language Models. W. Zhou, X. Wang, L. Xiong, H. Xia, Y. Gu, M. Chai, F. Zhu, C. Huang, S. Dou, Z. Xi and 11 other author(s). CoRR, (2024)

CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models. H. Lv, X. Wang, Y. Zhang, C. Huang, S. Dou, J. Ye, T. Gui, Q. Zhang, and X. Huang. CoRR, (2024)

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback. S. Dou, Y. Liu, H. Jia, L. Xiong, E. Zhou, W. Shen, J. Shan, C. Huang, X. Wang, X. Fan and 7 other author(s). CoRR, (2024)

Secrets of RLHF in Large Language Models Part II: Reward Modeling. B. Wang, R. Zheng, L. Chen, Y. Liu, S. Dou, C. Huang, W. Shen, S. Jin, E. Zhou, C. Shi and 17 other author(s). CoRR, (2024)

What's Wrong with Your Code Generated by Large Language Models? An Extensive Study. S. Dou, H. Jia, S. Wu, H. Zheng, W. Zhou, M. Wu, M. Chai, J. Fan, C. Huang, Y. Tao and 14 other author(s). CoRR, (2024)

StepCoder: Improving Code Generation with Reinforcement Learning from Compiler Feedback. S. Dou, Y. Liu, H. Jia, E. Zhou, L. Xiong, J. Shan, C. Huang, X. Wang, X. Fan, Z. Xi and 6 other author(s). ACL (1), page 4571-4585. Association for Computational Linguistics, (2024)

SafeAligner: Safety Alignment against Jailbreak Attacks via Response Disparity Guidance. C. Huang, W. Zhao, R. Zheng, H. Lv, S. Dou, S. Li, X. Wang, E. Zhou, J. Ye, Y. Yang and 3 other author(s). CoRR, (2024)

ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages. J. Ye, S. Li, G. Li, C. Huang, S. Gao, Y. Wu, Q. Zhang, T. Gui, and X. Huang. ACL (1), page 2181-2211. Association for Computational Linguistics, (2024)

TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities. M. Zhang, C. Huang, Y. Wu, S. Liu, H. Zheng, Y. Dong, Y. Shen, S. Dou, J. Zhao, J. Ye and 3 other author(s). CoRR, (2024)

Disambiguation of "Huang, Caishuang"
