From post

ETHICIST: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation.

, , и . ACL (1), стр. 12674-12687. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks., , , , , , и . CoRR, (2024)Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization., , , , , и . ACL (1), стр. 8865-8887. Association for Computational Linguistics, (2024)Defending Large Language Models Against Jailbreaking Attacks Through Goal Prioritization., , , и . CoRR, (2023)Indoor Auto-Navigate System for Electric Wheelchairs in a Nursing Home., , и . HCI (7), том 13308 из Lecture Notes in Computer Science, стр. 542-552. Springer, (2022)Visualizing the Electroencephalography Signal Discrepancy When Maintaining Social Distancing: EEG-Based Interactive Moiré Patterns., , , , , и . HCI (21), том 13322 из Lecture Notes in Computer Science, стр. 185-197. Springer, (2022)ETHICIST: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confidence Estimation., , и . ACL (1), стр. 12674-12687. Association for Computational Linguistics, (2023)MoralDial: A Framework to Train and Evaluate Moral Dialogue Systems via Moral Discussions., , , , , , , , и . ACL (1), стр. 2213-2230. Association for Computational Linguistics, (2023)OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics., , , , , , , и . ACL/IJCNLP (1), стр. 6394-6407. Association for Computational Linguistics, (2021)SafetyBench: Evaluating the Safety of Large Language Models., , , , , , , , , и . ACL (1), стр. 15537-15553. Association for Computational Linguistics, (2024)Persona-Guided Planning for Controlling the Protagonist's Persona in Story Generation., , , и . NAACL-HLT, стр. 3346-3361. Association for Computational Linguistics, (2022)