Session Details
[4E5-GS-11b]AI and Society
Thu. Jun 11, 2026 3:30 PM - 5:00 PM JST
Thu. Jun 11, 2026 6:30 AM - 8:00 AM UTC
Thu. Jun 11, 2026 6:30 AM - 8:00 AM UTC
Room E(Main hall C)
座長:山本 頼弥(静岡大学)
[4E5-GS-11b-01]Efficient Guardrails for Large Language Models via Policy Filtering
〇Miyu Yamada2, Kunihiro Ito1 (1. NEC Corporation, 2. Institute of Science Tokyo)
[4E5-GS-11b-02]Evaluating the Consistency of Beliefs in Large Language Models
〇Tomoki Tsujimura1, Matīss Rikters1, Shusaku Egami1, Masaki Asada1, Tatsuya Ishigaki1, Ken Yano1, Hiroya Takamura1 (1. National Institute of Advanced Industrial Science and Technology)
[4E5-GS-11b-03]A Comparative Analysis of Human Evaluation and LLM-as-a-Judge for Safety Evaluation of Japanese LLM-based Systems
〇Masaki Fujita1, Takuya Komada1, Hiroshi Fujimoto1, Takeshi Yoshimura1 (1. NTT DOCOMO, INC)
[4E5-GS-11b-04]Exploring the Potential for Reward Hacking Mitigation Through LLM Steering
〇Taiga Sano1, Masami Takahashi1 (1. NTT, Inc.)
[4E5-GS-11b-05]Evaluation of Prompt Injection Attacks for Extracting Sensitive Data
〇Kenichiro Hayasaka1, Yoshihiro Koseki1, Toshiki Okahara1 (1. Mitsubishi Electric Corporation)
[4E5-GS-11b-06]Vulnerability Analysis of Three-Layer LLM Defense Systems and Implications for Defense Design
Naoya Takashima1, 〇Fubuki Sawa1, Kiyoto Hashimoto1, Kota Shimomura1,2, Hayato Fujihara1, Koki Inoue1, Takayoshi Yamashita2 (1. Elith Inc., 2. Chubu University)
