Session Details

[4E5-GS-11b]AI and Society

Thu. Jun 11, 2026 3:30 PM - 5:00 PM JST
Thu. Jun 11, 2026 6:30 AM - 8:00 AM UTC
Room E(Main hall C)
座長:山本 頼弥(静岡大学)

[4E5-GS-11b-01]Efficient Guardrails for Large Language Models via Policy Filtering

〇Miyu Yamada2, Kunihiro Ito1 (1. NEC Corporation, 2. Institute of Science Tokyo)
Comment()

[4E5-GS-11b-02]Evaluating the Consistency of Beliefs in Large Language Models

〇Tomoki Tsujimura1, Matīss Rikters1, Shusaku Egami1, Masaki Asada1, Tatsuya Ishigaki1, Ken Yano1, Hiroya Takamura1 (1. National Institute of Advanced Industrial Science and Technology)
Comment()

[4E5-GS-11b-03]A Comparative Analysis of Human Evaluation and LLM-as-a-Judge for Safety Evaluation of Japanese LLM-based Systems

〇Masaki Fujita1, Takuya Komada1, Hiroshi Fujimoto1, Takeshi Yoshimura1 (1. NTT DOCOMO, INC)
Comment()

[4E5-GS-11b-04]Exploring the Potential for Reward Hacking Mitigation Through LLM Steering

〇Taiga Sano1, Masami Takahashi1 (1. NTT, Inc.)
Comment()

[4E5-GS-11b-05]Evaluation of Prompt Injection Attacks for Extracting Sensitive Data

〇Kenichiro Hayasaka1, Yoshihiro Koseki1, Toshiki Okahara1 (1. Mitsubishi Electric Corporation)
Comment()

[4E5-GS-11b-06]Vulnerability Analysis of Three-Layer LLM Defense Systems and Implications for Defense Design

Naoya Takashima1, 〇Fubuki Sawa1, Kiyoto Hashimoto1, Kota Shimomura1,2, Hayato Fujihara1, Koki Inoue1, Takayoshi Yamashita2 (1. Elith Inc., 2. Chubu University)
Comment()