Presentation Information
[2I1-OS-49-04]Context-Dependent Parameter Modulation and Phase Transition-like Behavior in Social Learning among Large Language Model Agents
〇Kazuya Horibe1, Masto S Abe2,1, Takahiro Ezaki3, Peter Romero4,5, Wataru Toyokawa1 (1. RIKEN, 2. Doshisha University, 3. The University of Tokyo, 4. Universitat Politècnica de València, 5. University of Cambridge,)
Keywords:
Multi LLM Agents,Social Learning,Critical Dynamics
The potential impact of autonomous AI agents on information ecosystems and social processes has become a topic of increasing concern. To characterize collective dynamics among large language model (LLM) agents, we deployed GPT-5.2 and Claude-Sonnet-4.5 in a social learning task (multi-armed bandit with environmental change). RL parameter estimation revealed distinct profiles: GPT exhibited rapid annealing and strong conformity, leading to collective lock-in, while Claude maintained exploration. Mixed-agent experiments showed a sharp threshold effect at the 2:3 GPT:Claude ratio. Parameter re-estimation revealed asymmetric context-dependent modulation: GPT's shifts were extreme and systematic across all ratios, while Claude remained near baseline at the cascade threshold. A replay analysis showed that absorbing these context-dependent responses into individual RL parameters worsened predictions, suggesting dynamic strategy adjustment beyond fixed-parameter RL. These findings highlight group composition as a critical design variable for multi-agent LLM systems.
Comment
To browse or post comments, you must log in.Log in
