The 40th Annual Conference of the Japanese Society for Artificial Intelligence, 2026

Presentation Information

[5Yin-A-15]Exploring Feedback-Driven Text Optimization for User-Level Fixed-Allocation A/B Testing

〇Masashi Ueno¹, tetsu sato¹, Naohiro Ikahata¹ (1. PERSOL CAREER CO., LTD.)

Keywords:

LLM,text optimization

User-level fixed-allocation A/B testing is a method that assigns each user to a specific group for the entire duration of an experiment and compares the performance of the groups. Attempts to optimize text in accordance with evaluation outcomes by repeatedly conducting fixed-allocation A/B tests have traditionally relied on manual trial and error, which incurs substantial human effort. In this study, with the aim of automating sequential A/B testing, we investigate a method for optimizing text with respect to an evaluation signal under the condition that the evaluation is provided only as numeric values, by iteratively running A/B tests. We propose a hybrid approach that combines TextGrad, which is used in prompt optimization, with a genetic algorithm. Using opinions extracted from the Perspectrum dataset which collects diverse perspectives on a given claim as optimization targets, we optimize text based on evaluation signals from an LLM-as-a-judge and evaluate the performance of the proposed method. The results show that the proposed method outperforms baseline approaches in later generations.

Back to Session information