Presentation Information

[15p-S2_204-14]Effects of Structural Representations and Vocabulary Design on Crystal Property Prediction with Large Language Models

〇Shuichiro Ozawa1, Izumi Takahara1, Teruyasu Mizoguchi2 (1.Sch. of Eng., UTokyo, 2.IIS, UTokyo)

Keywords:

Large Language model,Crystal property prediction,Crystal structure description

We investigate the prediction of formation energies of crystalline materials using large language models (LLMs), focusing on (1) natural-language representations of crystal structures and (2) the effect of domain-specific tokenization that treats materials-science terminology as single tokens on prediction accuracy. In particular, we show that fine-tuning is effective when using detailed structure descriptions that include not only composition and space group but also atomic coordination environments.