Presentation Information

[16p-K505-12]Benchmark for multi-modal LLM in Materials Science

〇Michiko Yoshitake1, Yuta Suzuki2, Ryo Igarashi1, Yoshitaka Ushiku1, Keisuke Nagato3 (1.OSX, 2.Osaka Univ., 3.Univ. Tokyo)

Keywords:

LLM,benchmark,multi-modal

For the evaluation of multi-modal LLMs, a benchmark Q&A dataset with figures for materials science that can not be solved without the interpretation of the meaning of figures are constructed. Q&A were taken from common university textbook, while figures are modified without altering the essence of questions and the correct answers to avoid copy right problems. The examples of figure modification and some answers from LMMs will be presented.

Comment

To browse or post comments, you must log in.Log in