Presentation Information

[10p-N304-2] Development of Digitization Tools from Plots in Published Papers

〇Yukari Katsura1,2,3, Aditya Acharya1,4, Tomoya Mato1, Eiji Koyama1, Dewi Yana1, Atsumi Tanaka1, Masahiko Demura1 (1.NIMS, 2.Univ. of Tsukuba, 3.RIKEN, 4.IIT (BHU) Varanasi)

Keywords: database, graph, literature data

We developed a tool that assists in extracting numerical data from line plots in research papers. The tool automatically extracts the individual curves in a plot, which it distinguishes by color, and we built a web system that covers the entire workflow: cropping plot images out of paper PDFs, labeling the axes, and extracting the numerical data. We also integrated a paper-summarization function that builds on Starrydata Auto-Summary GPT. For open-access papers whose licenses permit reuse, a commercial LLM summarizes the objective, approach, figures, samples, and conclusions of each paper, and users can refer to these summaries.
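The abstract does not describe how the extraction is implemented. As a rough illustration only, the sketch below shows one common way such a step could work: select the pixels close to a chosen curve color, then convert pixel positions to data values using two labeled reference points per axis. The function names (`extract_curve`, `pixel_to_data`, `demo`), the color tolerance, and the assumption of linear axes are all hypothetical and are not taken from the presented tool.

```python
import numpy as np


def extract_curve(image, color, tol=30.0):
    """Return pixel (column, row) coordinates of one curve, selected by color.

    image: H x W x 3 uint8 RGB array; color: target RGB triple.
    A pixel matches if its Euclidean RGB distance to `color` is <= tol.
    For each column containing matches, the mean matching row is returned.
    """
    diff = image.astype(float) - np.asarray(color, dtype=float)
    mask = np.sqrt((diff ** 2).sum(axis=2)) <= tol
    cols = np.where(mask.any(axis=0))[0]
    rows = np.array([np.mean(np.where(mask[:, c])[0]) for c in cols])
    return cols.astype(float), rows


def pixel_to_data(p, p0, p1, v0, v1):
    """Map pixel coordinate(s) p to data values on a linear axis.

    p0 and p1 are the pixel positions of two labeled axis points whose
    data values are v0 and v1 (an assumed calibration scheme).
    """
    return v0 + (np.asarray(p, dtype=float) - p0) * (v1 - v0) / (p1 - p0)


def demo():
    # Synthetic 101 x 101 white image standing in for a cropped plot.
    img = np.full((101, 101, 3), 255, dtype=np.uint8)
    for c in range(101):
        img[50, c] = (0, 0, 255)        # blue horizontal line
    for c in range(101):
        img[100 - c, c] = (255, 0, 0)   # red diagonal line (y = x in data units)

    cols, rows = extract_curve(img, (255, 0, 0))
    # Axis labels: column 0 -> x = 0, column 100 -> x = 10;
    # row 100 -> y = 0, row 0 -> y = 10 (image rows grow downward).
    x = pixel_to_data(cols, 0, 100, 0.0, 10.0)
    y = pixel_to_data(rows, 100, 0, 0.0, 10.0)
    return x, y
```

Calling `demo()` returns the data coordinates of the red curve only; the blue line is ignored because its color lies outside the tolerance. A real plot would also need handling for anti-aliasing, overlapping curves, markers, and logarithmic axes, none of which this sketch attempts.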