The 40th Annual Conference of the Japanese Society for Artificial Intelligence, 2026

Presentation Information

[3Yin-A-41]Spatial Reasoning for Lunar Rovers Using Vision-language model with Top-Down Terrain Maps

〇Takashi Kubo¹, Naoya Takeishi¹, Takehisa Yairi¹ (1. University of Tokyo)

Keywords:

LLM,Spatial reasoning,lunar rover

本研究は、VLMが月面環境での空間推論にどの程度利用可能かを基礎的に検証し、改善手法を導入することを目的とする。waypointまでの経路を出力するタスクに対し、DEM・クレーターマスクから生成したEgocentric Rendering画像と固定カメラ画像をVLMに入力する手法を検討した。月面模擬環境での実験により、ベースラインと比較して検討手法はクレーターを含む空間認識・waypointまでの経路説明の精度を改善する可能性を確認した。

Back to Session information