Presentation Information
[D-12-30]Consistency Improvement of Image Captioning Methods using Vision Language Grounding Models
◎Shunichi Kaizu1, Kazuaki Nakamura1 (1. Tokyo Univ. of Science)
Keywords:
Image Captioning,Dense Captioning,Vision Language Grounding Models
