[3Q5-IS-2b-01] Enhancing 2D Pose Estimation Models through Adversarial Learning Techniques | 2024年度人工知能学会全国大会（第38回）

2024年度人工知能学会全国大会（第38回）

2024年5月28日〜5月31日アクトシティ浜松＋オンライン

戻る

2024年度人工知能学会全国大会（第38回）

2024年5月28日〜5月31日アクトシティ浜松＋オンライン

[3Q5-IS-2b-01]Enhancing 2D Pose Estimation Models through Adversarial Learning Techniques

〇Rina Komatsu¹, Tad Gonsalves¹(1. Sophia University)

2D pose estimation is utilized in sports and health analytics. Deep learning models have the potential to estimate poses using only a single human image without the need for motion capture suits. This study aimed to enhance the existing pose estimation model, PoseResNet, which uses Residual Nets to encode input images and to output heatmaps for relevant human joints. To improve this model, we employed the GAN method, training to generate realistic images through adversarial learning between a Generator and a Discriminator. For training 2D pose estimation, we used PoseResNet as the Generator and simple CNN layers implemented as the Discriminator. In our experiments, we employed the MPII Human Pose Dataset and compared three models: 1) PoseResNet, 2) PoseResNet employing adversarial learning based on Patch GAN, and 3) PoseResNet employing adversarial learning based on Patch GAN and CAM logits. Experimental results show that adapting PoseResNet to adversarial learning based on Patch GAN can lead to a significant improvement in the PCKh score, particularly when the adversarial loss is moderately scaled. However, we also observed that either using a strong scalar multiplication for adversarial loss or incorporating CAM logits tends to be less effective in enhancing the quality of pose estimation.

戻る

2024年度 人工知能学会全国大会（第38回）

2024年度 人工知能学会全国大会（第38回）

[3Q5-IS-2b-01]Enhancing 2D Pose Estimation Models through Adversarial Learning Techniques

2024年度人工知能学会全国大会（第38回）

2024年度人工知能学会全国大会（第38回）