Presentation Information

[WP-F-9]From In-Network AllReduce to In-NIC AllReduce in Support of Optically Switched GPU Networks

○Libin Wang1, Hongxiang Guo1, Cen Wang2, Yuepeng Wu1, Xiyang Lan1, Jian Wu1 (1. Beijing University of Posts and Telecommunications (China), 2. KDDI Research (Japan))

Keywords:

Distributed Learning,AllReduce,In-Network Computing,Optical Switching,Wavelength-selective Switches for Routing,FPGA-based NIC

We propose in-NIC AllReduce to lower communication times, adapting to optically switched GPU network for distributed training. The experimental results show a 1.89× acceleration aligning with a theoretical analysis of 1.90× acceleration on average.

Comment

To browse or post comments, you must log in.Log in