China Telecom Completes Industry's First 1024-GPU, Trillion-Parameter Model joint Training over 500 Kilometers
ICC Information News -- Recently, under the unified organization of China Telecom Group Corporation, China Telecom Research Institute, Tianyi Cloud, and Beijing Telecom successfully completed the industry's first real-user trial commercial use of a distributed joint training for a 1024-GPU, trillion-parameter commercial large model. Utilizing an actual optical loopback between Wuqing and Yinghai, they achieved a 500-kilometer long-distance interconnection for distributed training, with training performance reaching over 97% that of a single data center. This significant breakthrough paves a new path for cross-regional collaborative development in large model training.
The trial was conducted on Beijing’s existing 800G wide-area intelligent lossless network and the Xiyang one-stop intelligent computing service platform. Breakthroughs were made in interconnected distance, bandwidth convergence ratio, and model parameters, achieving multi-data center interconnection and resource integration to support distributed joint training of commercial models.
The success of this trial commercial use is a result of China Telecom's continuous innovation and practice in the field of intelligent computing networks. It also represents an important measure in actively responding to national strategies aimed at promoting coordinated development of computational power networks. In the future, China Telecom will continue to increase investment and research efforts in the intelligent computing network domain, providing stronger network support for the development of the artificial intelligence industry and contributing to the high-quality growth of China’s digital economy.