Chinese AI Firm Zhipu Says it Trained Major Model on Huawei Chips


TL;DR

  • Domestic Training: Zhipu AI became the first Chinese company to train a major AI model entirely on Huawei’s domestic chips.
  • Hardware Independence: The GLM-Image system was developed using Huawei’s Ascend processors without any US semiconductor technology.
  • Open Source Strategy: Zhipu released the model as open-source software to build a developer ecosystem and compete against better-funded rivals.
  • Export Control Impact: The achievement demonstrates that US chip restrictions have not prevented China from developing competitive AI systems.


Zhipu AI announced this week that it has become the first Chinese company to train a major AI model entirely on domestic chips, using Huawei’s Ascend processors to develop its GLM-Image model without US semiconductor technology.

The achievement validates that China’s multi-billion dollar investment in domestic semiconductor infrastructure can power competitive AI systems despite American technological containment strategies.

Breaking US Semiconductor Dependence

According to Zhipu, the entire training pipeline for GLM-Image was conducted on Huawei’s Ascend Atlas 800T A2 server, incorporating the company’s in-house Ascend AI processors and MindSpore machine learning framework. DeepSeek’s well-documented difficulties training models on Huawei hardware makes Zhipu’s achievement particularly notable. The success demonstrates that complete domestic training is technically feasible despite earlier high-profile failures.

GLM-Image’s release as open-source software provides Chinese developers with a reference implementation demonstrating domestic chip viability for computationally intensive AI tasks.

Inside GLM-Image’s Architecture

The GLM-image model employs an autoregressive and diffusion hybrid architecture. The system uses an autoregressive encoder to process text prompts, then feeds representations to a diffusion decoder that generates images through iterative denoising.

GLM-Image was developed using the Ascend Atlas 800T A2 server with four Kunpeng 920 processors. The entire training pipeline used Huawei’s MindSpore machine learning framework.



Source link

Recent Articles

spot_img

Related Stories