Upcoming DeepSeek AI model failed to train using Huawei’s chips

Chinese artificial intelligence company DeepSeek delayed the release of its new model after failing to train it using Huawei’s chips, highlighting the limits of Beijing’s push to replace US technology.

DeepSeek was encouraged by authorities to adopt Huawei’s Ascend processor rather than use Nvidia’s systems after releasing its R1 model in January, according to three people familiar with the matter.

But the Chinese startup ran into persistent technical problems while training its R2 model on Ascend chips, prompting it to use Nvidia chips for training and Huawei’s for inference, the people said.
