Speculation grows on DeepSeek’s next AI model after OpenAI’s GPT-5

China’s leading artificial intelligence start-up, DeepSeek, finds itself at the centre of speculation about the release of its next-generation R2 reasoning model, following OpenAI’s launch of its GPT-5 model last week.

DeepSeek – which made waves in the industry with its V3 model in December and its R1 model in January – has not introduced any major updates to its products in recent months, aside from two minor revisions.

While the market had expected DeepSeek to introduce a new foundation model within months of R1’s release, the Hangzhou-based start-up – founded by computer scientist Liang Wenfeng as a side project of his quantitative trading firm – has yet to announce a schedule for the launch of the R2 model.

DeepSeek did not respond to a request for comment on Wednesday.

Developing and training an advanced model is an expensive and complex task, requiring substantial computing resources and training data, as well as sophisticated algorithms. OpenAI took nearly two and a half years to release GPT-5 after launching GPT-4 in March 2023.

Sam Altman, CEO of ChatGPT maker OpenAI. Photo: Reuters

According to a report by The Financial Times on Thursday, DeepSeek has delayed the launch of its new model owing to challenges in training it with Huawei Technologies’ Ascend AI chips. The start-up has faced “persistent technical issues” during the process, said the report, which cited an unidentified source.

  
