Chinese startup DeepSeek is making waves in the artificial intelligence (AI) industry with its cost-efficient large language models, which it claims can perform just as well as those developed by giants like OpenAI and Meta. The company's flagship R1 reasoning model is said to achieve "performance comparable" to OpenAI's o1, while its newly released Janus Pro multimodal AI model is reportedly capable of outperforming Stable Diffusion and DALL-E 3.
DeepSeek's ChatGPT competitor has quickly soared to the top of the App Store, with downloads spiking shortly after the release of its R1 reasoning model on January 20th. The AI assistant, powered by the startup's "state-of-the-art" DeepSeek-V3 model, allows users to ask questions, plan trips, generate text, and more. However, the company has begun restricting signups due to "malicious attacks" on its services, with an incident report page stating that registrations are being temporarily limited.
The startup's claims have significant implications for the AI industry, as they suggest that powerful AI models can be built using fewer resources than previously thought. DeepSeek's models are reportedly built using less cash and fewer GPUs than those developed by OpenAI, Meta, Google, Microsoft, and others. If true, this could indicate that the startup has managed to overcome the strict US export controls that prevent chipmakers like Nvidia from selling high-performance graphics cards in China.
The impact of DeepSeek's models is already being felt in the financial markets, with shares of Nvidia dipping 17 percent by 2PM on January 27th. DeepSeek's disruption of the AI industry has sparked interest in cost-efficient AI development, with many wondering if its models can truly rival those of the industry giants.
DeepSeek's Janus Pro multimodal AI model, released on January 27th, is the latest development in the company's quest to build open-source AI models using fewer resources. The model is said to beat comparable models on two AI benchmark tests, although its analysis of input images is limited to a resolution of 384x384. The company's claims have sparked debate in the AI community, with some experts questioning the model's limitations.
As the AI industry continues to evolve, the rise of cost-efficient models like those developed by DeepSeek could have significant implications for the future of AI development. With its models already moving the financial markets, it remains to be seen whether DeepSeek can maintain its momentum and continue to disrupt the industry.