Unlocking the Secrets of DeepSeek Diving

Published On Wed Feb 05 2025

DeepSeek Diving + The News You Missed

For those who are new around here, this is an extension of my weekly newsletter where I highlight new and innovative AI products that are worth exploring.

Hey hey! Happy not-Wednesday, here’s the news in your newsletter! This was supposed to land Friday, Jan 31st, but due to some technical errors it is now landing in your inboxes Feb 4th 😅 In this issue:

The DeepSeek Splash 🌊

Out of stealth mode comes a new LLM challenger: DeepSeek. Boasting bold benchmark performance, this model was allegedly developed for a fraction of the cost of OpenAI’s models on a per-iteration basis. Visit their website here.

Your Weekly Industry News Recap

A new entrant swings hard right out of the gate: New entrants are nothing new to the AI scene, especially not for LLMs. But one that comes in this late in the game swinging this hard is refreshing, and frankly unexpected!

The potential to shrink operating costs vs. competitors: Startups and those experimenting with LLMs in their environments are now looking at lower operating costs, since DeepSeek’s architecture reportedly has little need for high-end GPUs. See more at Hugging Face.

DeepSeek Proves that Reinforcement Learning has a place in LLMs

It’s the biggest proof yet that large models can dramatically improve their performance through pure reinforcement learning. VentureBeat nicely covers the basics of what it is and why it’s impressive. Read more on VentureBeat.

Mind you, even when GPT-4 was released, OpenAI had tried adding a layer of reinforcement learning to tie a bow on their model’s development, and had only noticed minor increases in performance.

I ran this LLM through a gauntlet of my own design: coding questions, philosophy, rationales behind decisions, detailed science questions, niche local law, etc., and it all came back pretty clean. I steered clear of politics due to some obvious statements in the Privacy Policy. Diving deeper into some obscure structures of Quebec law was no problem at all for this model: it came back with links, citations, and recommendations for further sources.

When I asked a vague technical question about building a DIY air purifier, I turned on the R1 reasoning feature and got a stream-of-consciousness response leading up to the “official” response to my prompt. Even after I pressed the system further by diving deeper into my lines of inquiry, the R1 model impressed with the thoroughness of its responses.

L’Oréal & IBM’s AI for Cosmetics: AI is transforming beauty! L’Oréal and IBM introduce generative AI for personalized skincare. Read more here.

The API legend veers into AI: Postman’s AI Agent Builder Launch: API development is getting smarter! Postman debuts an AI agent builder to automate API processes. Read more here.

The AI Governance Balancing Act (World Bank): How should regulators balance AI’s potential with risks? Read more here.