Is Deepseek Training Lying About Chips Used for Training its AI ?

Altimeter Capital analyst and partner puts what Deepseek claims and results into numbers. $6M Training Costs = Plausible IMO Quick math: Training costs ∝ (active params * tokens). DeepSeek v3 (37B params; 14.8T tokens) vs. Llama3.1 (405B params; 15T tokens) = v3 theoretically should be 9% of Llama3.1’s cost. And the disclosed actual figures aligned …

Read more

Using Ollama to Install Deepseek 14B on a Laptop

Langchain used ollama to install Deepseek 14B on a laptop. They used for a local deep researching model. $ ollama pull deepseek-r1:14b $ export TAVILY_API_KEY= $ uvx –refresh –from “langgraph-cli[inmem]” –with-editable . –python 3.11 langgraph dev Brian WangBrian Wang is a Futurist Thought Leader and a popular Science blogger with 1 million readers per month. …

Read more

More AI Efficiency Will See More Demand for AI

Making AI to 10 to 30 times more efficient for AI inference and getting more value from training will increase AI demand. Increased AI efficiency on training and inference will accelerate the improvement and usefulness of AI. Dr Know It All went over the DeepSeek paper and explains how they automated the Reinforcement Learning. AlphaZero …

Read more

New DeepSeek Janus Pro 7B Beats OpenAI Dall-E 3 on Image Generation

DeepSeek just dropped a new open-source multmodal AI model, Janus-Pro-7B. It is MIT opensource license. It’s multimodal (can generate images) and beats OpenAI’s DALL-E 3 and Stable Diffusion across GenEval and DPG-Bench benchmarks. This comes on top of all the R1 hype. Here is the link to the Deepseek Janus 7B Github. NEWS: DeepSeek just …

Read more

Tiny DeepSeek 1.5B Models Run on $249 NVIDIA Jetson Nano

Youtuber, Ominous Industries, ran a couple of versions of the DeepSeek R1 1.5B of models running locally on the NVIDIA Jetson Nan. The newly released distilled DeepSeek models were explroed. The DeepSeek R1 1.5B model delivers impressive performance with plenty of room to spare on the Jetson. He shows the installation process, followed by a …

Read more

Open Source DeepSeek R1 Runs at 200 Tokens Per Second on Raspberry Pi

Experimenters have had overnight tests confirming they have OPEN SOURCE DeepSeek R1 running at 200 tokens per second on a NON-INTERNET connected Raspberry Pi. This is a distilled smaller model than the OPenAI O1 class model. Folks, I think we have done it! If overnight tests are confirmed we have OPEN SOURCE DeepSeek R1 running …

Read more

Open Source DeepSeek R1 Matches OpenAI O1 Math, Code and Reasoning

DeepSeek R1 is an open sourced model. DeepSeek is a Chinese AI research company backed by High-Flyer Capital Management, a quant hedge fund focused on AI applications for trading decisions. They have released models under open-source licenses like MIT. How did they match or even surpassing OpenAI’s O1: Reinforcement Learning Focus: DeepSeek-R1 and its variant, …

Read more