OpenAI Releases O3-Level Open Source Models

OpenAI is making O3-level capabilities open source ahead of the release of GPT-5 in about two days. These open source models are trained for agentic workflows, supporting function calling, web search, Python execution, configurable reasoning effort, and full raw chain-of-thought access. The two models, gpt-oss-120b and gpt-oss-20b, are state-of-the-art open-weight language models that deliver strong real-world performance …

Read more

New DeepSeek Janus Pro 7B Beats OpenAI DALL-E 3 on Image Generation

DeepSeek just dropped a new open-source multimodal AI model, Janus-Pro-7B, released under the MIT open-source license. It can generate images and beats OpenAI’s DALL-E 3 and Stable Diffusion on the GenEval and DPG-Bench benchmarks. This comes on top of all the R1 hype. Here is the link to the DeepSeek Janus 7B GitHub. NEWS: DeepSeek just …

Read more

Open Source DeepSeek R1 Runs at 200 Tokens Per Second on Raspberry Pi

Experimenters report overnight tests confirming that they have OPEN SOURCE DeepSeek R1 running at 200 tokens per second on a NON-INTERNET connected Raspberry Pi. This is a distilled model, smaller than the OpenAI O1-class model. Folks, I think we have done it! If overnight tests are confirmed we have OPEN SOURCE DeepSeek R1 running …

Read more

Open Source DeepSeek R1 Matches OpenAI O1 Math, Code and Reasoning

DeepSeek R1 is an open-source model. DeepSeek is a Chinese AI research company backed by High-Flyer Capital Management, a quant hedge fund focused on AI applications for trading decisions. They have released models under open-source licenses like MIT. How did they match or even surpass OpenAI’s O1? Reinforcement Learning Focus: DeepSeek-R1 and its variant, …

Read more

Meta Llama 3 70B Open Source Model Beats Claude 3 Sonnet

Llama 3 is Meta’s latest generation of models, with state-of-the-art performance and efficiency for openly available LLMs. Meta AI is available online for free. The small 8B model beats Mistral 7B and Gemma 7B. The 70B beats Claude 3 Sonnet (closed source Anthropic model) and competes against Gemini Pro 1.5 (closed source model …

Read more

Will Meta Have a Zettaflop of AI Compute in 2024?

Mark Zuckerberg plans on acquiring 350,000 Nvidia H100 GPUs to help Meta build a next-generation AI that possesses human-like intelligence. Zuckerberg mentioned the figure today as he announced his company’s long-term effort to develop an artificial general intelligence (AGI), or an AI that can learn and be used to perform a variety of tasks. By …

Read more

Llama 2 is the Best Open Source LLM so Far

Llama 2 is a collection of pretrained and fine-tuned large language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Meta’s fine-tuned LLMs, called Llama 2-Chat, are optimized for dialogue use cases. The models outperform open-source chat models on most benchmarks Meta tested, and based on its human evaluations for helpfulness and …

Read more