Will XAI Grok 4.2 and Grok 5 Have Improved Architecture Help Finally Get the AI Lead?

Grok 4 had lower lmarena scores than I had projected based upon the amount AI training compute used. I had projections for Grok 3.5 which was renamed Grok 4. We do not have elo lmarena score for Grok 4 heavy. My old projection had assumed what will now be called Grok 5 was called Grok …

Read more

Figure AI Helix Humanoid Robot Loading Washing Machine

Figure AI videos of the Helix Humanoid Robot loading washing machines in a residential setting and sorting objects from a conveyor belt for industrial settings. Brian WangBrian Wang is a Futurist Thought Leader and a popular Science blogger with 1 million readers per month. His blog Nextbigfuture.com is ranked #1 Science News Blog. It covers …

Read more

Microsoft and China AI Research Possible Reinforcement Pre-Training Breakthrough

Reinforcement Pre-Training (RPT) is a new method for training large language models (LLMs) by reframing the standard task of predicting the next token in a sequence as a reasoning problem solved using reinforcement learning (RL). Unlike traditional RL methods for LLMs that need expensive human data or limited annotated data, RPT uses verifiable rewards based …

Read more

More AI Efficiency Will See More Demand for AI

Making AI to 10 to 30 times more efficient for AI inference and getting more value from training will increase AI demand. Increased AI efficiency on training and inference will accelerate the improvement and usefulness of AI. Dr Know It All went over the DeepSeek paper and explains how they automated the Reinforcement Learning. AlphaZero …

Read more

OpenAI Releases O3 Model With High Performance and High Cost

OpenaI o3 sets new records in several key areas, particularly in reasoning, coding and mathematical problem-solving. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task in compute ) and 87.5% in high-compute mode (thousands of $ per task). It’s very expensive. It is not just brute force. These capabilities are …

Read more

Llama 3.1 405 billion Parameter Released

Llama 3.1 405B is the first openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation. With the release of the 405B model, Meta supercharges innovation—with unprecedented opportunities for growth and exploration. They believe the latest generation of Llama will …

Read more

Human Teams Can Often Beat Individual Results and AI Teams Can Also Improve Results

If Large Language Models debate their answers they can reach better answers. A complementary approach to improve language responses where multiple language model instances propose and debate their individual responses and reasoning processes over multiple rounds to arrive at a common final answer. The findings indicate that this approach significantly enhances mathematical and strategic reasoning …

Read more