NVIDIA’s Nemotron 3 Super Tops The Open-Source AI Model Chart, Beating DeepSeek & GPT-OSS

4 May 2026 at 20:30

NVIDIA's open-source "Nemotron 3 Super" AI model has topped the EnterpriseOps-Gym leaderboard, showcasing NVIDIA's software prowess.

NVIDIA Is Topping Both AI Hardware and Software Leaderboards With Its Open-Source Nemotron 3 Super, Leading The Pack

In March this year, NVIDIA introduced Nemotron 3 Super, a 120B-parameter AI model with 12B active parameters. Based on a hybrid MoE architecture, the model is designed to deliver 5x the throughput of the previous Nemotron Super model, and it handles large contexts with a native 1M-token context window that gives agents long-term memory for aligned, high-accuracy reasoning. Some of the highlights of NVIDIA's Nemotron […]
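As a quick illustration of the figures quoted above, the share of a mixture-of-experts model's parameters that is active per token is simply active divided by total. The sketch below uses only the 120B and 12B numbers from the excerpt; the function name `active_fraction` is hypothetical and not part of any NVIDIA tooling.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Return the fraction of parameters that are active per token.

    Both arguments are in billions of parameters. This is plain
    arithmetic on the figures quoted in the excerpt, not a measurement.
    """
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    if not 0 <= active_params_b <= total_params_b:
        raise ValueError("active_params_b must be between 0 and total_params_b")
    return active_params_b / total_params_b


# Figures from the excerpt: 120B total, 12B active.
fraction = active_fraction(120, 12)
print(f"{fraction:.0%} of parameters active per token")
```

With the quoted figures, roughly one tenth of the model's weights take part in each token's forward pass, which is the usual reason MoE designs can raise throughput relative to a dense model of the same total size.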

Read full article at https://wccftech.com/nvidia-nemotron-3-super-tops-the-open-source-ai-model-chart-beating-deepseek-gpt-oss/

NVIDIA Beats Everyone To DeepSeek V4 With Day-0 Blackwell Support, Pushing 3,500 Tokens Per Second On 1.6T Models

Wccftech

Hassan Mujtaba

26 April 2026 at 09:10

DeepSeek V4 is out, bringing major optimizations and model sizes of up to 1.6T parameters, and NVIDIA is ready with Day-0 support on Blackwell GPUs using NVFP4.

NVIDIA Blackwell NVFP4 Architecture Delivers Major Speed-Ups In DeepSeek V4 With More Optimizations On The Way

The launch of DeepSeek V4 brought major reductions in compute and memory requirements. The updated AI model uses just 27% of the single-token inference FLOPs and 10% of the KV cache when running a one-million-token context window. Two new models were also introduced, one being a Pro model with a parameter size of 1.6T, and a Flash version […]
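To put the quoted percentages in perspective, a "uses X% of" figure can be turned into an "N times less" factor by taking its reciprocal. The sketch below does only that arithmetic; the excerpt does not name the baseline the percentages are measured against, so the results are relative to that unstated baseline, and the helper name `reduction_factor` is hypothetical.

```python
def reduction_factor(remaining_fraction: float) -> float:
    """Convert 'uses this fraction of the baseline' into an 'N times less' factor."""
    if not 0 < remaining_fraction <= 1:
        raise ValueError("remaining_fraction must be in (0, 1]")
    return 1.0 / remaining_fraction


# Fractions quoted in the excerpt for a one-million-token context window.
flops_factor = reduction_factor(0.27)     # single-token inference FLOPs
kv_cache_factor = reduction_factor(0.10)  # KV cache size

print(f"FLOPs: about {flops_factor:.1f}x fewer")
print(f"KV cache: about {kv_cache_factor:.0f}x smaller")
```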

Read full article at https://wccftech.com/nvidia-beats-everyone-to-deepseek-v4-day-0-blackwell-support-pushing-3500-tokens-on-1-6t-models/