❌

Reading view

There are new articles available, click to refresh the page.

NVIDIA’s Nemotron 3 Super Tops The Open-Source AI Model Chart, Beating DeepSeek & GPT-OSS

NVIDIA's Nemotron 3 Super Tops The Open-Source AI Model Chart, Beating DeepSeek & GPT-OSS 1

NVIDIA's Open-Source "Nemotron 3 Super" AI model has topped the EnterpriseOps-Gym leaderboard, showcasing NVIDIA's software prowess. NVIDIA Is Topping Both AI Hardware and Software Leaderboards With Its Open-Source Nemotron 3 Super, Leading The Pack In March this year, NVIDIA introduced its Neomtron 3 Super, a 120B AI model with 12B active parameters. Based on a hybrid MoE architecture, the model is designed to deliver a 5x throughput versus the previous Nemotron Super model, and tackles large context with a native 1M-token context windows that gives agents long-term memory for aligned, high accuracy reasoning. Some of the highlights of NVIDIA's Nemotron […]

Read full article at https://wccftech.com/nvidia-nemotron-3-super-tops-the-open-source-ai-model-chart-beating-deepseek-gpt-oss/

NVIDIA Beats Everyone To DeepSeek V4 With Day-0 Blackwell Support, Pushing 3,500 Tokens Per Second On 1.6T Models

A person stands next to a large NVIDIA data center server rack with multiple GPUs and visible branding.

DeepSeek V4 is out, bringing major optimizations, including up to 1.6T model sizes, and NVIDIA is ready with Day-0 support on Blackwell GPUs using NVFP4. NVIDIA Blackwell NVFP4 Architecture Delivers Major Speed-Ups In DeepSeek v4 With More Optimizations On The Way With the launch of DeepSeek V4, we saw some major optimizations in compute & memory requirements. The updated AI modelΒ uses just 27% of single-token inference FLOPs & 10% of the KV cache when running a one-million-token context window. Two new models were also introduced, one being a Pro model with a parameter size of 1.6T, and a Flash version […]

Read full article at https://wccftech.com/nvidia-beats-everyone-to-deepseek-v4-day-0-blackwell-support-pushing-3500-tokens-on-1-6t-models/

❌