Search results for Kimi K2.6
AI & Enterprise
Nvidia unveils 550 billion-parameter Nemotron 3 Ultra, starts mass production of Vera Rubin
Nvidia unveiled its 550 billion-parameter open AI model Nemotron 3 Ultra and said it has started mass production of its next AI server platform Vera Rubin. CEO Jensen Huang introduced both at a GTC Taipei 2026 keynote. Nvidia said the model will be released in open form this week and highlighted competitiveness in benchmarks and token generation speed, while an external index score trailed a leading Chinese model.
AI & Enterprise
Cerebras compresses 163 seconds into 5 seconds, says GPU era is over
AI chip designer Cerebras put the 1 trillion-parameter open-weight model Kimi K2.6 into its enterprise inference service and achieved 981 tokens per second, a pace it says is the world’s fastest. It also cut the time to complete 500 output tokens from a 10,000-token input to 5.6 seconds, versus 163.7 seconds on the official Kimi endpoint. The company is pursuing an IPO and reported 2025 revenue of $510 million and net profit of $238 million.
AI & Enterprise
AI IQ project compares GPT-5.5, Gemini and Claude at a glance
A project called AI IQ has been released to compare the performance of the latest AI models with a single score. Engineer and entrepreneur Ryan Shay (Ryan Shay) said he converts multiple public benchmark results into an estimated IQ scale and averages four areas including abstract, math, programming and academic reasoning. The comparison includes models such as GPT-5.5, Claude Opus 4.7 and Gemini 3.1, and also shows trends over time and cost per IQ.