English < Article List - DigitalToday

Search results for MoE

AI & Enterprise

DeepSeek V4 seen having bigger impact than R1 on strong price performance

China\'s AI company DeepSeek has launched its new V4 models, drawing attention for offering open-source, near-frontier performance at a much lower price than Opus 4.7 or GPT-5.5. The V4 Pro and V4 Flash models were trained on about 33 trillion tokens and post benchmark results close to those rivals. Commentator Matthew Berman said pricing could pull U.S. companies toward DeepSeek, though geopolitical risks remain.

AI & Enterprise

Nota tops Nvidia Nemotron hackathon overall

Nota, an AI model lightweighting and optimisation company, won the overall top prize at the Nvidia Nemotron hackathon, finishing first among 20 teams. It used synthetic data generation technology specialised for mixture-of-experts (MoE) quantisation. The event was held to share research results from Nvidia\'s open-source AI model Nemotron and improve the practical application capabilities of domestic developers, and ran in three tracks.

Industry

Nvidia shifts AI chip battleground from specs to end-to-end efficiency

Nvidia said it will shift competition in AI semiconductors from chip specifications to end-to-end efficiency across pre-training, post-training, inference and agents. It also disclosed measured results showing Blackwell-based GPUs deliver 55 times faster mixture-of-experts inference than the prior Hopper generation. Nvidia highlighted efficiency gains from a new numeric format and software advances, and introduced curriculum-based post-training results for its Nemotron 3 Nano model. It also said Korean partners are participating and unveiled a Korean-focused synthetic dataset.