Coding Problem Example

Self-invoking code benchmarks help you decide which LLMs to use for your programming tasks

As large language models (LLMs) continue to improve at coding, the benchmarks used to evaluate their performance are steadily becoming less useful. That's because though many LLMs have similar high ...

11d

Nous Research's NousCoder-14B is an open-source coding model landing right in the Claude Code moment

B, an open-source AI coding model trained in four days on Nvidia B200 GPUs, publishing its full reinforcement-learning stack as Claude Code hype underscores the accelerating race to automate software ...

TechRadar

The future of coding has a vibe problem: balancing creativity with control

Recently awarded the 2025 word of the year by Collins Dictionary, vibe-coding has caught on as the latest AI trend. Many organizations are jumping on the bandwagon and proposing their own vibe led ...

9to5google

Gemini 2.5 Deep Think scores competitive coding gold in ‘profound leap’ for abstract problem-solving

After a mathematics win in July, Gemini 2.5 Deep Think has now earned a gold-medal level performance in competitive coding. The International Collegiate Programming Contest (ICPC) is the “oldest, ...

Ars Technica

Exhausted man defeats AI model in world coding championship

A Polish programmer running on fumes recently accomplished what may soon become impossible: beating an advanced AI model from OpenAI in a head-to-head coding competition. The 10-hour marathon left him ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results