Frontier lab releases, open-source checkpoints, multimodal systems, inference stacks, and model capability shifts.
OpenAI’s GPT‑OSS open‑weight model positioned as near‑frontier coding and reasoning system
A recent walkthrough highlights OpenAI’s **GPT‑OSS** as a state‑of‑the‑art open‑weight model available in ~120B and ~20B parameter sizes, released under Apache 2.0 with weights for self‑hosting.[3] Benchmarks in the review show the 120B model approaching GPT‑4‑class performance on coding (Codeforces), MMLU (~90%), GPQA, and medical benchmarks, while the 20B variant targets efficient on‑device and edge deployment.[3]
Open models like Qwen, DeepSeek, Kimi K2, and GPT‑OSS closing the gap with closed frontier systems
OpenA Red Hat Developer review of open models notes that 2025–2026 open systems such as **Qwen**, **DeepSeek**, **Kimi K2**, and **gpt‑oss** offer strong performance and can be run locally via engines like **Ollama**, **RamaLama**, and **llama.cpp**‑based stacks.[2] The article emphasizes that open models now reach roughly 90% of the performance of leading closed models when released, while offering substantially lower inference cost and flexible deployment.[2][4]
Data from model trackers show a crowded 2026 frontier with multimodal as baseline capability
OpenAn AI frontier comparison covering 22 models across GPT, Claude, Gemini, DeepSeek, Qwen, and Kimi notes that by 2025–2026, essentially all major models support text, image, and document input, making multimodality a baseline rather than a differentiator.[5] Complementary datasets like Epoch AI’s model database track thousands of models and define ‘frontier models’ as those in the top 10 by training compute at release, showing a rapid cadence of increasingly compute‑intensive systems.[6]