GLM-5.1 Ships Open-Weight 754B Model That Tops SWE-Bench Pro
Z.AI released GLM-5.1, a 754-billion-parameter open-weight model that claims the top spot on SWE-Bench Pro with a score of 58.4, surpassing Claude Opus 4.6 and GPT-5.4.
Z.AI released GLM-5.1, a 754-billion-parameter open-weight model that claims the top spot on SWE-Bench Pro with a score of 58.4, surpassing Claude Opus 4.6 and GPT-5.4.
Meta confirmed it will release open-source versions of its upcoming Mango multimedia generator and Avocado LLM, extending its open-weight strategy to next-gen frontier models.
In April 2026, a creator with a $1,200 desktop PC and a 12GB graphics card can generate publication-quality images in under 10 seconds, produce video clips, run a conversational AI, and compose music without sending a single byte to the cloud.
Netflix has released VOID (Video Object and Interaction Deletion), its first public AI model. The open-source tool removes objects from video while preserving physically plausible interactions.
DeepSeek V4 will run exclusively on Huawei Ascend 950PR processors, marking a milestone in China's push to build advanced AI without American chip technology.
Alibaba Wan2.7 video generation model is now available directly in ComfyUI through Partner Nodes, bringing comprehensive video creation capabilities to the popular node-based workflow tool.
Tencent has released OmniWeaving, a unified video generation model that handles seven distinct tasks from text-to-video to reasoning-augmented generation, with publicly available weights and code.
Google released Gemma 4, a family of four open multimodal models under Apache 2.0. The 31B dense model scores 89.2% on AIME 2026, quadrupling Gemma 3. For creators and developers building on open models, this changes everything.
Sony AI has released Woosh, an open source sound effects foundation model supporting text-to-audio and video-to-audio generation.
The first week of April 2026 delivered three significant open-source audio AI models that directly challenge paid alternatives in voice synthesis, sound effects, and multilingual speech.
Liquid AI releases LFM2.5-350M, a 350M parameter open-weight model that runs AI agents on mobile GPUs in just 81MB with 32k context window.
Arcee AI releases Trinity-Large-Thinking under Apache 2.0, ranking #2 on PinchBench behind Claude Opus 4.6 at just $0.90 per million output tokens.
Alibaba released Wan2.7-Image, a unified AI model that handles both image generation and editing in one system, with fine-grained personalization, color control, and a Pro variant capable of 4K output.
OmniVoice is a zero-shot text-to-speech model supporting 600+ languages with voice cloning, an Apache 2.0 license, and inference 40x faster than real-time.
Technology Innovation Institute releases Falcon Perception, a compact 0.6B parameter vision model that beats Meta SAM 3 across six key benchmarks while running on consumer hardware.
PrismML emerged from stealth and open-sourced Bonsai 8B, a true 1-bit large language model that fits 8.2 billion parameters into just 1.15 GB of memory, roughly 14x more compressed than comparable 16-bit models.
H Company released Holo3, a vision-language model optimized for GUI agents that sets new state-of-the-art on the OSWorld-Verified benchmark, beating proprietary models with only 3B active parameters.
Skywork AI releases Matrix-Game-3.0, an open-source model that generates real-time interactive video at 720p and 40fps.
NVIDIA has open-sourced Kimodo, the largest controllable motion diffusion model ever trained. Trained on over 700 hours of professional motion capture data, it generates 3D human and robot motions from text prompts.
Mistral Voxtral TTS is a 4B parameter open-weights model that matches ElevenLabs quality in human evaluations, with 3-second voice cloning across 9 languages and self-hosting support.
Researchers released daVinci-MagiHuman, a 15-billion-parameter open-source model that generates realistic human video with synchronized speech in two seconds.
The performance gap between open-source and closed AI models has collapsed from 17.5 percentage points to effectively zero on knowledge benchmarks by early 2026. For creators, that shift changes everything.
OpenAI has launched Parameter Golf, an open research competition offering $1M in computing credits to whoever can compress AI models into just 16MB with only 10 minutes of training time.
Corridor Digital co-founder Niko Pueringer releases CorridorKey, an AI chroma keyer that handles hair, motion blur, and transparency at VFX-production quality.