Beyond LoRA: Hugging Face Tests Better Fine-Tuning Methods
Hugging Face benchmarked LoRA against newer PEFT methods like OFT, which beat it on image generation while using less memory. The default is not always the best fit.
Hugging Face benchmarked LoRA against newer PEFT methods like OFT, which beat it on image generation while using less memory. The default is not always the best fit.
Google is building a Skills Marketplace for Gemini Enterprise, a hub where teams pick predefined AI skills instead of configuring each capability by hand.
Anthropic disabled Claude Fable 5 and Mythos 5 for every customer worldwide after a US government export-control directive on June 12, 2026.
Moonshot released Kimi K2.7-Code on June 12, an open-weights coding model that beats Claude Opus 4.8 on MCPMark tool use at far lower cost.
Anthropic apologized on June 11 for a hidden Claude Fable 5 guardrail that quietly degraded results instead of refusing them, and said it will make the safeguard visible.
Apple WWDC 2026 unveiled five new Foundation Models, a rebuilt Siri AI, Image Playground, and Spatial Reframing, built with Google Gemini collaboration.
Xiaomi MiMo-V2.5-Pro-UltraSpeed claims 1000 tokens per second decode on a 1T MoE. Open-weights FP4 checkpoint plus a 2-week free API trial.
Microsoft used its Build 2026 keynote on June 2 to ship MAI-Code-1-Flash, an in-house coding model that is live today inside the GitHub Copilot model picker for Free, Pro, Pro+, and Max tiers.
Microsoft's MAI-Thinking-1, a 35B sparse MoE with 256k context, tops Claude Sonnet 4.6 in blind evaluations and matches Claude Opus 4.6 on SWE-Bench Pro.
Nvidia is bringing Cosmos, Nemotron, GR00T, and Ising under OpenMDW-1.1. Here is what the unified AI model license means for creative AI developers.
Xiaomi cut MiMo-v2.5 API prices up to 99% effective May 27 across chat, image, audio, video understanding, and TTS, with OpenAI and Anthropic compatible endpoints.
PrismML released Bonsai Image 4B with 1-bit and ternary checkpoints under Apache 2.0. The model retains 95% of FLUX.2 Klein 4B quality at 6.4x smaller size and runs directly on iPhone.
Alibaba released Qwen3.6-27B on April 22, 2026. The 27B dense open-weight model beats its larger 35B-A3B sibling on coding, agent, and vision benchmarks.
Alibaba released Qwen3.6-Max-Preview on April 20, 2026, topping six programming benchmarks including SWE-benchPro. Here is what it means for creators building AI workflows.
Meta launched Muse Spark on April 8, the first AI model built by its new Superintelligence Labs division. Unlike every Llama release before it, Muse Spark is proprietary.
A model called HappyHorse-1.0 has taken the top spot on Artificial Analysis text-to-video leaderboard with an ELO rating of 1365, beating Seedance 2.0 and Kling 3.0 Pro.
Cognition released SWE-1.6, making it generally available in Windsurf with a free tier for three months. The model improves SWE-Bench Pro scores by over 10% at up to 950 tokens per second.
Z.AI released GLM-5.1, a 754-billion-parameter open-weight model that claims the top spot on SWE-Bench Pro with a score of 58.4, surpassing Claude Opus 4.6 and GPT-5.4.
Meta confirmed it will release open-source versions of its upcoming Mango multimedia generator and Avocado LLM, extending its open-weight strategy to next-gen frontier models.
DeepSeek V4 will run exclusively on Huawei Ascend 950PR processors, marking a milestone in China's push to build advanced AI without American chip technology.
Google released Gemma 4, a family of four open multimodal models under Apache 2.0. The 31B dense model scores 89.2% on AIME 2026, quadrupling Gemma 3. For creators and developers building on open models, this changes everything.
Liquid AI releases LFM2.5-350M, a 350M parameter open-weight model that runs AI agents on mobile GPUs in just 81MB with 32k context window.
A misconfigured data cache exposed Anthropic's biggest secret: Claude Mythos, an AI model the company says represents a step change over Claude Opus 4.6 in coding, reasoning, and cybersecurity.
Anthropic accidentally exposed Claude Mythos, described as its most powerful model ever, through an unsecured data cache. The model shows major advances in coding, reasoning, and cybersecurity.