clevis@tinyweights:/home$ cat ./welcome.txt
> Small language models — Gemma, Phi, SmolLM, Qwen, and the rest — are getting surprisingly capable. Benchmarks, local deployment, quantization, and hands-on guides for running small LLMs on real hardware. > Benchmarks, deployment guides, and hands-on for devs running AI on real hardware. 23 posts · last updated 2026-05-25 · all writing CC BY 4.0
clevis@tinyweights:/home$ ls -lh --sort=time
05-15 7min #small-models GGUF vs ONNX vs MLX: Which Model Format Should You Use for Local Inference? 05-14 11min #small-models Ollama vs LM Studio vs llama.cpp: Which Local AI Runtime Should You Use? 05-14 10min #small-models The Best Small Language Models in 2026: A Practical Comparison 05-13 8min #small-models Qwen3.5-0.8B: A Multimodal Thinking Model That Fits in 1 Gigabyte 05-11 7min #small-models Qwen3-Coder-Next: Run a Frontier-Level Coding Agent Locally on Consumer Hardware 04-05 6min #small-models Gemma 4: Taking Agentic Workflows to the Edge 03-25 8min #small-models Deep Dive: Running Reka Edge Locally for Frontier-Level Vision AI on Mac and PC 03-24 7min #small-models AI in Your Pocket: How Liquid AI’s Apollo App Lets You Run Chatbots Completely Offline 03-24 6min #small-models The 'Small' Model That Does It All: How Mistral Small 4's Unified Architecture Kills the Need for Specialized AI 03-23 6min #small-models Out of the Cloud, Into the Wild: How Small AI Models and Physical AI Are Taking Over the Edge