TinyWeights.dev

clevis@tinyweights:/home$ cat ./welcome.txt

 ████████╗██╗███╗   ██╗██╗   ██╗
 ╚══██╔══╝██║████╗  ██║╚██╗ ██╔╝
    ██║   ██║██╔██╗ ██║ ╚████╔╝
    ██║   ██║██║╚██╗██║  ╚██╔╝
    ██║   ██║██║ ╚████║   ██║
    ╚═╝   ╚═╝╚═╝  ╚═══╝   ╚═╝

 ██╗    ██╗███████╗██╗ ██████╗ ██╗  ██╗████████╗███████╗
 ██║    ██║██╔════╝██║██╔════╝ ██║  ██║╚══██╔══╝██╔════╝
 ██║ █╗ ██║█████╗  ██║██║  ███╗███████║   ██║   ███████╗
 ██║███╗██║██╔══╝  ██║██║   ██║██╔══██║   ██║   ╚════██║
 ╚███╔███╔╝███████╗██║╚██████╔╝██║  ██║   ██║   ███████║
  ╚══╝╚══╝ ╚══════╝╚═╝ ╚═════╝ ╚═╝  ╚═╝   ╚═╝   ╚══════╝

> Small language models — Gemma, Phi, SmolLM, Qwen, and the rest — are getting surprisingly capable. Benchmarks, local deployment, quantization, and hands-on guides for running small LLMs on real hardware. > Benchmarks, deployment guides, and hands-on for devs running AI on real hardware. 23 posts · last updated 2026-05-25 · all writing CC BY 4.0

clevis@tinyweights:/home$ ls -lh --sort=time

05-25 9min #small-models How to Run a Small LLM in Your Browser with WebLLM (No Install, No API) 05-25 6min #small-models How to Run Ministral 3 Locally: Mistral's 3B, 8B, and 14B Vision Models 05-24 7min #small-models MedGemma 1.5: Google's 4B Medical Vision-Language Model You Can Run Locally 05-24 6min #small-models Qwen3.5-4B vs Phi-4-mini: Choosing the Right 4B Model for Local Inference 05-23 11min #small-models Running Local LLMs on Low-VRAM Windows GPUs (6GB and 8GB Cards) in 2026 05-21 10min #small-models What Can You Actually Do With a Local Small LLM? A Practical Guide 05-20 8min #small-models Running LLMs on Raspberry Pi 5: A Practical Guide with Real Benchmarks 05-19 7min #small-models The Complete Guide to Running Small LLMs on Apple Silicon (2026) 05-18 6min #small-models How to Run Phi-4-mini Locally: Microsoft's 3.8B Model with 128K Context 05-17 8min #small-models How Much RAM Do You Actually Need to Run Local LLMs?