ai:llm
Differences
This shows you the differences between two versions of the page.
| ai:llm [2026/01/26 02:34] – [LLM in one line] phong2018 | ai:llm [2026/01/26 02:45] (current) – [10. Popular models and which ones can run locally] phong2018 | ||
|---|---|---|---|
| Line 145: | Line 145: | ||
| ===== 10. Popular models and which ones can run locally ===== | ===== 10. Popular models and which ones can run locally ===== | ||
| - | Cloud-only (generally not downloadable): | ||
| - | * GPT (OpenAI) | ||
| - | * Claude (Anthropic) | ||
| - | * Gemini (Google) | ||
| - | Open-weight/ | + | ==== Quick mental model fields |
| - | * LLaMA family (Meta) | + | For each model below, capture: |
| - | | + | * **1) Learned experience = Parameters / Weights** (e.g., 7B/8B/70B) |
| - | * Qwen | + | * **2) What it can produce = Vocabulary** |
| - | | + | * **3) Temporary memory = Context window** |
| - | * Phi (small, efficient) | + | |
| - | Local runtimes | + | ==== Cloud-only (generally not downloadable) ==== |
| - | * **Ollama** | + | === GPT (OpenAI) === |
| - | * **LM Studio** | + | * **Parameters/ |
| - | * **llama.cpp** | + | * **Vocabulary: |
| + | * **Context window:** Varies by model tier/ | ||
| + | * Notes: Strong general reasoning + tool ecosystem. | ||
| + | |||
| + | === Claude (Anthropic) === | ||
| + | * **Parameters/ | ||
| + | * **Vocabulary: | ||
| + | * **Context window:** Varies by model tier/ | ||
| + | * Notes: Strong long-form writing and code assistance. | ||
| + | |||
| + | === Gemini (Google) === | ||
| + | * **Parameters/ | ||
| + | * **Vocabulary: | ||
| + | * **Context window:** Varies by model tier/ | ||
| + | * Notes: Strong multimodal and large-context options (depending | ||
| + | |||
| + | ==== Open-weight/ | ||
| + | === LLaMA family (Meta) === | ||
| + | * **Parameters/ | ||
| + | * **Vocabulary:** Fixed per LLaMA generation (the tokenizer and vocabulary size depend on the version). | ||
| + | * **Context window:** Varies by generation (older generations have smaller windows; newer ones may be larger). | ||
| + | * Local use: Best with quantized GGUF via llama.cpp / Ollama / LM Studio. | ||
| + | |||
| + | === Mistral / Mixtral === | ||
| + | | ||
| + | * **Vocabulary: | ||
| + | * **Context window:** Varies by release/ | ||
| + | * Local use: Mistral 7B-class is popular for fast local inference. | ||
| + | |||
| + | === Qwen === | ||
| + | * **Parameters/ | ||
| + | * **Vocabulary: | ||
| + | * **Context window:** Varies by release/ | ||
| + | * Local use: Often strong multilingual performance. | ||
| + | |||
| + | === DeepSeek (code variants are especially strong) === | ||
| + | * **Parameters/ | ||
| + | * **Vocabulary: | ||
| + | * **Context window:** Varies by release/ | ||
| + | * Local use: Code-focused variants are widely used for dev tasks. | ||
| + | |||
| + | === Phi (small, efficient) === | ||
| + | * **Parameters/ | ||
| + | * **Vocabulary: | ||
| + | * **Context window:** Varies by version. | ||
| + | * Local use: Great for low-resource devices; fast inference. | ||
| + | |||
| + | ==== Local runtimes on macOS ==== | ||
| + | === Ollama === | ||
| + | * Purpose: Simplest local runner (download + run models easily). | ||
| + | * Works best with: Quantized GGUF models. | ||
| + | |||
| + | === LM Studio === | ||
| + | * Purpose: GUI app to download, run, and chat with local models. | ||
| + | * Works best with: Quantized GGUF models. Also offers easy model management. | ||
| + | |||
| + | === llama.cpp | ||
| + | | ||
| + | | ||
| + | |||
| + | ==== Glossary (hard terms) ==== | ||
| + | * **parameter** / | ||
| + | * **weight** /weɪt/: trọng số | ||
| + | * **vocabulary** / | ||
| + | * **token** / | ||
| + | * **context window** / | ||
| + | * **proprietary** / | ||
| + | * **open-weight** /ˌoʊpən ˈweɪt/: mở trọng số (công bố weights) | ||
| + | * **quantized** / | ||
| + | * **runtime** / | ||
| + | * **variant** / | ||
| + | * **Mixture-of-Experts (MoE)** / | ||
| ===== 11. Local model size estimates on Mac ===== | ===== 11. Local model size estimates on Mac ===== | ||
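The three "mental model" fields from section 10 can be written down as a small record, and the parameter count can then be turned into a rough size estimate for a quantized local model. The sketch below is illustrative only: `ModelCard`, `estimate_weight_gb`, and the `example-7b` name are hypothetical and do not come from any runtime. It assumes the weights dominate the size, with every parameter stored at the same bit width. Real GGUF files are usually somewhat larger, and a running model needs extra memory for its context window.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ModelCard:
    """Hypothetical record of the three 'mental model' fields for one model."""
    name: str
    params_billions: float                 # learned experience: parameters / weights
    vocab_size: Optional[int] = None       # what it can produce; None = not looked up
    context_window: Optional[int] = None   # temporary memory, in tokens; None = not looked up


def estimate_weight_gb(params_billions: float, bits_per_weight: float) -> float:
    """Rough size of the weights alone, in decimal GB.

    Assumption: every parameter uses exactly bits_per_weight bits.
    File metadata and runtime memory are ignored.
    """
    # (params_billions * 1e9 params) * (bits / 8 bytes per param) / 1e9 bytes per GB
    return params_billions * bits_per_weight / 8


# Placeholder 7B-class model; vocabulary and context window left unset on purpose.
card = ModelCard(name="example-7b", params_billions=7)
for bits in (16, 8, 4):
    size_gb = estimate_weight_gb(card.params_billions, bits)
    print(f"{card.name} @ {bits}-bit: ~{size_gb:.1f} GB")
```

Under these assumptions a 7B model needs about 14 GB at 16-bit and about 3.5 GB at 4-bit, which is why quantized builds are the usual choice for local use.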
ai/llm.1769394845.txt.gz · Last modified: by phong2018
