chore: remove deleted providers (Kies-LLM-Lokal, TrueNAS-AMD, Ollama-Git, Qwen, OpenAI) from MEMORY.md and TOOLS.md; add Gemma-4-31B as coding/git default
This commit is contained in:
29
TOOLS.md
29
TOOLS.md
@@ -73,6 +73,11 @@ Add whatever helps you do your job. This is your cheat sheet.
|
||||
```
|
||||
- Reply-To: `mki@kies-media.de` verwenden wenn möglich
|
||||
|
||||
### Aktive OpenClaw-Modelle
|
||||
- **Primary:** `z-ai/glm-5.1` (Alias: GLM-5.1)
|
||||
- **Fallback:** `openrouter/google/gemma-4-31b` (Alias: Gemma-4-31B) – auch für Coding & Git
|
||||
- **Free:** `z-ai/glm-4.7-flash` (GLM-4.7-flash-free), `openrouter/deepseek/deepseek-v4-flash:free` (DeepSeek-V4-Flash)
|
||||
|
||||
### OpenRouter
|
||||
- API Key: `$OPENROUTER_API_KEY` in `.bashrc`
|
||||
- URL: https://openrouter.ai/api/v1
|
||||
@@ -93,31 +98,7 @@ Add whatever helps you do your job. This is your cheat sheet.
|
||||
- Workspace Repo: `/root/.openclaw/workspace`
|
||||
- Credentials in `.bashrc`/`.profile`
|
||||
|
||||
### TrueNAS Ollama-Server (192.168.8.112)
|
||||
Zwei separate Ollama-Container mit unterschiedlichen GPUs.
|
||||
|
||||
#### AMD-Container (Port 11439)
|
||||
- URL: `http://192.168.8.112:11439/v1` (HTTP, kein Auth, apiKey: `ollama`)
|
||||
- **GPU:** AMD Radeon RX 7800 XT (Navi 32, Gigabyte, 16GB VRAM)
|
||||
- **Software:** Ollama
|
||||
- **Provider-Name:** `TrueNAS - AMD`
|
||||
- Verfügbare Modelle:
|
||||
- qwen3:32b (Q4_K_M, 32.8B, 18.8GB)
|
||||
- qwen3:14b (Q4_K_M, 14.8B, 8.6GB)
|
||||
- mistral-small3.2 (Q4_K_M, 24.0B, 14.1GB) – Multimodal (Text+Vision), 128K Context, Tool-Calling, Apache 2.0
|
||||
- gpt-oss:20b (MXFP4, 20.9B, 12.8GB) – OpenAI Open-Weight, MoE (~3.6B active/token), Tool-Use, Chat+Coding
|
||||
- **Aktuell:** qwen3:32b lädt die vollen 16GB VRAM → kein Platz für zweites Modell parallel
|
||||
- ⚠️ **Einschränkung:** 14b/32b können Befehle/Tool-Results nicht korrekt verarbeiten – der ~26k Token System-Prompt überfordert die Modelle
|
||||
- **Verwendung:** Notfall-Fallback wenn Cloud-Provider down – einfache Textantworten gehen, aber keine Tool-Nutzung
|
||||
- Alias: `Qwen3-14b-Truenas` / `Qwen3-32b-Truenas` / `Mistral-Small-Truenas` / `GPT-OSS-Truenas`
|
||||
|
||||
#### NVIDIA-Container (Port 11434)
|
||||
- URL: `http://192.168.8.112:11434/v1` (HTTP, kein Auth, apiKey: `ollama`)
|
||||
- **GPU:** NVIDIA RTX 3060 Ti Lite Hash Rate (GA104, 8GB VRAM)
|
||||
- **Software:** Ollama
|
||||
- Verfügbare Modelle: qwen3:32b, qwen3:14b, qwen3:8b, mistral-small3.2, llama3, gpt-oss:20b
|
||||
- ⚠️ **8GB VRAM-Limit:** qwen3:8b (5.2GB) oder llama3 (4.7GB) passen problemlos; qwen3:14b knapp (9.3GB, evtl. partial offload)
|
||||
- **Verwendung:** Kleine lokale Modelle für einfache Tasks
|
||||
|
||||
### TrueNAS Server (192.168.8.112)
|
||||
|
||||
|
||||
Reference in New Issue
Block a user