Auto-commit: 2026-05-24 19:58

2026-05-24 19:58:21 +00:00
parent c3e17a04e0
commit 6ffe5c413d
16 changed files with 704 additions and 0 deletions
--- a/TOOLS.md
+++ b/TOOLS.md
@@ -93,6 +93,28 @@ Add whatever helps you do your job. This is your cheat sheet.
 - Workspace Repo: `/root/.openclaw/workspace`
 - Credentials in `.bashrc`/`.profile`

+### TrueNAS Ollama-Server (192.168.8.112)
+Zwei separate Ollama-Container mit unterschiedlichen GPUs.
+
+#### AMD-Container (Port 11439)
+- URL: `http://192.168.8.112:11439/v1` (HTTP, kein Auth, apiKey: `ollama`)
+- **GPU:** AMD Radeon RX 7800 XT (Navi 32, Gigabyte, 16GB VRAM)
+- **Software:** Ollama
+- **Provider-Name:** `TrueNAS - AMD`
+- Verfügbare Modelle: qwen3:32b (Q4_K_M, 32.8B, 20.2GB), qwen3:14b (Q4_K_M, 14.8B, 9.3GB)
+- **Aktuell:** qwen3:32b lädt die vollen 16GB VRAM → kein Platz für zweites Modell parallel
+- ⚠️ **Einschränkung:** Kann Befehle/Tool-Results nicht korrekt verarbeiten – der ~26k Token System-Prompt überfordert das Modell
+- **Verwendung:** Notfall-Fallback wenn Cloud-Provider down – einfache Textantworten gehen, aber keine Tool-Nutzung
+- Alias: `Qwen3-14b-Truenas` / `Qwen3-32b-Truenas`
+
+#### NVIDIA-Container (Port 11434)
+- URL: `http://192.168.8.112:11434/v1` (HTTP, kein Auth, apiKey: `ollama`)
+- **GPU:** NVIDIA RTX 3060 Ti Lite Hash Rate (GA104, 8GB VRAM)
+- **Software:** Ollama
+- Verfügbare Modelle: qwen3:32b, qwen3:14b, qwen3:8b, mistral-small3.2, llama3, gpt-oss:20b, deepseek-v3.2:cloud, kimi-k2.5:cloud
+- ⚠️ **8GB VRAM-Limit:** qwen3:8b (5.2GB) oder llama3 (4.7GB) passen problemlos; qwen3:14b knapp (9.3GB, evtl. partial offload)
+- **Verwendung:** Kleine lokale Modelle für einfache Tasks
+
 ### TrueNAS Server (192.168.8.112)

 **Hardware:**