Public Chat Assistant

Local AI (WebGPU) | optional server proxy

Chat

Runs fully in your browser using WebGPU.
Idle
Sizes are estimates from parameters & quantization (q4 ~ 0.5 byte/param) and may vary by packaging.