A small LLM running locally in your browser

After the model loads, switch off your internet connection and see the magic!

Best on mobile: q4f16, wasm

On PC, for fast answers: q4, webgpu; for a fast load: q4f16, webgpu
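
The recommendations above can be read as a small lookup from platform and priority to a quantization/backend pair. The sketch below is only an illustration of that mapping: the function `pickSettings`, the `Priority` labels, and the `isMobile` flag are hypothetical names, not part of this demo, and how a real page detects mobile devices or passes these values to its model loader is left open.

```typescript
// Hypothetical helper: the names and types here are illustrative assumptions.
// Only the value pairs (q4 / q4f16, wasm / webgpu) come from the text above.
type Dtype = "q4" | "q4f16";
type Device = "wasm" | "webgpu";
type Priority = "fast-answers" | "fast-load";

interface ModelSettings {
  dtype: Dtype;
  device: Device;
}

function pickSettings(
  isMobile: boolean,
  priority: Priority = "fast-answers",
): ModelSettings {
  // Mobile: one recommended combination, regardless of priority.
  if (isMobile) {
    return { dtype: "q4f16", device: "wasm" };
  }
  // PC: both options use webgpu; the quantization depends on the priority.
  return priority === "fast-answers"
    ? { dtype: "q4", device: "webgpu" }
    : { dtype: "q4f16", device: "webgpu" };
}

// Example: settings for a desktop user who cares most about load time.
const desktopFastLoad = pickSettings(false, "fast-load");
console.log(desktopFastLoad.dtype, desktopFastLoad.device);
```

Keeping the choice in one pure function makes it easy to adjust if the recommended combinations change.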
