LocalAI Chat — Offline GGUF Assistant

Chat with any GGUF model 100% offline in your browser. Streaming, vision, PDF/Word/CSV analysis, workspace file editing.

As of June 2026, LocalAI Chat — Offline GGUF Assistant has users in the Productivity category.

Usersno change0%
Ratingno change0%
— reviews
Reviewsno change0%
Version
1.1.1
Manifest V3

History

1 snapshots

Tracking since Jun 30, 2026.

Not enough history yet for this metric — the chart fills in as we collect more snapshots.
View as table
DateUsersRatingReviewsVersion
Jun 30, 20261.1.1
Now1.1.1

Permissions & access

Permissions
tabsstorageunlimitedStorageactiveTab
Host access
file:///*

Screenshots

LocalAI Chat — Offline GGUF Assistant screenshot 1LocalAI Chat — Offline GGUF Assistant screenshot 2LocalAI Chat — Offline GGUF Assistant screenshot 3LocalAI Chat — Offline GGUF Assistant screenshot 4

About

LocalAI Chat lets you load and chat with any GGUF language model 
directly in your browser — no internet connection required after 
the extension is installed.

HOW IT WORKS
- Click the extension icon → a new tab opens automatically
- Drop any .gguf model file onto the page
- The model loads locally using WebAssembly (llama.cpp)
- Chat with full streaming output, just like ChatGPT — but 100% private

FEATURES
✓ Fully offline — zero network requests during inference
✓ Works with any GGUF model (Llama 3, Mistral, Qwen, Gemma, Phi, and more)
✓ Streaming token output with live tokens-per-second display
✓ Adjustable temperature, top-p, max tokens, context length, CPU threads
✓ Custom system prompt
✓ Chat history with copy button on every message
✓ Stop generation at any time
✓ New chat / change model buttons
✓ No account, no API key, no subscription

RECOMMENDED MODELS
Any Q4_K_M quantized model works well. Good starting points:
- Llama-3.2-1B-Instruct-Q4_K_M (fast, ~800 MB)
- Llama-3.2-3B-Instruct-Q4_K_M (balanced, ~2 GB)
- Mistral-7B-Instruct-Q4_K_M (powerful, ~4 GB)

Download models from: huggingface.co

PRIVACY
Your conversations and model files never leave your computer.
No analytics. No tracking. No telemetry. Ever.

Technical

Version
1.1.1
Manifest
V3
Size
2.63MiB
Min Chrome
88
Languages
1
Featured
No

Metadata

ID
hnkkljohhagdikkoanogcdiaepbceeci
Developer ID
u5dbec7f4ec302aabea5530d6363a8186
Developer Email
[email protected]
Created
Jun 29, 2026
Last Updated (Store)
Jun 29, 2026
Last Scraped
Jun 30, 2026
Website
Support URL

Data sourced from the Chrome Web Store · last verified Jun 30, 2026.