Ui.Vision
Task and UI test automation with Computer Vision/OCR. Ui.Vision combines browser automation and desktop automation.
As of June 2026, Ui.Vision has 200,000 users and a 3.91/5 rating from 239 reviews in the Developer Tools category.
Usersup 100.0 percent+100.0%
200.0K
200,000
Ratingno change0%
3.91
239 reviews
Reviewsno change0%
239
Version
9.6.0
Manifest V3
90-day change · In the last 90 days this extension gained 100.0K users, 1 version update.
History
4 snapshotsTracking since Apr 1, 2026.
View as table
| Date | Users | Rating | Reviews | Version |
|---|---|---|---|---|
| Apr 1, 2026 | 100.0K | 3.91 | 239 | 9.5.9 |
| Apr 19, 2026 | 100.0K | 3.91 | 240 | 9.5.9 |
| May 10, 2026 | 200.0K | 3.91 | 239 | 9.5.9 |
| Jun 20, 2026 | 200.0K | 3.90 | 238 | 9.6.0 |
| Now | 200.0K | 3.91 | 239 | 9.6.0 |
Permissions & access
- Permissions
- bookmarksclipboardReadclipboardWritecookiesdebuggerdownloadsdownloads.uinotificationsstoragetabsactiveTabproxynativeMessagingcontextMenuswebRequestwebRequestAuthProvidersidePanelscripting
- Host access
- <all_urls>
Screenshots
About
Ui.Vision is an open-source automation RPA software that combines classic browser automation with modern computer vision and OCR: (1) Browser Automation Ui.Vision's computer vision commands make automating tasks inside the web browser easy. Existing Selenium IDE scripts can be imported. Conversion help for iMacros scripts is provided, too. (2) Desktop Automation for Windows, Mac, and Linux Beyond web browser automation, Ui.Vision can interpret images and text on the desktop, executing actions like clicking, moving, dragging and dropping the mouse, and simulating keyboard inputs. This desktop automation requires installing the free Ui.Vision XModules, available for Windows, Mac, and Linux. These modules provide Ui.Vision with the necessary capabilities for desktop interaction. (3) Anthropic Claude Computer Use Integration The AI commands allow you to automate complex tasks with a single line of code that would traditionally require hundreds of lines of classic commands. For example, you can teach Ui.Vision to play TicTacToe with just one short "Play this game..." prompt. (4) Command Line API Ui.Vision can be controlled from the command line. Thus it can be used from, and combined with, any programming or scripting language, such as Windows batch files, Linux/Mac shell scripts, Python or PowerShell. (5) Open-Source Ui.Vision is Open-Source. The source code is available on Github. (6) 100% Local Software The software does not send any data back to us or any other place. Everything, including image recognition and OCR processing, is done locally on your machine. The only exception to the "all data is processed locally" rule is if you select the optional AI commands. But these cloud-features are disabled by default. The default computer vision (image recognition and OCR) runs 100% locally on the machine. **Happy Automating!** For questions and suggestions, please visit the Ui.Vision community forum at https://forum.ui.vision.
Technical
- Version
- 9.6.0
- Manifest
- V3
- Size
- 8.09MiB
- Min Chrome
- 88
- Languages
- 3
- Featured
- Yes
Metadata
- ID
- gcbalfbdmfieckjlnblleoemohcganoc
- Developer ID
- ufc6e1ece9a440ab3502424997bfc0216
- Developer Email
- [email protected]
- Created
- Aug 5, 2017
- Last Updated (Store)
- May 9, 2026
- Last Scraped
- Jun 20, 2026
- Website
- ui.vision
- Support URL
- https://forum.ui.vision/
- Privacy Policy
- https://ui.vision/privacypolicy
Similar extensions
Alternatives to Ui.Vision, ranked by description similarity.
Quantxt RPA
Automate web interactions with intelligent pattern matching and pre-defined action sequences
17
Crawlio for Chrome
Connect AI to your Browser. Let it navigate, click, take screenshots, inspect network traffic, and extract data.
21
ScanHUD: OCR & AI Analysis for Unselectable Logs
Extract text from unselectable logs with local OCR. Analyze them with AI using your own OpenAI API key.
10
Cuadex Browser Control
Cuadex browser automation for AI agents
12
UI Assist
AI-powered UI assistant that helps create and edit user interfaces through glutamate UIassist MCP server
25
★ 5.0
Browser Agent Extension
AI Agent browser control extension - MCP + WebSocket
82
Automize - Testing/Scraping Tool
Say goodbye to tricky element selection. Simplify scripting, mock network events, export to Puppeteer, Playwright and more.
1.0K
★ 4.4
Agentic Browser: AI Web Automation Tool
AI-powered browser automation tool! Automate web navigation, document processing, and data collection tasks.
23
Data sourced from the Chrome Web Store · last verified Jun 20, 2026.