Ui.Vision

Task and UI test automation with Computer Vision/OCR. Ui.Vision combines browser automation and desktop automation.

As of June 2026, Ui.Vision has 200,000 users and a 3.91/5 rating from 239 reviews in the Developer Tools category.

Users: 200,000 (200.0K), up 100.0%
Rating: 3.91, no change
Reviews: 239, no change
Version: 9.6.0 (Manifest V3)
90-day change: in the last 90 days this extension gained 100.0K users and had 1 version update.

History

4 snapshots

Tracking since Apr 1, 2026.

[Chart: user count from Apr 1, 2026 to Jun 20, 2026; axis ticks at 92.0K, 150.0K, and 208.0K]
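| Date | Users | Rating | Reviews | Version |
|---|---|---|---|---|
| Apr 1, 2026 | 100.0K | 3.91 | 239 | 9.5.9 |
| Apr 19, 2026 | 100.0K | 3.91 | 240 | 9.5.9 |
| May 10, 2026 | 200.0K | 3.91 | 239 | 9.5.9 |
| Jun 20, 2026 | 200.0K | 3.90 | 238 | 9.6.0 |
| Now | 200.0K | 3.91 | 239 | 9.6.0 |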

Permissions & access

Permissions: bookmarks, clipboardRead, clipboardWrite, cookies, debugger, downloads, downloads.ui, notifications, storage, tabs, activeTab, proxy, nativeMessaging, contextMenus, webRequest, webRequestAuthProvider, sidePanel, scripting
Host access: <all_urls>

About

Ui.Vision is open-source RPA (robotic process automation) software that combines classic browser automation with modern computer vision and OCR:

(1) Browser Automation

Ui.Vision's computer vision commands make automating tasks inside the web browser easy. Existing Selenium IDE scripts can be imported. Conversion help for iMacros scripts is provided, too.

(2) Desktop Automation for Windows, Mac, and Linux

Beyond web browser automation, Ui.Vision can interpret images and text on the desktop and act on them: clicking, moving the mouse, dragging and dropping, and simulating keyboard input.

This desktop automation requires installing the free Ui.Vision XModules, available for Windows, Mac, and Linux. These modules provide Ui.Vision with the necessary capabilities for desktop interaction.

(3) Anthropic Claude Computer Use Integration

The AI commands let you automate, with a single line of code, complex tasks that would traditionally require hundreds of lines of classic commands. For example, you can teach Ui.Vision to play TicTacToe with just one short "Play this game..." prompt.

(4) Command Line API

Ui.Vision can be controlled from the command line. It can therefore be used from, and combined with, any programming or scripting language, such as Windows batch files, Linux/Mac shell scripts, Python, or PowerShell.
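
As a rough illustration of driving Ui.Vision from Python, the sketch below only builds the command that would be launched; it does not start a browser. It assumes the approach of opening an exported autorun page in the browser with query parameters. The parameter names `macro`, `direct`, `closeRPA`, and `savelog`, the page name `ui.vision.html`, and all paths are assumptions recalled from the Ui.Vision command line documentation rather than anything stated on this page, so check them against the current docs. The helper `build_autorun_command` is a hypothetical name.

```python
from urllib.parse import urlencode


def build_autorun_command(browser, autorun_url, macro, log_name=None):
    """Return an argv list that opens the autorun page and runs one macro.

    All query parameter names below are assumptions, not confirmed here.
    """
    params = {
        "macro": macro,    # assumed: name of the macro to run
        "direct": "1",     # assumed: start immediately, without a prompt
        "closeRPA": "1",   # assumed: close the Ui.Vision window afterwards
    }
    if log_name:
        params["savelog"] = log_name  # assumed: write a log file when done

    # Keep "/" unescaped so macro folder paths stay readable in the URL.
    query = urlencode(params, safe="/")
    return [browser, f"{autorun_url}?{query}"]


if __name__ == "__main__":
    # Hypothetical browser name and autorun page location.
    cmd = build_autorun_command(
        "chrome",
        "file:///home/user/ui.vision.html",
        "Demo/Core/DemoAutofill",
        log_name="log.txt",
    )
    print(cmd)
    # To actually launch it: subprocess.run(cmd, check=True)
```

Returning an argument list rather than a single shell string keeps the launch step free of shell quoting issues when it is later passed to `subprocess.run`.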

(5) Open-Source

Ui.Vision is open source. The source code is available on GitHub.

(6) 100% Local Software

The software does not send any data to us or anywhere else. Everything, including image recognition and OCR processing, is done locally on your machine.

The only exception to the "all data is processed locally" rule is the optional AI commands. These cloud features are disabled by default. The default computer vision (image recognition and OCR) runs 100% locally on your machine.

**Happy Automating!**

For questions and suggestions, please visit the Ui.Vision community forum at https://forum.ui.vision.

Technical

Version: 9.6.0
Manifest: V3
Size: 8.09 MiB
Min Chrome: 88
Languages: 3
Featured: Yes

Metadata

ID: gcbalfbdmfieckjlnblleoemohcganoc
Developer ID: ufc6e1ece9a440ab3502424997bfc0216
Developer Email: [email protected]
Created: Aug 5, 2017
Last Updated (Store): May 9, 2026
Last Scraped: Jun 20, 2026
Website: ui.vision

Data sourced from the Chrome Web Store · last verified Jun 20, 2026.