Hi everyone,
I wanted to share something I came across while doing some research this morning. There’s a new NVDA add-on called Vision Assistant Pro, which integrates Google’s Gemini models to provide advanced AI capabilities directly within NVDA.
This add-on does a lot more than solve CAPTCHAs on the web. It’s designed to assist with a variety of visual tasks, and judging from the documentation, it looks promising. I’ve included the link below, where you can read more about its features and download it:
Download & Details:
Before trying the add-on, you’ll need to create an API key in Google AI Studio. I highly recommend generating this key in advance and saving it in a file somewhere safe on your computer. You can create your key here:
Google AI Studio – API Key:
https://aistudio.google.com/apikey
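For anyone comfortable with a little Python, here is a rough sketch of one way to keep the key in a plain text file and read it back cleanly. It is only an illustration and has nothing to do with how the add-on itself works; the function names and any file name you choose are my own assumptions. The point is simply to strip stray spaces or line breaks, which tend to sneak in when a key is copied and pasted.

```python
from pathlib import Path


def save_api_key(key: str, path: Path) -> None:
    """Write the key to a text file, trimmed of stray whitespace.

    Hypothetical helper for illustration; not part of the add-on.
    """
    path.write_text(key.strip() + "\n", encoding="utf-8")


def load_api_key(path: Path) -> str:
    """Read the key back, dropping the trailing newline and any spaces."""
    key = path.read_text(encoding="utf-8").strip()
    if not key:
        # An empty file usually means the key was never pasted in.
        raise ValueError(f"No key found in {path}")
    return key
```

You could call save_api_key once with the key you copied from Google AI Studio and a path of your choosing, then use load_api_key whenever you need to paste it somewhere. Keep that file out of shared or synced folders if you can.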
I haven’t had the chance to fully test how well Vision Assistant Pro performs. I quickly visited a fairly complex website and attempted to solve a CAPTCHA, but I had to leave for work before checking the results. I’ll test it more thoroughly later.
If anyone needs help creating the API key or installing the add-on, feel free to reach out, on list or off list. I’ll do my best to assist.
Warm regards,
Mister Kayne
"It's a bitter sweet symphony this life…"
Author: The Somebody, Nobody, Anybody & Everybody Blog!
Mail: writ...@mister-kayne.com