AI Content Describer

Par Carter Temm

This add-on makes it possible to describe the focus object, navigator object, or screen using popular vision capable AI language models, like Claude, Gemini, or GPT4. It also lets one understand where their face is positioned in the frame of a connected camera. Though content descriptions are quite detailed, they may not always be completely accurate or reflect real world information. To begin with GPT, head to https://platform.openai.com/account/api-keys and create an account, then create a key for interacting with the API. See add-on documentation for more information on this. Then, choose the "AI content describer" category from NVDA's settings dialog -> manage models and enter your API key. Press NVDA+shift+i to pop up a menu asking how you wish to describe based on the current position, or NVDA+shift+u to describe the navigator object, or NVDA+shift+y for an image that has been copied to the clipboard such as in windows explorer. Other keystrokes are customizable from the input gestures dialog.

Téléchargements disponibles

Autres détails