How ChatGPT Can Help Visually Impaired Individuals?

abstract:

How ChatGPT Can Help Visually Impaired Individuals?

Kadir Uludag* , Min Zhao
Shanghai Mental Health Center, School of Medicine, Shanghai Jiao Tong University, China.
* Corresponding author. Email: kadiruludag@mails.ucas.ac.cn.
Manuscript submitted April 20, 2023;  accepted May 22, 2023; published March 29, 2023.
Abstract: Background: Visually impaired individuals face significant challenges in accessing information that can limit their educational, economic, and social opportunities. Technologies such as screen readers can help, but they may not be suitable for everyone. Chatbots offer a promising alternative, as they can provide personalized information and support them socially. Goal of Study: We aimed to discuss how ChatGPT can help visually impaired individuals. Methods: We conducted a review study to evaluate the effectiveness of ChatGPT as a practical tool for visually impaired individuals. The first version of the ChatGPT was used. Also, ChatGPT has answered several relevant questions. Results: ChatGPT was highly influential in creating accurate responses to user input and can help visually impaired individuals. Furthermore, ChatGPT can help blind people think about visual scenarios since ChatGPT can describe visual plans quickly. It can support them socially and emotionally. Conclusion: ChatGPT has the potential to help visually impaired individuals access information more effectively. Future studies should investigate how ChatGPT can help visually impaired individuals.

Key words: ChatGPT, visually impaired individuals, chatbots, artificial intelligence, blindness.
Commentary on paper, related information: 
how chatbots can be used to improve daily life of visually impaired?

## How Chatbots Can Improve Daily Life of Visually Impaired Individuals

Chatbots powered by large language models (LLMs) like ChatGPT, Claude, or Gemini offer **practical, conversational assistance** that goes far beyond traditional screen readers. Below is a structured overview of their daily-life applications, followed by practical considerations.

### Core Daily Functions

| Daily Need | How a Chatbot Helps | Example Interaction |
|————|———————|———————-|
| **Reading printed text** | Extract text from a photo (medicine bottle, receipt, menu) and read it aloud. | *User uploads photo of a prescription label. Chatbot: “The label says: ‘Take one tablet daily with food. Expires 12/2025.’”* |
| **Describing surroundings** | Interpret a room, street scene, or object from an image. | *“Describe this photo.” – “A white cat sitting on a blue sofa next to a green plant.”* |
| **Navigating indoors** | Explain spatial layout from a photo of a sign or room. | *“I see a door on the left, a staircase straight ahead, and a fire extinguisher on the right wall.”* |
| **Using public transport** | Read bus/train schedules, platform numbers, or transfer instructions. | *User shows photo of a timetable. Chatbot: “The next bus to City Center is at 10:15 AM from platform 2.”* |
| **Shopping & product info** | Identify products, read ingredients, check prices, or compare items. | *“Which of these two cereal boxes has less sugar?” (uploads both images).* |
| **Learning & work** | Summarise documents, explain diagrams, convert math to spoken format. | *“This graph shows sales increasing from January to March, then a drop in April.”* |
| **Social media access** | Describe memes, interpret emojis, read comments aloud. | *“The meme shows a confused dog with the text: ‘I have no idea what I’m doing.’”* |
| **Household tasks** | Read appliance settings, explain how to use a device, set voice reminders. | *“What do the symbols on my microwave keypad mean?” – detailed explanation.* |

### How Chatbots Outperform Traditional Assistive Tech

| Feature | Traditional Screen Readers / OCR | LLM‑Based Chatbots |
|———|——————————–|———————|
| **Natural conversation** | Fixed commands | Ask anything in plain language |
| **Context understanding** | No (reads only what’s selected) | Yes – can infer meaning from surrounding text or image |
| **Image description** | Limited or none | Detailed, contextual descriptions |
| **Answering follow‑ups** | No | Yes – “What color is the cat?” → “Orange.” |
| **Summarisation** | No | Yes – “Give me the key points of this 10‑page PDF.” |
| **Error correction** | No | User can say “That’s wrong – read again.” |

### Practical Setup for Daily Use

Visually impaired users can access chatbots via:

1. **Smartphone apps** (ChatGPT, Copilot, Gemini) with built‑in voice input (speech‑to‑text) and screen reader (TalkBack, VoiceOver) for output.
2. **Smart speakers** (Amazon Echo, Google Nest) that integrate with chatbot APIs – hands‑free voice only.
3. **Wearable cameras** (e.g., Envision Glasses, OrCam) connected to LLM backends – describe continuously.
4. **Custom shortcuts** – use voice commands like *“Hey Siri, ask ChatGPT to describe what I’m pointing my phone at.”*

### Key Limitations to Be Aware Of

| Limitation | Impact & Mitigation |
|————|———————-|
| **Hallucinations** (describing things that aren’t there) | **High risk** for navigation or medical info. Always cross‑check critical info with another source. |
| **Privacy** – uploading photos of home, mail, or documents | Avoid sharing sensitive personal data (passports, bank statements). Use offline or on‑device models when possible. |
| **Internet required** | No offline mode for most LLMs. Plan ahead – download maps or text offline. |
| **Latency** (3–10 seconds per response) | Not suitable for real‑time mobility (e.g., crossing a street). Use specialised apps for that. |
| **No spatial awareness** | Can’t tell distance, speed, or direction unless explicitly described in the image. |
| **Cost** – advanced models (GPT‑4V) may require a subscription | Free tiers (GPT‑3.5, Gemini Pro) are often sufficient for basic tasks. |

### Real‑World Example Scenario

> **Morning routine**: A visually impaired person wakes up. They point their phone at a box of tea. Chatbot: *“English Breakfast tea. Instructions: steep for 3 minutes in boiling water.”* Then they point at a calendar on the wall: *“Today’s entry: Doctor’s appointment at 10:30 AM, 221 Main Street.”* They ask: *“What’s the weather like?”* Chatbot (if integrated with web search): *“Sunny, 18°C. No rain expected.”* Finally, they upload a photo of a bus stop sign: *“Route 42 to downtown arrives in 5 minutes.”*

### How to Expand This Topic (Further Reading Ideas)

If you want to develop this into a full article, paper, or guide, consider these angles:

1. **Comparative study** – Test ChatGPT vs. Seeing AI vs. human assistant on 20 daily tasks.
2. **User‑centered design** – Interview 10 visually impaired users to learn what features they need most.
3. **Safety protocol** – Design a chatbot that explicitly says *“I’m uncertain”* or *“Please verify with someone sighted”* for high‑stakes tasks.
4. **Offline LLMs** – Evaluate small models (Llama 3, Phi‑3) that run entirely on a phone without internet.
5. **Voice‑first interface optimisation** – Reduce conversation turns and memory load.
6. **Integration with navigation aids** – Combine chatbot object description with GPS or LiDAR for safer mobility.

### Conclusion

Chatbots dramatically expand what visually impaired individuals can do independently – from reading fine print to understanding visual scenes. However, they are **a supplement, not a replacement**, for traditional assistive tech and human help. Used wisely, they improve confidence, reduce reliance on others, and make everyday information accessible.

> **Bottom line**: A chatbot in your pocket is like having a sighted friend who is always patient, but occasionally makes mistakes – so verify critical information.


Cite: Kadir Uludag, Min Zhao, “How ChatGPT Can Help Visually Impaired Individuals?,” Journal of Advances in Artificial Intelligence, Vol. 1, No. 1, pp. 49-56, 2023. doi: 10.18178/JAAI.2023.1.1.49-56

https://www.jaai.net/content-172-14-1.html
 

Leave a Reply