Vision Glasses V2 for the Blind to Transcribe Text to Audio
by akhilnagori

This project is the second, more advanced version of my Smart AI Glasses for the Blind, which transcribe text to audio in real time using a Raspberry Pi Zero. The glasses scan text and read it aloud, giving visually impaired individuals access to written text that isn't available in a form they can use (braille, audio, etc.).
The glasses use all the same materials as the first iteration (listed under Supplies), but I also added a few new components that make them easier to use, such as two pushbuttons that let the user control when the glasses turn on and when the camera takes a picture. I also improved the code for faster, more accurate results.
I was inspired to pursue this project when I went to India. In the building where I stayed, there was a blind child who enjoyed listening to stories read to him by his parents. However, he couldn't read any stories by himself. Although he had access to a small number of braille books, there were many pieces of text that he needed help accessing. His story inspired me to create something that would enable him to access more pieces of text.
Supplies

Materials used:
Raspberry Pi Zero 2 W
Zero Spy Camera for Raspberry Pi Zero
Two mini speakers
Any type of PLA Filament (I used Bambu Lab Filament)
PCB Circuit Board
3.7 volt Lithium Ion Battery - preferably with more than 1.2 amps
Jumper Wires - male to male
Power Booster (boost converter) to step the 3.7 volt battery up to the 5 volts the Raspberry Pi needs
Tools Needed:
3D Printer
Soldering Iron
Software



We completely redesigned the software. Instead of running the Doctr OCR model locally for text extraction, the code now sends the image to an OCR API, which is much faster, and it uses a subprocess to trigger the camera capture from inside the script.
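As a rough illustration of that design, here is a minimal sketch of a subprocess-based capture followed by a request to the OCR.space API (the service named in the Conclusion). This is not the project's actual main.py. The capture tool name (libcamera-still), the API key placeholder, the base64 upload, and all function names are assumptions made for the example; depending on your OS version the capture tool may instead be rpicam-still or raspistill.

```python
import base64
import json
import subprocess
import urllib.parse
import urllib.request

OCR_URL = "https://api.ocr.space/parse/image"  # OCR.space parse endpoint
API_KEY = "YOUR_API_KEY"  # placeholder: replace with your own OCR.space key


def build_capture_command(image_path, tool="libcamera-still"):
    """Command line for one still capture (the tool name is an assumption)."""
    # -o: output file, -t: delay in ms before the shot, -n: no preview window
    return [tool, "-o", image_path, "-t", "1000", "-n"]


def capture_image(image_path, tool="libcamera-still"):
    """Call the camera tool as a subprocess; raises if the tool fails."""
    subprocess.run(build_capture_command(image_path, tool), check=True)


def build_ocr_request(image_bytes):
    """Form-encode the image as a base64 data URI for the OCR request body."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    fields = {
        "language": "eng",
        "base64Image": "data:image/jpeg;base64," + encoded,
    }
    return urllib.parse.urlencode(fields).encode("ascii")


def extract_text(response):
    """Pull the recognized text out of a decoded OCR.space JSON response."""
    if response.get("IsErroredOnProcessing"):
        raise RuntimeError(str(response.get("ErrorMessage")))
    results = response.get("ParsedResults") or []
    return "\n".join(r.get("ParsedText", "") for r in results).strip()


def ocr_image(image_path, api_key=API_KEY):
    """Send a saved image to the OCR API and return the text it found."""
    with open(image_path, "rb") as image_file:
        body = build_ocr_request(image_file.read())
    request = urllib.request.Request(OCR_URL, data=body, headers={"apikey": api_key})
    with urllib.request.urlopen(request, timeout=30) as reply:
        return extract_text(json.load(reply))
```

On the Pi, calling capture_image("capture.jpg") and then ocr_image("capture.jpg") would take one photo and return its text. Keeping the command building and response parsing in separate functions lets you test them on a normal computer without a camera or network connection.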
Deploying to the Raspberry Pi:
Unless you are modifying the code, you will most likely just need to upload the code to the Raspberry Pi. To start off, enable SSH on the Raspberry Pi, so you can use the terminal directly through your other computer.
Code:
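Update Your System
Before installing dependencies, update your Raspberry Pi's package lists and installed packages (on Raspberry Pi OS, typically sudo apt update followed by sudo apt upgrade).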
Enable the Camera
If using Raspberry Pi OS:
Open the Raspberry Pi configuration tool (sudo raspi-config).
Go to Interfacing Options → Camera → Enable.
Reboot your Raspberry Pi (sudo reboot).
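Step 3: Install Required Software
Install Python Dependencies
Install the Python packages the script imports, including RPi.GPIO for the pushbuttons, along with the espeakng speech engine used for audio output.
Test if RPi.GPIO is Installed
Confirm that RPi.GPIO imports without errors in Python on the Raspberry Pi before continuing.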
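One way to check the installation from Python is sketched below. It reports which modules cannot be imported and which command-line tools are not on the PATH. The function names are made up for this example, and the espeak-ng command name is an assumption about how the speech engine is installed on your system.

```python
import importlib.util
import shutil


def missing_modules(names):
    """Return the module names from `names` that Python cannot find."""
    missing = []
    for name in names:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised when a parent package (e.g. "RPi") is itself missing.
            found = False
        if not found:
            missing.append(name)
    return missing


def missing_commands(names):
    """Return the command names from `names` that are not on the PATH."""
    return [name for name in names if shutil.which(name) is None]


if __name__ == "__main__":
    print("Missing Python modules:", missing_modules(["RPi.GPIO"]))
    print("Missing commands:", missing_commands(["espeak-ng"]))
```

If both lists print as empty, the pieces this sketch checks for are in place.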
Step 4: Write the Python Code
Create a new Python script on the Raspberry Pi (main.py) and paste the complete script into it.
Step 5: Run the Program
Run the script:
- The program waits for a button press.
- Press the button → it captures an image.
- The OCR extracts the text.
- The text is spoken using espeakng.
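The flow in the list above can be sketched as a small loop. This is only an illustration of the sequence, not the project's script. The capture, OCR, and speech steps are passed in as functions so the loop can be tried on any computer, and the GPIO pin number (17), the wiring (button between the pin and ground), and all function names are assumptions for the example.

```python
def read_once(capture, ocr, speak, image_path="capture.jpg"):
    """Run one capture -> OCR -> speech cycle and return the recognized text."""
    capture(image_path)
    text = ocr(image_path).strip()
    if text:
        speak(text)
    else:
        speak("No text found")
    return text


def run(wait_for_press, capture, ocr, speak, cycles=None):
    """Repeat read_once after each button press; cycles=None loops forever."""
    done = 0
    while cycles is None or done < cycles:
        wait_for_press()
        try:
            read_once(capture, ocr, speak)
        except Exception as error:
            # Keep the glasses running after a failed capture or request.
            speak("Something went wrong")
            print("cycle failed:", error)
        done += 1


def make_gpio_wait(pin=17):
    """Build a wait_for_press function with RPi.GPIO (only works on a Pi)."""
    import RPi.GPIO as GPIO  # imported here so the rest runs on any machine

    GPIO.setmode(GPIO.BCM)
    # Assumed wiring: button between the pin and ground, internal pull-up on.
    GPIO.setup(pin, GPIO.IN, pull_up_down=GPIO.PUD_UP)
    return lambda: GPIO.wait_for_edge(pin, GPIO.FALLING)
```

On the Pi, run(make_gpio_wait(), capture, ocr, speak) would block until the button is pressed, read the text once, and then wait for the next press. Off the Pi, you can pass simple stand-in functions to confirm the order of the steps.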
After following the steps above, you are ready to run the code. Upload main.py and tts.py to the Raspberry Pi, then copy the code from tts.py to the end of main.py so that both run in a single execution. You should now have working text-to-audio glasses, but you must replace the test image in main.py with <imagename>.jpg. This image will be used again later when setting up sound on the Raspberry Pi.
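The contents of tts.py are not reproduced here, so the following is not that file. It is only a minimal sketch of what a text-to-speech step could look like, assuming the espeak-ng command-line program is installed and called directly. The speed and voice defaults and the function names are illustrative choices.

```python
import subprocess


def clean_for_speech(text):
    """Collapse OCR line breaks and repeated spaces into single spaces."""
    return " ".join(text.split())


def build_speak_command(words_per_minute=150, voice="en"):
    """Command line for espeak-ng that reads its text from standard input."""
    # -s: speed in words per minute, -v: voice, --stdin: take text from stdin
    return ["espeak-ng", "-s", str(words_per_minute), "-v", voice, "--stdin"]


def speak(text, words_per_minute=150, voice="en"):
    """Read `text` aloud; does nothing if the text is empty after cleaning."""
    cleaned = clean_for_speech(text)
    if cleaned:
        command = build_speak_command(words_per_minute, voice)
        subprocess.run(command, input=cleaned, text=True, check=True)
```

Sending the text through standard input rather than as a command-line argument avoids problems when the recognized text happens to begin with a dash, which the program could otherwise mistake for an option.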
Hardware



These are the files you will need to print on a 3D printer. Use your printer's slicing software to slice the STL file.
Downloads
Conclusion

In conclusion, we successfully developed a prototype system that uses a Raspberry Pi, a camera module, and a push-button interface to capture images, extract text using Optical Character Recognition (OCR), and read the text aloud using text-to-speech (TTS) technology. This project was designed to assist individuals with visual impairments or reading difficulties by providing an easy-to-use, real-time text-reading solution. We implemented OCR using the OCR.space API and integrated espeakng for speech output. The system was optimized to function effectively in various lighting conditions and text formats, ensuring accessibility and ease of use. Through this project, we demonstrated the feasibility of a low-cost assistive device that enhances independence and daily interactions with printed text.
Demo

This is a demo of the project working.