How We Built Sanchar Mitra
What does a better world look like?
To us, a better world is one where no one is excluded from communication.
According to the Census of India, more than 5 million individuals live with hearing and speech disabilities. Despite rapid advances in Artificial Intelligence, many assistive technologies still depend on cloud services, smartphones, and internet connectivity. This creates barriers in accessibility, privacy, and affordability.
At the same time, modern AI systems are becoming increasingly resource-hungry, relying on powerful GPUs and large cloud infrastructure.
This inspired us to develop Sanchaar Mitra, an AI-powered communication assistant built on Edge AI principles. The project demonstrates how intelligent systems can run efficiently on embedded hardware while creating meaningful social impact.
Our goal is simple:
AI for All. AI for Social Good.
.webp)

Hardware
- STM32N6570-DK Discovery Kit
- USB Type-C Cable
- Laptop / PC for host-side processing
- Onboard Camera (STM32N6570-DK)
- Microphone
- Speaker / Audio Output Device
- Integrated LCD Display
- UART Communication Interface
Software & Development Tools
- STM32CubeIDE
- STM32CubeProgrammer
- STM32 Model Zoo
- X-CUBE-AI
- Python
- Scikit-learn
- Vosk (Offline Speech-to-Text)
- Ollama
- Llama 3.2
- PySerial
Machine Learning Resources
- ASL Alphabet Hand Landmark Dataset (Kaggle)
- Custom Sign Language Gesture Dataset
- Random Forest Classifier
Optional Future Enhancements
- Raspberry Pi 5
- Battery Pack
- Dedicated Camera Module
- Custom PCB
- 3D Printed Enclosure
Key Technologies Used
- Edge AI
- Embedded Machine Learning
- Computer Vision
- Sign Language Recognition
- Speech-to-Text (STT)
- Text-to-Speech (TTS)
- Neural Processing Unit (NPU) Acceleration
- Offline AI Inference
- Full-Duplex UART Communication
Github For Dataset and Code - https://github.com/maitreya0106/sanchar-mitra
Step 1: Understanding the Problem

Speech- and hearing-impaired individuals often face communication barriers in:
- Schools
- Hospitals
- Government offices
- Workplaces
- Public spaces
Most people do not understand sign language, making everyday interactions difficult.
Traditional assistive technologies often require:
- Internet connectivity
- Smartphones
- Cloud processing
We wanted to build a system that works locally, protects privacy, and provides real-time communication assistance.
Step 2: Defining the Solution

An AI-powered communication assistant capable of:
- Sign Language → Speech
- Speech → Text
- AI-powered Conversations
The system enables real-time bidirectional communication between hearing and speech-impaired individuals and the rest of society.
Step 3: System Architecture

The complete system consists of:
Embedded Side
- STM32N6570-DK
- Onboard Camera
- LCD Display
- NPU Accelerated AI Model
Host Side
- Python Application
- Random Forest Classifier
- Speech-to-Text Engine
- Text-to-Speech Engine
- Local LLM
The embedded device performs hand landmark detection and sends landmark coordinates to the host system for gesture classification.
Step 4: Hardware Used
Components
- STM32N6570-DK Discovery Kit
- USB Type-C Cable
- Laptop
- Integrated LCD Display
- Camera Interface
- Speaker
- Microphone
Why STM32N6?
The STM32N6 features an integrated Neural Processing Unit (NPU) that allows efficient AI inference directly on embedded hardware.
This makes it ideal for Edge AI applications where low power consumption and privacy are important.
Step 5: Setting Up the STM32 Environment
The development environment was built using:
- STM32CubeIDE
- STM32CubeProgrammer
- STM32 Model Zoo
- X-CUBE-AI
Tasks Performed
- Flashing firmware
- Configuring camera pipeline
- Loading AI model
- Enabling UART communication
Step 6: Deploying the Hand Landmark Detection Model

We used the STM32 Model Zoo Hand Landmark model.
The model performs:
- Palm Detection
- Hand Tracking
- Landmark Extraction
The model outputs:
- 21 hand landmarks
- 63 coordinate values
All inference runs directly on the STM32N6 NPU.
Memory usage remains around 4.2 MB.
Step 7: UART Communication Pipeline
The extracted landmark values are streamed over UART.
Communication Parameters:
- Baud Rate: 115200
- Full Duplex Communication
The STM32 board continuously sends landmark coordinates to the host system.
Python files attached below( stt.py, uart.py )
Step 8: Building the Gesture Recognition Engine

The gesture recognition pipeline was developed using Python and Scikit-Learn.
Training Dataset Sources:
- ASL Kaggle Dataset
- Custom Collected Dataset
Training Process:
- Remove noisy Z-coordinate
- Use X and Y coordinates only
- Normalize landmarks
- Train Random Forest Model
Final Model:
- 200 Trees
- Confidence Filtering
- Two-Frame Confirmation Logic
Python files attached below(collect_all.py, collect_data.py, download_dataset.py, train_model.py)
Step 9: Sign Language to Speech

Once a gesture is recognized:
- Character is displayed on LCD
- Text is converted to speech
The speech synthesis system enables hearing individuals to understand the message instantly.
Step 10: Reverse Communication Mode

For reverse communication:
- Microphone captures speech
- Whisper-based STT processes audio
- Text is displayed for deaf users
This creates a complete two-way communication system.
Step 11: AI Interaction Mode

We extended the system by integrating:
- Ollama
- Llama 3.2
Recognized sentences can be sent to a local language model to provide contextual responses.
The AI assistant runs locally without cloud dependency.
Step 12: Testing and Results
Key Achievements:
- Real-time gesture recognition
- Bidirectional communication
- Offline operation
- Low-latency performance
- Privacy-preserving architecture
The project demonstrates how meaningful AI applications can run efficiently on embedded hardware.
Step 13: Impact on Society

Potential deployment areas:
- Schools
- Hospitals
- Government Offices
- Public Service Kiosks
- Smart Cities
- Workplace Accessibility
Sanchar Mitra demonstrates how technology can empower people who are often excluded from mainstream digital systems.
Step 14: Recognition and Learning Journey

.webp)
Sanchaar Mitra was awarded:
- Winner – ST Innovation Fair 2026
One of the most rewarding experiences during this project was presenting our work to industry leaders from STMicroelectronics, including Alessandro Cremonesi (CIO) and Giuseppe Desoli (Chief Architect of STM32N6).
Working with the STM32N6 platform showcased how powerful Edge AI can be when combined with a meaningful social purpose.
Step 15: Future Scope
Future improvements include:
- Fully standalone operation
- On-device STT
- On-device TTS
- More sign language vocabulary
- Battery-powered deployment
- Custom PCB
- Compact enclosure
Step 16: Conclusion

Sanchaar Mitra is more than a technical project.
It is an attempt to make AI:
- Inclusive
- Accessible
- Efficient
Because the true power of AI lies not in how advanced it is, but in who it empowers.
A better world is one where everyone can communicate, participate, and be heard.
Thank You for reading.