Project

How We Built Sanchar Mitra
AI for All. AI for Social Good.
Step 1: Understanding the Problem
Step 2: Defining the Solution
Step 3: System Architecture
Step 4: Hardware Used
Step 5: Setting Up the STM32 Environment
Step 6: Deploying the Hand Landmark Detection Model
Step 7: UART Communication Pipeline
Step 8: Building the Gesture Recognition Engine
Step 9: Sign Language to Speech
Step 10: Reverse Communication Mode
Step 11: AI Interaction Mode
Step 12: Testing and Results
Step 13: Impact on Society
Step 14: Recognition and Learning Journey
Step 15: Future Scope
Step 16: Conclusion

Downloads

Credit

Comments

2.5K

Report This Project

Sanchar Mitra - a Communication Assistant That Enables Real-time Bidirectional Interaction Between Speech and Hearing-impaired

Published Jun 15, 2026

Apache-2.0

20 hours to build

Expert

Sanchar Mitra is an Edge AI-powered sign language translation system that enables real-time communication between hearing-impaired individuals and the general public. By combining computer vision, machine learning, offline speech recognition, and text-to-speech technologies, it delivers low-latency, accessible communication directly on embedded hardware without relying on cloud connectivity.

Components Used
Microphone Microphones microphone, 4 mmx1.5mm, electret condenser, noise cancelling, solder pads, 1 Vdc	X 1
Speaker Speakers & Transducers 28 mm, Round Frame, 0.25 W, 32 Ohm, Neodymium Magnet, PET Cone, Speaker	X 1
STM32N6570-DK	X 1
Camera module / webcam	X 1

Description

How We Built Sanchar Mitra

What does a better world look like?

To us, a better world is one where no one is excluded from communication.

According to the Census of India, more than 5 million individuals live with hearing and speech disabilities. Despite rapid advances in Artificial Intelligence, many assistive technologies still depend on cloud services, smartphones, and internet connectivity. This creates barriers in accessibility, privacy, and affordability.

At the same time, modern AI systems are becoming increasingly resource-hungry, relying on powerful GPUs and large cloud infrastructure.

This inspired us to develop Sanchaar Mitra, an AI-powered communication assistant built on Edge AI principles. The project demonstrates how intelligent systems can run efficiently on embedded hardware while creating meaningful social impact.

Our goal is simple:

AI for All. AI for Social Good.

Hardware

STM32N6570-DK Discovery Kit
USB Type-C Cable
Laptop / PC for host-side processing
Onboard Camera (STM32N6570-DK)
Microphone
Speaker / Audio Output Device
Integrated LCD Display
UART Communication Interface

Software & Development Tools

STM32CubeIDE
STM32CubeProgrammer
STM32 Model Zoo
X-CUBE-AI
Python
Scikit-learn
Vosk (Offline Speech-to-Text)
Ollama
Llama 3.2
PySerial

Machine Learning Resources

ASL Alphabet Hand Landmark Dataset (Kaggle)
Custom Sign Language Gesture Dataset
Random Forest Classifier

Optional Future Enhancements

Raspberry Pi 5
Battery Pack
Dedicated Camera Module
Custom PCB
3D Printed Enclosure

Key Technologies Used

Edge AI
Embedded Machine Learning
Computer Vision
Sign Language Recognition
Speech-to-Text (STT)
Text-to-Speech (TTS)
Neural Processing Unit (NPU) Acceleration
Offline AI Inference
Full-Duplex UART Communication

Github For Dataset and Code - https://github.com/maitreya0106/sanchar-mitra

Step 1: Understanding the Problem

Speech- and hearing-impaired individuals often face communication barriers in:

Schools
Hospitals
Government offices
Workplaces
Public spaces

Most people do not understand sign language, making everyday interactions difficult.

Traditional assistive technologies often require:

Internet connectivity
Smartphones
Cloud processing

We wanted to build a system that works locally, protects privacy, and provides real-time communication assistance.

Step 2: Defining the Solution

An AI-powered communication assistant capable of:

Sign Language → Speech
Speech → Text
AI-powered Conversations

The system enables real-time bidirectional communication between hearing and speech-impaired individuals and the rest of society.

Step 3: System Architecture

The complete system consists of:

Embedded Side

STM32N6570-DK
Onboard Camera
LCD Display
NPU Accelerated AI Model

Host Side

Python Application
Random Forest Classifier
Speech-to-Text Engine
Text-to-Speech Engine
Local LLM

The embedded device performs hand landmark detection and sends landmark coordinates to the host system for gesture classification.

Step 4: Hardware Used

Components

STM32N6570-DK Discovery Kit
USB Type-C Cable
Laptop
Integrated LCD Display
Camera Interface
Speaker
Microphone

Why STM32N6?

The STM32N6 features an integrated Neural Processing Unit (NPU) that allows efficient AI inference directly on embedded hardware.

This makes it ideal for Edge AI applications where low power consumption and privacy are important.

Step 5: Setting Up the STM32 Environment

The development environment was built using:

STM32CubeIDE
STM32CubeProgrammer
STM32 Model Zoo
X-CUBE-AI

Tasks Performed

Flashing firmware
Configuring camera pipeline
Loading AI model
Enabling UART communication

Step 6: Deploying the Hand Landmark Detection Model

We used the STM32 Model Zoo Hand Landmark model.

The model performs:

Palm Detection
Hand Tracking
Landmark Extraction

The model outputs:

21 hand landmarks
63 coordinate values

All inference runs directly on the STM32N6 NPU.

Memory usage remains around 4.2 MB.

Step 7: UART Communication Pipeline

The extracted landmark values are streamed over UART.

Communication Parameters:

Baud Rate: 115200
Full Duplex Communication

The STM32 board continuously sends landmark coordinates to the host system.

Python files attached below( stt.py, uart.py )

Step 8: Building the Gesture Recognition Engine

The gesture recognition pipeline was developed using Python and Scikit-Learn.

Training Dataset Sources:

ASL Kaggle Dataset
Custom Collected Dataset

Training Process:

Remove noisy Z-coordinate
Use X and Y coordinates only
Normalize landmarks
Train Random Forest Model

Final Model:

200 Trees
Confidence Filtering
Two-Frame Confirmation Logic

Python files attached below(collect_all.py, collect_data.py, download_dataset.py, train_model.py)

Step 9: Sign Language to Speech

Once a gesture is recognized:

Character is displayed on LCD
Text is converted to speech

The speech synthesis system enables hearing individuals to understand the message instantly.

Step 10: Reverse Communication Mode

For reverse communication:

Microphone captures speech
Whisper-based STT processes audio
Text is displayed for deaf users

This creates a complete two-way communication system.

Step 11: AI Interaction Mode

We extended the system by integrating:

Ollama
Llama 3.2

Recognized sentences can be sent to a local language model to provide contextual responses.

The AI assistant runs locally without cloud dependency.

Step 12: Testing and Results

Key Achievements:

Real-time gesture recognition
Bidirectional communication
Offline operation
Low-latency performance
Privacy-preserving architecture

The project demonstrates how meaningful AI applications can run efficiently on embedded hardware.

Step 13: Impact on Society

Potential deployment areas:

Schools
Hospitals
Government Offices
Public Service Kiosks
Smart Cities
Workplace Accessibility

Sanchar Mitra demonstrates how technology can empower people who are often excluded from mainstream digital systems.

Step 14: Recognition and Learning Journey

Sanchaar Mitra was awarded:

Winner – ST Innovation Fair 2026

One of the most rewarding experiences during this project was presenting our work to industry leaders from STMicroelectronics, including Alessandro Cremonesi (CIO) and Giuseppe Desoli (Chief Architect of STM32N6).

Working with the STM32N6 platform showcased how powerful Edge AI can be when combined with a meaningful social purpose.

Step 15: Future Scope

Future improvements include:

Fully standalone operation
On-device STT
On-device TTS
More sign language vocabulary
Battery-powered deployment
Custom PCB
Compact enclosure

Step 16: Conclusion

Sanchaar Mitra is more than a technical project.

It is an attempt to make AI:

Inclusive
Accessible
Efficient

Because the true power of AI lies not in how advanced it is, but in who it empowers.

A better world is one where everyone can communicate, participate, and be heard.

Thank You for reading.

Downloads
	All Codes Mentioned Above	Download