Surjyadip Sen | Field Notes
FILE: OP_VISI
STATUS: DEVELOPMENT

Vision Orator

STAMP: 2025.07.31

PLATE 01: SYSTEM ARCHITECTURE OVERVIEW // VISION ORATOR

Vision Orator supports inclusivity through real-time environmental awareness. It uses on-device inference to identify objects and gestures, then converts that visual information into spoken output via text-to-speech.

Built on privacy-first principles, the system runs all processing locally in the browser using MediaPipe and TensorFlow.js. This removes the need for internet connectivity during inference and avoids network round-trips, keeping response times low for safety-critical navigation.
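
The detection-to-speech step described above could look roughly like the following. This is a minimal sketch, not the project's code: the `Detection` shape, the `describeDetections` and `speak` helpers, and the 0.5 confidence threshold are all hypothetical, and the use of the browser's Web Speech API (`speechSynthesis`) is an assumption, since the notes do not name the text-to-speech engine.

```typescript
// Hypothetical detection shape; not taken from the project source.
interface Detection {
  label: string; // e.g. "chair"
  score: number; // model confidence in [0, 1]
}

// Turn the confident detections into one short sentence.
// The minScore default is an illustrative assumption.
function describeDetections(detections: Detection[], minScore = 0.5): string {
  const labels = detections
    .filter((d) => d.score >= minScore)
    .sort((a, b) => b.score - a.score)
    .map((d) => d.label);
  // Drop repeated labels, keeping the highest-scoring occurrence first.
  const unique = labels.filter((l, i) => labels.indexOf(l) === i);
  if (unique.length === 0) return "";
  if (unique.length === 1) return "Detected " + unique[0] + ".";
  const head = unique.slice(0, -1).join(", ");
  return "Detected " + head + " and " + unique[unique.length - 1] + ".";
}

// Speak the sentence with the Web Speech API when it is available.
// Returns false when there is nothing to say or no speech engine exists
// (for example, outside a browser).
function speak(text: string): boolean {
  const synth = (globalThis as any).speechSynthesis;
  const Utterance = (globalThis as any).SpeechSynthesisUtterance;
  if (!text || !synth || !Utterance) return false;
  synth.cancel(); // discard queued speech so the newest scene wins
  synth.speak(new Utterance(text));
  return true;
}
```

Keeping the sentence builder separate from the speech call means the wording can be checked without a browser, and cancelling queued utterances before speaking is one way to keep announcements current when the scene changes quickly.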

TECHNICAL RESOLUTION

Integrated MediaPipe pose and hand landmark detection, which reduces visual input to semantic points in space and so protects user privacy. Implemented a natural-sounding text-to-speech engine and a simplified UI/UX designed around accessibility standards.
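
To illustrate why landmarks help with privacy: once the points are extracted, downstream logic needs only coordinates, not pixels. The sketch below assumes the 21-point hand layout that MediaPipe documents (fingertips at indices 8, 12, 16, 20, with the PIP joints at 6, 10, 14, 18) and normalized image coordinates. The `extendedFingers` helper and its tip-above-joint heuristic are hypothetical and only hold for a roughly upright hand; they are not the project's gesture logic.

```typescript
// A normalized landmark: x and y relative to the image in [0, 1],
// z as relative depth.
interface Landmark {
  x: number;
  y: number;
  z: number;
}

// Tip and PIP joint indices in the 21-point hand model.
const FINGERS = [
  { name: "index", tip: 8, pip: 6 },
  { name: "middle", tip: 12, pip: 10 },
  { name: "ring", tip: 16, pip: 14 },
  { name: "pinky", tip: 20, pip: 18 },
];

// Report which fingers look extended, using coordinates only.
// Image y grows downward, so for an upright hand an extended finger
// has its tip above (smaller y than) its PIP joint.
function extendedFingers(hand: Landmark[]): string[] {
  if (hand.length !== 21) {
    throw new Error("expected 21 hand landmarks");
  }
  return FINGERS
    .filter((f) => hand[f.tip].y < hand[f.pip].y)
    .map((f) => f.name);
}
```

A result such as `["index"]` is all a gesture layer would need to store or announce, so the camera frame itself can be discarded as soon as the landmarks are computed.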

# RELATED OPERATIONS

OP_CRYP // ARCHIVED

CryptoDataAggregator

A high-performance cryptocurrency data ingestion and technical analysis pipeline built on TimescaleDB and Celery.

OP_EDGE // ARCHIVED

EdgeAI Hand Gesture Classifier

Autonomous, self-contained gesture recognition system using K-Nearest Neighbors (KNN) with SIMD hardware acceleration.
