01Completed 2025
Multimodal Sentiment Analysis
Built a production-grade AI system from scratch that quantifies emotions from short video content using multimodal analysis.
Overview
An end-to-end sentiment analysis platform that processes video, audio, and text simultaneously. The system breaks videos into utterances, analyzes each with deep learning models, and provides granular emotional insights through a modern dashboard.
Key Features
- Multimodal AI: Analyzes Video, Audio, and Text simultaneously for higher accuracy
- Granular Analysis: Breaks down videos into specific utterances (sentences) and analyzes each one
- 7 Emotion Classes: Detects Anger, Disgust, Fear, Joy, Neutral, Sadness, and Surprise
- Sentiment Detection: Classifies content as Positive, Neutral, or Negative
- Developer API: Provides a secure API with quota management for developers
- Modern Dashboard: A clean Next.js interface to upload videos and view detailed results
Impact & Metrics
Trained on 10k samples
Technologies
FRONTEND
Next.js (App Router)Tailwind CSSNextAuth.jsTypeScript
BACKEND
PyTorchOpenAI WhisperAWS SageMakerAWS S3
AI / ML
Multimodal Deep Learning