Back to Projects
01Completed 2025

Multimodal Sentiment Analysis

Built a production-grade AI system from scratch that quantifies emotions from short video content using multimodal analysis.

Multimodal Sentiment Analysis

Overview

An end-to-end sentiment analysis platform that processes video, audio, and text simultaneously. The system breaks videos into utterances, analyzes each with deep learning models, and provides granular emotional insights through a modern dashboard.

Key Features

  • Multimodal AI: Analyzes Video, Audio, and Text simultaneously for higher accuracy
  • Granular Analysis: Breaks down videos into specific utterances (sentences) and analyzes each one
  • 7 Emotion Classes: Detects Anger, Disgust, Fear, Joy, Neutral, Sadness, and Surprise
  • Sentiment Detection: Classifies content as Positive, Neutral, or Negative
  • Developer API: Provides a secure API with quota management for developers
  • Modern Dashboard: A clean Next.js interface to upload videos and view detailed results

Impact & Metrics

Trained on 10k samples

Technologies

FRONTEND
Next.js (App Router)Tailwind CSSNextAuth.jsTypeScript
BACKEND
PyTorchOpenAI WhisperAWS SageMakerAWS S3
AI / ML
Multimodal Deep Learning
PreviousViral PostNext Viral Post