AI & Computer Vision

Mini Projects in Artificial Intelligence

A selection of personal explorations at the intersection of deep learning, computer vision, and generative models.

🎯 MediaPipe 🎨 CycleGAN 🔍 Object Detection 🖼️ Segmentation PyTorch · TensorFlow
01
Pose Estimation

Real-Time Body Tracking with MediaPipe

Implementation of a real-time human pose estimation pipeline using Google's MediaPipe framework, detecting 33 body landmarks at high frame rates on standard webcam input.

33 keypoints tracked in real time from a webcam feed
Custom gesture recognition logic built on top of landmarks
Angle computation between joints for motion analysis
MediaPipe · Python · OpenCV · Pose Estimation
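The joint-angle computation listed above can be sketched in a few lines. This is a minimal illustration, not the project's actual code: the function name `joint_angle` and the example shoulder, elbow, and wrist coordinates are hypothetical, and the points are assumed to be (x, y) or (x, y, z) tuples such as the normalized landmark coordinates MediaPipe reports.

```python
import math


def joint_angle(a, b, c):
    """Return the angle in degrees at vertex b formed by the points a-b-c.

    Each point is a tuple of coordinates (2D or 3D), all of the same length.
    """
    # Vectors pointing from the joint (b) toward its two neighbours.
    ba = [ai - bi for ai, bi in zip(a, b)]
    bc = [ci - bi for ci, bi in zip(c, b)]

    norm_ba = math.sqrt(sum(v * v for v in ba))
    norm_bc = math.sqrt(sum(v * v for v in bc))
    if norm_ba == 0 or norm_bc == 0:
        raise ValueError("neighbouring points must differ from the joint")

    # Cosine of the angle via the dot product, clamped against rounding error.
    cos_angle = sum(x * y for x, y in zip(ba, bc)) / (norm_ba * norm_bc)
    cos_angle = max(-1.0, min(1.0, cos_angle))
    return math.degrees(math.acos(cos_angle))


# Hypothetical landmark positions (normalized image coordinates).
shoulder, elbow, wrist = (0.50, 0.30), (0.50, 0.50), (0.70, 0.50)
elbow_angle = joint_angle(shoulder, elbow, wrist)
```

Measuring the angle at the middle point keeps the result independent of where the person stands in the frame, which is one reason this form is convenient for motion analysis.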
[Image: CycleGAN Van Gogh style transfer result]
02
Generative Models

Image-to-Image Translation with CycleGAN

Exploration of unpaired image-to-image translation using the CycleGAN architecture. The model learns bidirectional mappings between two visual domains without paired training data.

Horse ↔ Zebra translation trained from scratch
Cycle-consistency loss to enforce coherent mappings
Extended to medical image style transfer experiments
CycleGAN · PyTorch · GAN · Style Transfer
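To make the cycle-consistency idea concrete, here is a toy sketch in plain Python rather than PyTorch. It assumes images are flattened lists of pixel values and that the two generators are ordinary callables. The names `g_ab`, `g_ba`, and `cycle_consistency_loss` are illustrative, not taken from the project. The loss is the usual L1 reconstruction error after a round trip through both generators, summed over the two directions.

```python
from typing import Callable, Sequence

Image = Sequence[float]  # toy stand-in: a flattened image as pixel values
Generator = Callable[[Image], Image]


def l1_distance(a: Image, b: Image) -> float:
    """Mean absolute difference between two equally sized flattened images."""
    if len(a) != len(b):
        raise ValueError("images must have the same number of pixels")
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_consistency_loss(
    g_ab: Generator,
    g_ba: Generator,
    batch_a: Sequence[Image],
    batch_b: Sequence[Image],
) -> float:
    """L1 cycle loss: A -> B -> A should return to A, and B -> A -> B to B."""
    forward = sum(l1_distance(g_ba(g_ab(a)), a) for a in batch_a) / len(batch_a)
    backward = sum(l1_distance(g_ab(g_ba(b)), b) for b in batch_b) / len(batch_b)
    return forward + backward


# Toy generators: doubling and halving are exact inverses, so the loss is zero.
double = lambda img: [2 * p for p in img]
halve = lambda img: [p / 2 for p in img]
perfect = cycle_consistency_loss(double, halve, [[1.0, 3.0]], [[2.0, 4.0]])
```

In a real training loop this term would be weighted and added to the adversarial losses of both generators. The sketch only shows why mappings that fail to invert each other are penalized.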
[Image: sample detection output with labeled boxes: Person 97%, Car 89%, Dog 74%]
03
Object Detection & Segmentation

Detection & Semantic Segmentation Pipeline

Implementation and benchmarking of object detection models (YOLO, Faster R-CNN) combined with semantic segmentation approaches to understand scenes at pixel-level granularity.

YOLOv8 fine-tuned on a custom dataset
Semantic segmentation with DeepLabV3+
mAP & IoU metrics tracked across experiments
YOLO · Faster R-CNN · Segmentation · PyTorch · DeepLabV3+
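The IoU metric mentioned above is simple enough to write out by hand. The sketch below assumes axis-aligned boxes given as (x1, y1, x2, y2) corner coordinates. The function name `box_iou` and the example boxes are illustrative, and the project itself may well rely on a library implementation instead.

```python
def box_iou(box_a, box_b):
    """Intersection over Union of two axis-aligned boxes (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Width and height of the overlap region, zero when the boxes are disjoint.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    intersection = inter_w * inter_h

    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - intersection

    # Guard against degenerate (zero-area) boxes.
    return intersection / union if union > 0 else 0.0


# Hypothetical prediction and ground-truth boxes in pixel coordinates.
predicted = (0, 0, 2, 2)
ground_truth = (1, 1, 3, 3)
score = box_iou(predicted, ground_truth)
```

Detection benchmarks typically count a prediction as correct when its IoU with a ground-truth box exceeds a chosen threshold, and mAP is then computed from the resulting precision and recall values.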