Research & Technical Projects

Reinforcement Learning: Cart-Pole Balancing

Built RL system from scratch (Python/NumPy) using Monte Carlo policy gradient (REINFORCE). Implemented custom physics simulation, neural network, and training loop. Agent learned to balance inverted pole purely through trial and error.

Tech: Python, NumPy | Result: Stable convergence without ML frameworks
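
Illustrative sketch only, not the project's code: Monte Carlo policy gradient weights each action's log-probability gradient by the discounted return that followed it. The snippet below shows that return computation plus the commonly used normalization step. The function names, the gamma default, and the normalization are assumptions made for illustration.

```python
def discounted_returns(rewards, gamma=0.99):
    """Return G_t = r_t + gamma * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def normalize(values, eps=1e-8):
    """Shift to zero mean and scale to unit variance (a common variance-reduction trick)."""
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / len(values)
    std = variance ** 0.5
    return [(v - mean) / (std + eps) for v in values]


# Cart-pole style episode: +1 reward for every step the pole stays up.
episode_rewards = [1.0, 1.0, 1.0]
weights = normalize(discounted_returns(episode_rewards, gamma=0.5))
```

Earlier steps receive larger returns because more reward follows them, so actions that kept the pole up longer are reinforced more strongly.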

Real-Time Dynamic Hand Gesture Recognition

Built production-viable gesture recognition pipeline using temporal convolutional network (TCN) for temporal modeling. Achieved ~90% accuracy on dynamic gestures at real-time latency. Integrated MediaPipe for RGB input.

Tech: PyTorch, MediaPipe, NVIDIA TAO | Demo Video
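
Illustrative sketch only, assuming the TCN follows the usual causal dilated-convolution design (the project's actual layers are not described here): each output depends only on current and past frames, which is what makes frame-by-frame real-time inference possible. The function names and the single-channel, pure-Python form are assumptions made for readability.

```python
def causal_dilated_conv1d(x, kernel, dilation=1):
    """y[t] = sum_k kernel[k] * x[t - k * dilation], treating x before t=0 as zero."""
    out = []
    for t in range(len(x)):
        acc = 0.0
        for k, weight in enumerate(kernel):
            idx = t - k * dilation
            if idx >= 0:  # never look at future samples or before the start
                acc += weight * x[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Number of past-and-present frames seen after stacking one conv per dilation."""
    return 1 + (kernel_size - 1) * sum(dilations)


signal = [1.0, 2.0, 3.0, 4.0]
smoothed = causal_dilated_conv1d(signal, kernel=[1.0, 1.0], dilation=2)
frames_seen = receptive_field(kernel_size=3, dilations=[1, 2, 4, 8])
```

Doubling the dilation at each layer grows the receptive field exponentially with depth, so a short stack can cover a whole gesture.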

WaveGAN: Generative Audio Synthesis

Implemented WaveGAN architecture in TensorFlow for raw audio waveform generation. Adapted DCGAN-style blocks for 1D time-domain signals.

Tech: TensorFlow, NumPy, Librosa

Age & Gender Estimation from CCTV

Solved Bosch challenge: demographic estimation from low-res CCTV footage. Simplified pipeline (removed face detection/super-resolution) achieved ~82% gender accuracy, ~66% age accuracy.

Tech: YOLOv5, ByteTrack | Demo

Multi-Agent RL & Game Theory

Mentored research project studying the emergence of cooperation in multi-agent environments. Implemented Q-learning agents that interact with fixed-strategy agents, drawing on evolutionary game theory principles.
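
Illustrative sketch only, not the project's code: a tabular Q-learning agent playing against one fixed-strategy opponent. The iterated prisoner's dilemma, the tit-for-tat opponent, the payoff values, and all hyperparameters are assumptions chosen for illustration. The evolutionary component is not modeled.

```python
import random

COOPERATE, DEFECT = 0, 1

# Learner's payoff for (learner_action, opponent_action): assumed prisoner's dilemma values.
PAYOFF = {
    (COOPERATE, COOPERATE): 3,
    (COOPERATE, DEFECT): 0,
    (DEFECT, COOPERATE): 5,
    (DEFECT, DEFECT): 1,
}


def train_against_tit_for_tat(steps=50_000, alpha=0.5, gamma=0.9, epsilon=0.3, seed=0):
    """Learn q[state][action], where state is the learner's own previous move."""
    rng = random.Random(seed)
    q = [[0.0, 0.0], [0.0, 0.0]]
    state = COOPERATE  # tit-for-tat opens by cooperating
    for _ in range(steps):
        # Epsilon-greedy action selection.
        if rng.random() < epsilon:
            action = rng.choice((COOPERATE, DEFECT))
        else:
            action = max((COOPERATE, DEFECT), key=lambda a: q[state][a])
        opponent_action = state  # tit-for-tat copies the learner's previous move
        reward = PAYOFF[(action, opponent_action)]
        next_state = action
        # Standard Q-learning update toward the one-step bootstrapped target.
        td_target = reward + gamma * max(q[next_state])
        q[state][action] += alpha * (td_target - q[state][action])
        state = next_state
    return q


q_table = train_against_tit_for_tat()
greedy_policy = [max((COOPERATE, DEFECT), key=lambda a: q_table[s][a]) for s in (COOPERATE, DEFECT)]
```

With these assumed payoffs and a discount factor of 0.9, sustained mutual cooperation is worth more than a one-off defection followed by retaliation, so the learned greedy policy cooperates in both states.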