ML / AI Engineer · Computer Vision · Published Researcher · Original Dataset Author
I build ML/DL systems from scratch: hand-coded backpropagation, diffusion samplers, camera calibration via DLT+SVD, and biometric evaluation pipelines. ~5 years of experience across computer vision, generative modeling, NLP, and production software engineering.
Languages: Python · SQL · MATLAB · R · C · JavaScript
DL / ML: PyTorch · TensorFlow · scikit-learn · CNNs · LSTMs · Transformers · Diffusion (DDIM) · GANs · Autoencoders
Computer Vision: OpenCV · YOLO · Face Recognition · Biometrics (PAD) · Medical Imaging
Data & Infra: pandas · NumPy · Docker · Git · Linux · Supabase/Postgres · REST APIs · CUDA · HuggingFace · MLflow
Cloud: AWS Solutions Architect (Associate) · AWS Cloud Practitioner · AWS AI Practitioner
These are the highlights; all 23 repos are listed below.
| Project | What I Built | Key Result |
|---|---|---|
| DDIM Inpainting | From-scratch UNet + DDIM diffusion with v-prediction, self-conditioning, EMA, CoordConv | PSNR 14.41 on MNIST |
| Face PAD (Anti-Spoofing) | End-to-end video-level face attack detection. Subject-disjoint protocol, EER/ACER thresholds | ACER 0.029, Acc 98.1% |
| Blackberry-Lime Classifier | Original 6K-image dataset (published on Kaggle) + dual ResNet-18 classifier, with live app | 100% / 99% accuracy |
| Transformer from Scratch | Encoder + Decoder with sinusoidal positional encoding, multi-head attention on Yelp Polarity | 79% test, ROC-AUC |
| LSTM Stock + VaR | Next-day forecasting with Value-at-Risk quantification on Dropbox stock | MAPE 1.51%, R² 0.908 |
| From-Scratch MLP | Hand-coded forward pass + backpropagation + L2 reg, no autograd. MNIST + CelebA faces | 95% / 85% test |
| YOLOv3 Traffic Detection | Real-time detection on traffic video with NMS, custom IoU, FPS profiling | Real-time on 3 videos |
| Camera Calibration | DLT + SVD camera calibration from scratch, with intrinsic & extrinsic decomposition | Pure NumPy math |

"Cardiovascular Disease Prediction using ML", IJSREM, Vol. 07, Issue 08, Aug 2023
Co-authored · Cleveland Heart Disease DB · LogReg vs Random Forest · Flask web app
View Paper
Deep Learning & Generative Models
- ddim-mnist-inpainting: DDIM diffusion, UNet, v-prediction, self-conditioning
- transformer-from-scratch-seq: Transformer encoder+decoder on Yelp Polarity
- lstm-sentiment-analysis: LSTM vs GloVe+BiGRU on IMDB 50K (81.8%)
- anomaly-detection-autoencoders: Dense AE on NAB CloudWatch data
- lstm-stock-forecasting-with-var: LSTM + Value-at-Risk on stock data
- covid-mlp: MLP on CDC COVID data (87.58%)
Computer Vision & Biometrics
- face-pad-msu-mfsd: Video-level face anti-spoofing (PAD)
- faces-can-lie-identity-analysis: ArcFace embeddings + clustering
- traffic-object-detection-yolov3: Real-time YOLOv3 detection
- camera-geometry-calibration: DLT camera calibration from scratch
- rgb-channel-alignment: ORB + RANSAC homography alignment
- blackberry-lime-visual-classifier: Original dataset + deployed app
- octmnist-retinal-disease-cnn: Medical imaging CNN (91.3%)
- vgg16-vs-resnet18-64x64: From-scratch VGG vs ResNet
- cnn-multiclass-28x28: 36-class CNN, 3-optimizer comparison
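For readers unfamiliar with the DLT + SVD approach named in the camera-geometry-calibration entry, here is a minimal NumPy sketch of the general technique. It is not taken from the repository: the function names `dlt_projection_matrix` and `decompose_projection` and the synthetic camera parameters are all hypothetical, and the sketch assumes noise-free, non-coplanar correspondences and skips coordinate normalization.

```python
import numpy as np

def dlt_projection_matrix(world_pts, image_pts):
    """Estimate a 3x4 projection matrix (up to scale) from >= 6 3D-2D pairs."""
    rows = []
    for (X, Y, Z), (u, v) in zip(world_pts, image_pts):
        Xh = [X, Y, Z, 1.0]
        # Each correspondence gives two linear equations in the 12 entries of P.
        rows.append(Xh + [0.0] * 4 + [-u * c for c in Xh])
        rows.append([0.0] * 4 + Xh + [-v * c for c in Xh])
    # Null vector of the system: right singular vector of the smallest singular value.
    _, _, Vt = np.linalg.svd(np.asarray(rows))
    return Vt[-1].reshape(3, 4)

def decompose_projection(P):
    """Split P ~ K [R | t] into intrinsics K, rotation R, translation t."""
    flip = np.flipud(np.eye(3))
    # RQ decomposition of the left 3x3 block, built from NumPy's QR.
    Q, U = np.linalg.qr((flip @ P[:, :3]).T)
    K = flip @ U.T @ flip
    R = flip @ Q.T
    # Force a positive diagonal on K.
    S = np.diag(np.sign(np.diag(K)))
    K, R = K @ S, S @ R
    # P is only known up to sign; pick the sign that makes R a proper rotation.
    if np.linalg.det(R) < 0:
        R, P = -R, -P
    t = np.linalg.solve(K, P[:, 3])
    return K / K[2, 2], R, t

# Synthetic check with made-up camera parameters (assumptions for illustration).
K_true = np.array([[800.0, 0.0, 320.0], [0.0, 800.0, 240.0], [0.0, 0.0, 1.0]])
a = 0.3
R_true = np.array([[np.cos(a), 0.0, np.sin(a)],
                   [0.0, 1.0, 0.0],
                   [-np.sin(a), 0.0, np.cos(a)]])
t_true = np.array([0.1, -0.2, 5.0])
world = np.random.default_rng(0).uniform(-1.0, 1.0, size=(8, 3))
pix = (world @ R_true.T + t_true) @ K_true.T
image = pix[:, :2] / pix[:, 2:3]
K_est, R_est, t_est = decompose_projection(dlt_projection_matrix(world, image))
```

With real, noisy detections one would typically normalize the point coordinates before building the linear system and refine the result with a nonlinear reprojection-error fit; both are omitted here to keep the sketch short.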
Classical ML (From Scratch, No sklearn)
- mnist-logreg-svm: Hand-coded LogReg + Softmax + SVM sweep (98.32%)
- lda-qda-ridge-regression: LDA/QDA/Ridge all from scratch
- handwritten-digits-and-faces-mlp: MLP with manual backprop
- kmeans-farthest-first-init: Farthest-first K-Means (3.4× speedup)
- adult-census-income-prediction: NB/DT/RF/XGBoost pipeline
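As a rough illustration of the farthest-first idea behind the kmeans-farthest-first-init entry, the sketch below picks each new center as the point farthest from the centers chosen so far. It is one common formulation, not the repository's code; `farthest_first_init`, its `seed` parameter, and the toy blobs are hypothetical.

```python
import numpy as np

def farthest_first_init(X, k, seed=0):
    """Pick k initial centers: a random first one, then repeatedly the farthest point."""
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    centers = [X[rng.integers(len(X))]]
    # Distance from every point to its nearest already-chosen center.
    nearest = np.linalg.norm(X - centers[0], axis=1)
    for _ in range(1, k):
        idx = int(np.argmax(nearest))
        centers.append(X[idx])
        nearest = np.minimum(nearest, np.linalg.norm(X - X[idx], axis=1))
    return np.array(centers)

# Toy data: three tight, well-separated blobs (made up for illustration).
rng = np.random.default_rng(1)
blob_means = np.array([[0.0, 0.0], [10.0, 0.0], [0.0, 10.0]])
points = np.vstack([m + 0.1 * rng.standard_normal((50, 2)) for m in blob_means])
centers = farthest_first_init(points, k=3)
```

On well-separated data like this, each initial center lands in a different blob, which is the property that tends to cut the number of K-Means iterations. The method is sensitive to outliers, since an outlier is by definition far from everything.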
Research & Numerical Methods
- nanoglass-md-preprocessor: MATLAB + LAMMPS for Cu-Zr nanoglass MD (B.Tech thesis)
- predictive-health-assessment: K-Means health risk stratification (R, 100K records)
- Mini-Projects: Gauss-Newton, Monte Carlo/LHS, Newton-Raphson, ARIMA, time-series
Master of Science in Engineering Science (Data Science), University at Buffalo (SUNY), 2024–2025
Bachelor of Technology, IIT Bhubaneswar, 2019–2023
Building models that see, think, and act intelligently.