I train Deep Neural Networks from the ground up - designing neural network architectures, writing training loops, and building sovereign foundation models from nothing. I prefer not to fine-tune existing LLMs - I create the recipe myself and train the foundation model on diverse datasets. My work spans AI research across Dense decoders, Mixture of Experts, and Encoder architectures, covering the full lifecycle from pre-training on raw text to post-training with SFT, RLHF, and Chain-of-Thought. I build GenAI products for both research and commercial use. I spend my hours with GPUs, turning compute into intelligence. I write to break down and simplify AI/ML and LLM concepts for the world - making the complex accessible. I also train students and provide technical consultations to industry leaders.
By profession, I am a Senior Technical Account Manager at Amazon - helping top-tier AWS customers architect, operate, and scale in the cloud and AI.
These are the projects I have built from the ground up - each designed to solve a real-world problem. In the course of these projects, I have trained multiple foundation models spanning Dense decoders, Mixture of Experts, and Encoder architectures.
In 2026, my research focuses on Deep Neural Networks and hidden layers, classifier-gated routing for cost optimization and performance, sovereign language models for Indian languages (Sanskrit, Hindi, Tamil, Bengali, Kannada), purpose-built small and tiny models for efficiency, data residency and compliance by design, and the intersection of compliance and AI architecture.
I hold a BSc in Physics (Honours) with Mathematics, graduating with a first-class grade - building the mathematical foundation that now drives my neural network research. I earned a Bachelor of Engineering degree in Computer Science with a CGPA of 8.4 in 2011, giving me the systems thinking to architect and train models at scale. I am a proud alumnus of the Indian Institute of Management (IIM) Calcutta - one of Asia's premier business schools. I am currently pursuing a Master of Arts in Philosophy, exploring the deeper questions of intelligence, cognition, and what it means for machines to "learn."
I operate at the intersection of two roles that complement each other. As a Senior Technical Account Manager at Amazon, I partner with C-Suite leaders to shape their cloud and AI strategy, drive informed decisions, and create opportunities that accelerate ROI. As an independent AI/ML researcher, I build foundation models from scratch (Dense decoders, Mixture of Experts, and Encoders), training them on raw text and turning compute into intelligence. The next pages cover my consulting, leadership, and sales journey.