Abhishek Dey
I train Deep Neural Nets from Scratch
Independent AI/ML Researcher

I train Deep Neural Networks from the ground up - designing neural network architectures, writing training loops, and building sovereign foundation models from nothing. I don't prefer to fine-tune on top of existing LLMs - I create the recipe from the ground up and train the foundation model with diversified datasets. My work spans AI research - Dense decoders, Mixture of Experts, and Encoder architectures - covering the full lifecycle from pre-training on raw text to post-training with SFT, RLHF, and Chain-of-Thought. I build GenAI products for both research and commercial use. I spend my hours with GPUs, turning compute into intelligence. I write to break down and simplify AI/ML and LLM concepts for the world - making the complex accessible. I also train students and provide technical consultations to industry leaders.

By profession, I am a Senior Technical Account Manager at Amazon - helping top-tier AWS customers architect, operate, and scale in the cloud and AI.

Pet Projects - Research and Use

These are the projects I have built from ground up - each designed to solve a real-world problem. During these projects, I have trained multiple foundation models spanning Dense decoders, Mixture of Experts, and Encoder architectures.

Featured Writing

Research

In 2026, I am researching on - Deep Neural Networks and hidden layers, classifier-gated routing for cost optimization and performance, sovereign language models for Indian languages (Sanskrit, Hindi, Tamil, Bengali, Kannada), purpose-built small and tiny models for efficiency, data residency and compliance by design, and the intersection of compliance and AI architecture.

Videos

Neural Networks Explained
Building GPT from Scratch
Attention Mechanism
Backpropagation
Transformers Explained
Word Embeddings
Tokenization
LLM Training

Papers & Publications

Automating AWS Application Load Balancer Capacity Unit Reservation [AWS - 2023]
Automating ALB capacity planning with reserved capacity units for predictable workloads on AWS.
Visualize AWS Network Firewall Logs with Amazon QuickSight Dashboards [AWS - 2022]
Building real-time security visibility by visualizing Network Firewall logs through QuickSight dashboards.
How to Integrate Amazon CloudWatch Alarms with Atlassian Confluence [AWS - 2021]
Bridging monitoring and documentation by connecting CloudWatch alarms to Confluence knowledge articles.

Academic

I hold a BSc in Physics (Honours) with Mathemetics, graduated with with first class grade - building the mathematical foundation that now drives my neural network research. I earned a Bachelor of Engineering degree in Computer Science with 8.4 CGPA in 2011, giving me the systems thinking to architect and train models at scale. I am a proud alumnus of the Indian Institute of Management (IIM) Calcutta - one of Asia's premier business schools. Currently pursuing a Master of Arts in Philosophy, exploring the deeper questions of intelligence, cognition, and what it means for machines to "learn."

Work Experience

Amazon Web Services
Tech Account Manager (Cloud & AI) · 2020 - Now (2026)
Helping top-tier AWS customers architect, operate, and scale in the cloud and AI. Driving operational excellence for enterprise accounts across infrastructure, security, and cost optimization.
Deloitte
Data and AI Consultant · 2018 - 2020
Delivered data and AI consulting engagements for enterprise clients. Designed cloud-native analytics solutions and guided digital transformation initiatives.
Larsen & Toubro Infotech
Lead Networks · 2017 - 2018
Led network infrastructure design and operations for enterprise clients. Managed large-scale deployments across routing, switching, and security domains.
Accenture
Sr. Analyst · 2015 - 2017
Delivered technology consulting and infrastructure management services for global enterprise accounts.
HCL Technology
Specialist · 2012 - 2015
Managed enterprise network operations and infrastructure support for global clients across multiple geographies.

Featured Talks

AWS re:Invent 2025
Las Vegas · Dec 2025
Spoke on cost-efficient LLM routing for regulated industries
AWS Summit Mumbai
Mumbai · May 2025
Panel on sovereign AI models for Indian enterprises
GenAI Meetup Bangalore
Bangalore · Mar 2025
Workshop on training transformers from scratch
IIM Calcutta Tech Talk
Kolkata · Jan 2025
Guest lecture on neural network architectures and industry applications
DevOps India Summit
Delhi · Nov 2024
Deploying ML models at scale with ECS and CloudFormation
Cloud Native Conf
Hyderabad · Sep 2024
Building compliance-first AI architectures for FSI
PyData Mumbai
Mumbai · Jul 2024
From networks to neural networks - a practitioner's journey
My Talk

I operate at the intersection of two roles that complement each other. As a Senior Technical Account Manager at Amazon, I partner with C-Suite leaders to shape their cloud and AI strategy, drive informed decisions, and create opportunities that accelerate ROI. As an independent AI/ML researcher, I build foundation models from scratch, Dense decoders, Mixture of Experts, Encoders, training them on raw text and turning compute into intelligence. The next pages cover my consulting, leadership, and sales journey.

1 2 3