Ajay
Kannan.
I build production data platforms, ML pipelines, and generative AI systems — where reliability matters as much as intelligence.

In one paragraph
Senior software engineer with a strong data-engineering and machine-learning background. Delivered a production data platform for a major utility — boosting processing throughput by 20% and cutting system errors by 15% through automated pipelines and monitoring. Built image-analysis models hitting 94% accuracy on medical data, with deep experience in Python, PyTorch, and scalable backend services. Looking to apply this blend of data-platform and AI work to drive reliable, high-performance solutions.
The short version
Born in Chennai. Educated at Anna University and Arizona State. Now building in the Bay Area.
The thread that connects IoT sensors, gesture recognition, medical imaging, and utility-scale data platforms is the same one — curiosity about systems that have to work in the real world.
I grew up in Chennai, where I first fell in love with building things — wiring Arduino boards, training accelerometers to read sign language gestures, and leading an IoT research group before I had a degree to show for it.
That curiosity carried me to two summers at Microsoft, where I shipped sensor-driven ML for agriculture and helped integrate Cortana with a shopping platform — my first taste of building at scale, and of the gap between a model that works in a notebook and one that works in production.
I crossed continents for grad school at Arizona State, researching few-shot learning for immunology — a problem where data is scarce and the stakes are real. That tension between limited signal and high consequence shaped how I think about ML systems today.
Now at Pacific Gas and Electric in the Bay Area, I design the data platform and ML infrastructure behind utility operations — pipelines, anomaly detection, observability, and increasingly, generative AI that has to earn its place in production. The work is less glamorous than a chatbot demo, but reliability is the product.
The journey from Chennai to Santa Clara was not a straight line. It was an odyssey — shaped by curiosity, cross-cultural leaps, and the stubborn belief that the most interesting problems live where engineering meets the real world.
Papers & posters
Few-Shot Learning for TCR–Epitope Binding Affinity Prediction
Developed contrastive loss for few-shot learning in TCR–Epitope affinity prediction (PyTorch), improving accuracy by 30% over baselines via novel architectural and loss-function design. Implemented a Siamese network to evaluate efficiency across K-values and validate model performance over varying data distributions.
Where I've built
Pacific Gas & Electric
- Designed and deployed a production data platform (Electric Productivity Tracker) for large-scale utility datasets — improved processing throughput ~20% and reduced system errors ~15% through backend automation, monitoring, and reliability engineering across distributed systems.
- Engineered scalable, fault-tolerant data pipelines integrating heterogeneous sources (GIS, PostgreSQL, internal services) with consistent low-latency access and continuous stream processing.
- Built automated backend validation and testing infrastructure using Python services and REST interfaces — cut deployment verification time by 50% via comprehensive unit, integration, and end-to-end suites.
- Implemented end-to-end ETL, validation, and transformation workflows across distributed systems, improving data integrity and reducing manual ops overhead.
- Automated extraction and processing of the Distribution Operations Toolset (DOT) — a daily-updated, macro-enabled Excel system containing nested map-creation data — using Python and JavaScript to filter new maps, extract the relevant fields, and feed them into the map-creation pipeline, eliminating manual data entry and reducing processing errors.
- Unified three separate map-creation processes into a single streamlined workflow — consolidating fragmented operations and improving overall efficiency, consistency, and maintainability across the team.
- Applied ML and generative AI in production to improve anomaly detection reliability — reduced false positives ~30% via model-assisted signal validation, statistical inference, and system-level monitoring.
- Shipped observability and alerting (health checks, dashboards, centralized logging) across servers and databases — cut issue detection and resolution time by ~50%.
- Collaborated with cross-functional teams in design and code reviews, and produced technical documentation — architecture diagrams, API specifications, and runbooks — in Confluence, accelerating feature rollout and reducing onboarding time.
Sigtuple Inc.
- Applied image processing techniques to biological data using PyTorch, implementing Siamese Networks, GANs, and Neural Style Transfer — enabling more accurate medical image analysis of cellular structures.
- Used alpha blending to recreate histology slide images, accurately representing biological particles and membranes.
- Achieved 94% accuracy in image blending efficiency through innovative generative architectures.
- Built production software in Python with object-oriented design and comprehensive unit and integration tests, resulting in a stable system that reduced deployment errors.
Microsoft
- Utilised ML algorithms for the FarmBeats agricultural platform using Python and PyTorch — identified sensor-data patterns to enhance crop-yield predictions.
- Built a GIS-based heat-map generation system for FarmBeats — real-time spatial visualization of multi-sensor data (soil moisture, temperature, humidity).
- Contributed to the Shopping on Cortana project, integrating Cortana with a shopping platform to improve user experience.
- Developed backend services with Cassandra DB and C# as part of the Foundry Team in MS-IDC, building RESTful APIs for distributed data storage.
Everything on GitHub
A complete index of my public repositories — from production-style LLM systems to graduate coursework and the occasional weekend experiment. Most recent first.
View on GitHub ↗Tools of the trade
Programming Languages
- Python (Expert)
- C++
- Java
- C#
- JavaScript
- SQL
- Bash
ML / DL Frameworks
- PyTorch
- TensorFlow
- Hugging Face
- Keras
- NumPy
- Scikit-learn
- Pandas
LLM & AI Tools
- LangChain
- LLM Integration
- Prompt Engineering
- RAG Pipelines
Backend Development
- REST APIs
- FastAPI
- Node.js
- Express.js
- Spring Boot
- Microservices
- GraphQL
Databases
- PostgreSQL
- MongoDB
- MySQL
- Cassandra
- NoSQL
Cloud & DevOps
- AWS (EC2, S3, Lambda)
- Azure
- Docker
- Kubernetes
- Git
- CI/CD
Computer Science
- Data Structures
- Algorithms
- System Design
- Distributed Systems
- OOP
JAX · AWS Bedrock · Vector Databases (Pinecone) · CUDA Programming
Where I studied
Arizona State University, Tempe
Anna University, Chennai
Let's
build
something.
Open to conversations about ML infrastructure, generative AI in production, or anything in the neighborhood.