ML/AI • Cloud • DevOps

Hi, I am Siddharth.

From shipping scalable AI systems, cloud‑native infra, and seriously fast CI/CD, I build production‑grade ML/AI applications and AWS‑first architectures. Strong in observability, automation, and cost‑efficient design.

~10 tok/sec
LLM inference on AWS Lambda (C binaries / llama.cpp)
45+ models
Unified routing & access via Python lib
15× faster
28 Docker images: 30m → 2m via parallel builds

Recent Work

ML Ops & AI Development Engineer
Enterprise Technology — Tempe, AZ
  • LLM & AI Application Development:
    • Engineered a Python library facilitating access to over 45 Large Language Models (LLMs), with routing across multi-modal pipelines using AWS Lambda and API Gateway.
    • Deployed an LLM inference solution on AWS Lambda using llama.cpp C binaries, achieving 10 tokens/sec.
    • Built a multi-threaded Python app for Raspberry Pi integrating text-to-speech, speech-to-text, facial recognition, and a local LLM.
    • Fine-tuned LLMs and created proxy servers for a Model-as-a-Service framework to host and route open-source models.
    • Implemented rate-limiting and caching with Lua scripting on ElastiCache Redis using sliding windows.
  • Cloud Infrastructure & DevOps:
    • Provisioned scalable, multi-stage AWS infrastructure using Terraform.
    • Architected CI/CD pipelines with parallel Docker builds on Jenkins + Kubernetes, cutting build time for 28 images from 30m to 2m.
    • Designed microservices using AWS Lambda, DynamoDB, S3, API Gateway; integrated OAuth2 with Google Drive.
    • Implemented observability via SNS alerts; reduced downtime and improved traceability.
    • Automated secure cross-cloud provisioning across AWS and GCP.
    • Integrated Redshift and DynamoDB using zero‑ETL to build analytics-ready datamarts.
    • Developed Firecrawl, a caching layer for web crawling to reduce external API usage and cost.
Graduate Student Researcher — Thesis
Bio‑Inspired Robotics, Technology & Healthcare Lab — Tempe, AZ
  • Automated 180 friction experiments via custom 3‑axis rig & 6‑axis load cell (−40% setup time).
  • ROS2 PID control for UR‑16e robotic arm enabling adaptive load‑carrying tasks.
  • ROS‑based SpaceMouse controller for real‑time manipulation.

Selected Projects

Papertrail — Multi‑Modal OCR Document Organizer

Python • OCR • AWS • OAuth

Document organizer with fine‑grained access control & sharing. Multi‑modal OCR endpoint for accurate file→text conversion. Google OAuth2 (Drive & Picker) integration.

FastAPITesseract OCRLambdaAPI Gateway

Audio Projection Embeddings for RAG

PyTorch • CLAP • RAG

Projection model mapping 512→1536 vectors using CLAP on LibriSpeech to enable audio‑query RAG without transcription.

PyTorchEmbeddingsRAG

Dexterous Manipulation with a Robotic Hand

RL • Actor‑Critic • Python

Advantage‑Weighted Actor‑Critic implementation improved 6‑DoF manipulation success by ~20%.

AWACGymLinux

Multi‑Robot Search & Rescue

ROS2 • RTAB • OpenCV

Decentralized quadcopter swarm with Potential‑Field & Frontier Exploration for dynamic 3D mapping; validated 100×100 grid mapping in Gazebo while avoiding local minima.

ROS2GazeboRTAB

Fraud Detection — Statistical Analysis

Deep Learning • RNN • SMOTE

End‑to‑end pipeline with feature importance & statistical validation; ~97.2% accuracy.

TensorFlow/PyTorchOne‑HotImbalance

UAV Line‑Follower (Parrot Mambo)

Simulink • Edge Detection

Edge‑based HSV tracking with ~20 ms inference; ~95% accuracy across 40 tests; deployed via Simulink.

MATLABSimulinkHSV

Tech I use

Languages

Python, Embedded C/C++, SQL, Bash, Terraform, Groovy

Software & Platforms

Docker, ROS2, Jenkins, Git, Kubernetes

Frameworks

PyTorch, FastAPI, OpenCV, Tesseract OCR, llama.cpp

AWS (Core)

Lambda, API Gateway, S3, SQS, EC2, VPC, DynamoDB

AWS (Data)

Redshift, Glue, ElastiCache

Focus Areas

Model ServingMicroservicesObservability CI/CDCost Efficiency

Education

Arizona State University

Tempe, AZ —

M.S., Robotics & Autonomous Systems (Thesis)

Focus: Reinforcement Learning, Deep Learning, Multi‑Robot Systems, Optimal Control

D. J. Sanghvi College of Engineering

Mumbai, IN —

B.E., Mechanical

Leadership & Community

DJS KRONOS INDIA — Vice Captain

Mumbai, IN —
  • Led software & co‑led mechanical design for a 4WD ATV in Simulink; +17% efficiency; 2nd Best 4WD design award.
  • Built GSM SIM900 DAQ on Raspberry Pi Zero with ThingSpeak telemetry.

Hackathons — Evaluator & Mentor

Tempe, AZ —

Evaluated AI Innovation Hackathons & Spark Tank; mentored teams integrating AI capabilities.

AI Acceleration — Colleague

Tempe, AZ —
  • Approved architecture plans; guided implementation & code quality across APIs.
  • Led sprint planning & alignment between engineering and product.

Contact