Aditya Prasath

Education
M.S. Computer Science @ University of Illinois Urbana-Champaign
Current role
AI Engineering Intern @ StitchStudio
Technology focus
Distributed systems · GPU compute · AI infrastructure · production LLMs

I build backends that survive real traffic, CUDA stacks that earn their speedup, and LLM systems that ship in production, not demos.

01

Experience

Total Industry Experience (3+ Years)

Production software across telecom, AI platforms, and chartered accountancy workflow automation.

AI Engineering Intern

StitchStudio · Chicago, IL · Remote

Client: Internal Project

  • Building LangChain agents on Llama 3 with prompt-routing and state-machine orchestration.
  • RAG with FAISS: sub-100 ms retrieval over million-scale embeddings.
LLMs · LangChain · RAG · FAISS

Software Engineer

Cognizant Technology Solutions · Tamil Nadu, India

Client: Verizon

  • Owned microservices serving 8M+ requests/day; held 99.95% availability with circuit breakers and autoscaling.
  • Cut end-to-end latency by 42% with asynchronous Kafka messaging and SQL query-plan optimization.
  • Reduced incidents by 35% through Prometheus/Grafana monitoring; raised test coverage by 25%.
Kafka · Microservices · Prometheus · PostgreSQL

Software Analyst Intern

BSP & CO. · Tamil Nadu, India

Client: Internal Project

  • Owned automation initiatives for manual practitioner workflows at a chartered accountancy firm; scoped bottlenecks with partners and shipped pipeline tooling end to end.
  • Built workflow automations that replaced repetitive manual steps in client reporting and compliance prep, improving turnaround time and consistency.
  • Documented process maps and handoff runbooks so firm staff could operate and extend automations without engineering support.
Workflow Automation · Python · Process Design

Backend Platform Intern

SMZ & CO. · Kuala Lumpur · Remote

Client: Internal Project

  • Owned backend automation for manual task pipelines at a chartered accountancy firm, translating spreadsheet-driven audit and reporting work into production APIs and data flows.
  • Designed REST services and PostgreSQL schemas for 100K+ records/month; cut report query latency from minutes to under a second.
  • Worked directly with practitioners to map firm workflows, then delivered integrations that cut repetitive manual effort across deliverables.
PostgreSQL · REST · Workflow Automation · API Design
02

Selected work

Hands-on projects across inference, GPU kernels, agents, and cloud-native platforms.

01 CUDA · C++ · Nsight Compute

GPT-2 Inference Engine

GPT-2 forward pass from scratch: FlashAttention-2, KV-cache, and memory tiling profiled on NVIDIA A40.

  • CUDA
  • FlashAttention
  • KV-Cache
  • Nsight
02 GPU library

ThinkerCUDA

3D convolution and tiled matmul kernels with 6× throughput over a CPU baseline, achieved through memory coalescing and occupancy tuning.

  • CUDA
  • C++
  • HPC
03 LangChain · RAG

Audit Orchestrator

Agentic audit workflows with BMAD-METHOD; routes prompts by task complexity for throughput and accuracy.

  • LangChain
  • RAG
  • Agents
04 AWS · Kubernetes

Auto-Scaling Platform

EKS microservices behind an ALB with autoscaling; blue-green and canary releases for zero-downtime deploys.

  • AWS
  • EKS
  • Kubernetes
  • Docker
03

Research

Peer-reviewed edge AI and in-progress work on faster, leaner LLM agents.

UIUC CS 598 · In progress

PACT: Pruned Agent Call Throughput

Speculator model (Phi-3-mini) trims LLM over-deliberation; DPO alignment balances accuracy and latency.

LLM Agents · DPO · Speculative Decoding

Conference paper

Intelligent Surveillance over 5G Edge

Optimized real-time inference across edge and cloud for latency and bandwidth under 5G constraints.

Edge AI · 5G · Inference
04

Technical range

Languages and platforms I reach for when performance and reliability both matter.

01

Languages

Daily drivers

  • Python
  • C++
  • Java
  • JavaScript
  • Bash
  • SQL
02

Systems

Scale & reliability

  • High-throughput services
  • CUDA / GPU kernels
  • Kafka
  • Query optimization
  • Prometheus · Grafana
03

Cloud

Deploy & operate

  • AWS
  • Kubernetes
  • Docker
  • Microservices
  • REST
04

AI & data

Models & pipelines

  • Llama 3 · DeepSeek
  • LangChain
  • RAG · FAISS
  • GPT-2
  • Agentic Engineering
  • Spark
  • HDFS
  • ETL
05

Education

In Progress

M.S. Computer Science

University of Illinois Urbana-Champaign

Aug 2025 – Present

Parallel programming · Systems for GenAI · Applied ML · Cloud · LLMs

3.97 GPA

B.Tech, CS & Business Systems

SRM Institute of Science and Technology

Jun 2019 – Jul 2023

DSA · Compilers · DBMS · Networks · Automata

06

Awards

Dean's recognition for students who lead at scale across academics, sport, and campus life.

Dean's Award · Undergraduate cohort 2019–2023

Outstanding Contribution Award

SRM Institute of Science and Technology · Jun 2019 – May 2023

Core strength: Leadership skills

Awarded for sustained campus impact: scaling student operations, mentoring peers, and connecting technical communities with university-wide programs.

Impact

  • Led operations for 1,500+ technical and cultural events
  • First-class distinction across undergraduate academics
  • State-level cluster representative, badminton

Leadership roles

  • Head of Operations, White Hat Hackers Club
  • Discipline & Logistics Head, Association of Computer Science Engineers
  • Student Mentor & Director, Rotaract Club of SRM Vadapalani
07

Get in touch

Open to full-time roles in systems, AI infrastructure, autonomy, and GPU computing.

+1 217-249-4900 · Champaign, IL