Hemanth Reddy | New York, NY | MS CS @ NYU (May 2026)
New-grad software engineer focused on scalable backend systems and GenAI.
2+ years at Paytm building high-scale services, plus a 2025 Echostar internship delivering AWS pipelines, 3D visualization, and GenAI tooling. I love shipping fast, reliable products.
Education
MS CS, NYU
Experience
2+ years
Focus
Backend + Cloud
Profile snapshot
New Grad SWE
MS CS @ NYU (3.97/4). Former Paytm SDE with internship experience at Echostar. See experience below for details.
Looking for
New Grad SWE
Strengths
Backend + GenAI
// Experience
Industry experience across backend, cloud, and GenAI.

Software Systems Engineering Intern
- Architected an event-driven, SFTP-based drone data ingestion pipeline on AWS (Transfer Family, S3, Lambda, EventBridge, SQS) with 99.9% reliability and 97% cost reduction ($7.8M/yr).
- Automated 2D-to-3D photogrammetry workflows and built a Cesium 3D Tiles viewer with 3D site measurements using React and Python.
- Productionized with CloudFormation, CloudWatch, and least-privilege IAM for repeatable, secure deployments.

Software Development Engineer
- Led ONDC order services from launch to 1M+ orders, 5K+ merchants, 50K+ warehouses using Spring Boot/Node.js microservices with Kafka/RabbitMQ and MySQL/MongoDB.
- Cut cart errors by 75% and API latency by 40% via Redis caching, Kafka-based async workflows, SQL optimizations, and clearer error reporting.
- Built merchant/catalog rejection analytics (Hive + Django Admin) to surface seller-facing reasons, reducing seller NP JIRAs by 60%.
- Enabled zero-downtime deploys for 10+ microservices with Docker, Kubernetes (EKS), Spring Cloud Config; migrated EC2 to Graviton for 20% cost savings and strengthened observability with Prometheus/Grafana/Kibana (false alerts -40%).
// Projects
Projects that show how I build.
CliniPulse AI: On-Prem Cloud Medical Intelligence RAG
HIPAA-compliant platform with self-hosted LLaMA-3 (70B) on H100 and ChromaDB, cutting report generation from 30 min to under 2 min, on-prem S3 (AES-256).

LLM Fine-Tuning Factory
Fine-tuned 5 domain models (HR, Finance, Healthcare, Marketing, Sales) with Full/DPO/LoRA/QLoRA on H100, serving via OpenAI-compatible FastAPI + Open WebUI for a 70% cost reduction.

A2A Travel Orchestration
Agent-to-agent itinerary builder (LangChain, CrewAI, AutoGen) with concurrent debate and visualizations, 40% faster planning, deployed on Hugging Face Spaces.

Subscription Tracker MCP Server
MCP server with 16 tools integrating Gmail + MySQL, Claude extracts subscriptions with 90%+ accuracy and triggers 3-day renewal alerts to cut monthly costs by 25%.
Applicant Tracking & Recruiting Portal (ATS)
Scalable ATS for 500+ candidate profiles where automated pipeline tracking reduced recruiter screening time by 40% with optimized PostgreSQL queries.

GenAI Learning Assistant
RAG tutor with high relevance accuracy and sub-200ms responses using LangGraph + FastAPI, focused on quiz/document support.

Social Media Application
Full-stack platform supporting 1,000+ daily interactions with Redis caching and OAuth + JWT auth.

GridSense (NSF Research)
Power-grid telemetry pipeline with sub-500ms latency and D3 dashboards processing 75k+ records/min with improved outage prediction accuracy.

// Skills
A toolkit built for speed and clarity.
Languages
ToolkitBackend & Fullstack
ToolkitCloud (AWS)
ToolkitData & Messaging
ToolkitDevOps & Observability
ToolkitAI / ML
Toolkit// Education
Academic foundation with a strong CS focus.
May 2026
Master of Science, Computer Science
New York University, New York, NY
Jun 2022
Bachelor of Technology, Computer Science
IIT Dharwad, Dharwad, India