Skip to content

saivarunk/saivarunk

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

7 Commits
ย 
ย 

Repository files navigation

Hi ๐Ÿ‘‹, I'm Varun Kruthiventi

Staff Engineer ยท AI/ML Platform ยท LLM Infrastructure
Hyderabad, India ๐Ÿ‡ฎ๐Ÿ‡ณ

Building scalable, reliable, and cost-aware Generative AI systems at production scale

Website โ€ข LinkedIn


๐Ÿš€ About Me

Iโ€™m a Staff Engineer with 9.5+ years of experience designing and leading cloud-native AI/ML platforms, with a strong focus on LLMs, agentic systems, and observability.

My work centers on end-to-end LLM infrastructure โ€” from gateways and agent frameworks to production-grade observability โ€” enabling teams to ship reliable, governed, and cost-efficient Generative AI solutions with real business impact.

I enjoy solving hard platform problems where scale, reliability, cost, and developer experience intersect.


๐Ÿง  What I Focus On

  • LLM Platform Architecture
    Gateways, routing, model abstraction layers, and LLM Ops

  • Agentic Systems
    Multi-agent orchestration, tool execution, and stateful workflows

  • LLM Observability & Governance
    Cost tracking, evaluation, tracing, and policy enforcement

  • Cloud-Native ML Systems
    Scalable deployments on AWS with Kubernetes-first design

  • Production AI Enablement
    Helping teams ship GenAI safely, predictably, and confidently


๐Ÿ› ๏ธ Key Expertise

๐Ÿค– LLM & Agent Systems

  • LLM gateways & inference platforms
  • Agent frameworks: LangGraph, SmolAgents
  • AWS Bedrock: Agents & Strands
  • Prompt engineering, evaluation, and cost optimization

๐Ÿ“Š Observability & LLM Ops

  • Langfuse, Arize Phoenix
  • OpenTelemetry for traces, metrics, and logs
  • Token usage, latency, quality, and governance tracking

โ˜๏ธ MLOps & Infrastructure

  • AWS, Databricks
  • Kubernetes & Docker
  • CI/CD for ML systems
  • Secure, multi-tenant platform design

๐Ÿ’ป Programming & Systems

  • Python (primary)
  • Rust (systems & performance)
  • Data engineering & large-scale NLP systems

๐Ÿงฐ Languages & Tools


โœ๏ธ Writing & Knowledge Sharing

  • ๐Ÿ“ I regularly write about AI platforms, LLM systems, and engineering best practices
    ๐Ÿ‘‰ https://varunk.me/

  • ๐Ÿ’ฌ Happy to discuss: LLMs, Agentic AI, LLM Ops, MLOps, Kubernetes, Platform Architecture


๐Ÿ“ซ Reach Me


Building production-grade AI systems that balance innovation, reliability, and real-world value.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published