Skip to content
View saivarunk's full-sized avatar
💭
drowned into AI
💭
drowned into AI

Block or report saivarunk

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
saivarunk/README.md

Hi 👋, I'm Varun Kruthiventi

Staff Engineer · AI/ML Platform · LLM Infrastructure
Hyderabad, India 🇮🇳

Building scalable, reliable, and cost-aware Generative AI systems at production scale

WebsiteLinkedIn


🚀 About Me

I’m a Staff Engineer with 9.5+ years of experience designing and leading cloud-native AI/ML platforms, with a strong focus on LLMs, agentic systems, and observability.

My work centers on end-to-end LLM infrastructure — from gateways and agent frameworks to production-grade observability — enabling teams to ship reliable, governed, and cost-efficient Generative AI solutions with real business impact.

I enjoy solving hard platform problems where scale, reliability, cost, and developer experience intersect.


🧠 What I Focus On

  • LLM Platform Architecture
    Gateways, routing, model abstraction layers, and LLM Ops

  • Agentic Systems
    Multi-agent orchestration, tool execution, and stateful workflows

  • LLM Observability & Governance
    Cost tracking, evaluation, tracing, and policy enforcement

  • Cloud-Native ML Systems
    Scalable deployments on AWS with Kubernetes-first design

  • Production AI Enablement
    Helping teams ship GenAI safely, predictably, and confidently


🛠️ Key Expertise

🤖 LLM & Agent Systems

  • LLM gateways & inference platforms
  • Agent frameworks: LangGraph, SmolAgents
  • AWS Bedrock: Agents & Strands
  • Prompt engineering, evaluation, and cost optimization

📊 Observability & LLM Ops

  • Langfuse, Arize Phoenix
  • OpenTelemetry for traces, metrics, and logs
  • Token usage, latency, quality, and governance tracking

☁️ MLOps & Infrastructure

  • AWS, Databricks
  • Kubernetes & Docker
  • CI/CD for ML systems
  • Secure, multi-tenant platform design

💻 Programming & Systems

  • Python (primary)
  • Rust (systems & performance)
  • Data engineering & large-scale NLP systems

🧰 Languages & Tools


✍️ Writing & Knowledge Sharing

  • 📝 I regularly write about AI platforms, LLM systems, and engineering best practices
    👉 https://varunk.me/

  • 💬 Happy to discuss: LLMs, Agentic AI, LLM Ops, MLOps, Kubernetes, Platform Architecture


📫 Reach Me


Building production-grade AI systems that balance innovation, reliability, and real-world value.

Popular repositories Loading

  1. vue-simple-upload vue-simple-upload Public

    Simple File upload component for Vue.js

    HTML 148 30

  2. vue-toastr-2 vue-toastr-2 Public

    Simple toast notifications for Vue.js

    JavaScript 15 1

  3. distributed-systems-blog distributed-systems-blog Public

    Python 8 3

  4. krypton krypton Public

    Model Server for ML and DL Models built using FastAPI

    Python 4 1

  5. genai-playground genai-playground Public

    Collection of my GenAI experiments

    TypeScript 3

  6. Hapijs-Swagger-Demo Hapijs-Swagger-Demo Public

    Hapi JS Demo app with Swagger API Documentation

    JavaScript 2 2