Arjun Gupta

AI Engineer · Agentic Systems · Knowledge Graphs · Async Data Pipelines · Voice AI

Building production agentic systems end-to-end: federated multi-tenant orchestration, hybrid retrieval over knowledge graphs, and async Python infrastructure for real-time pipelines.

About

AI Engineer focused on shipping production agentic systems end-to-end. I architect federated multi-tenant platforms with stateful LangGraph orchestrators and custom MCP servers, design async-first Python infrastructure (FastAPI, asyncpg) with event-driven message buses, and build hybrid retrieval engines combining PostgreSQL pgvector with Neo4j knowledge graphs and semantic reranking. Previously built low-latency voice AI systems with LiveKit, PipeCat, and Deepgram.

Experience

AI Engineer Intern

Oct 2025 – Present

Stealth AI Startup

  • Owned three production AI codebases end-to-end as sole full-stack engineer and product owner, managing async backends and frontend deployments (Render, Vercel), enabling the team to ship the v0 pilot to live users without an additional engineering hire.

  • Architected a federated multi-tenant agentic platform (Hub/Spoke model) with A2A communication and stateful LangGraph orchestrators backed by a custom 14-tool MCP server, enabling spoke users to query a hub's relationship graph through agent-mediated consent without exposing private signals.

  • Engineered an async-first Python stack (FastAPI, asyncpg) with an event-driven message bus and deterministic dedup, handling real-time WhatsApp WebSocket ingest and Meta Cloud API delivery at sub-second consumer lag with at-least-once guarantees.

  • Designed a 4-way hybrid retrieval engine fusing PostgreSQL pgvector (HNSW + FTS) with a perception-aware Neo4j knowledge graph and HyDE + Cohere reranking, improving top-3 candidate recall over baseline vector-only search on internal eval queries.

  • Built stateful GPT-4o workflows with strict-JSON tool use and an LLM-as-judge (PageIndex tree) for intent classification and hallucination control; enforced multi-tenant data privacy via Clerk JWT auth and workspace-scoped SQL filters to prevent cross-tenant leakage by design.

AI Engineer Intern

Mar 2025 – Jul 2025

ZuduAI

  • Shipped production AI agents on LangChain + LangGraph in a microservices setup, delivering RAG workflows that served live customer queries across enterprise deployments.

  • Designed and deployed low-latency voice AI systems on LiveKit + PipeCat integrating Deepgram STT and multilingual TTS, achieving real-time response latency suitable for natural conversational interaction.

  • Built post-call NLP pipelines (sentiment, entity extraction, intent) on Azure Cognitive Services orchestrated via Apache Airflow + n8n, generating structured insights from raw call transcripts to feed downstream analytics.

Skills

AI / ML

LangChainLangGraphRAGKnowledge GraphsStateful AgentsTool / Function CallingLLM-as-JudgeMCP ServersPrompt EngineeringNLP

Languages

Python (Asyncio)TypeScriptSQLJava

Backend / Infra

FastAPIasyncpgWebSocketsPostgreSQL (pgvector)Neo4jRESTful APIsRenderVercelGit

Audio / Voice

LiveKitPipeCatDeepgramElevenLabs

Achievements

Top 5 in Fetch-a-thon (AI-based EV Trip Planner)

Finalist in Code For Bharat Season 1 @ Microsoft

Global Rank 2567 in LeetCode Biweekly 144

Top 10 in MLSA MIET Hackathon

Selected Work

A collection of projects showcasing expertise in AI/ML, voice agents, and full-stack development.

Aimers Voice-RAG Voice Agent

Real-time conversational AI voice agent with RAG pipeline using LlamaIndex and LiveKit. Integrated OpenAI GPT-4o, Deepgram for STT, and ElevenLabs for multilingual TTS.

LlamaIndexLiveKitOpenAI GPT-4oDeepgramElevenLabs

Krishi Jyoti - AI Agricultural Platform

Multi-faceted AI platform providing comprehensive support to farmers. Features RAG-powered chatbot for government schemes, ML-based crop recommendation engine, and real-time voice AI agent.

PythonLlamaIndexWebSocketsScikit-learnFastAPI

GenAI Sports Tournament Discovery System

Autonomous AI agent system to discover, process, and provide structured information about sports tournaments using agentic workflows and dynamic query generation.

LLMFastAPIWeb ScrapingAgentic Workflows

Codplat - Full-Stack Coding Platform

Modern, integrated environment for coding-related activities with TypeScript frontend for responsive UX and robust Python backend for server-side logic.

TypeScriptPythonReactFastAPIMySQL

Intelligent Chat Microservice

Standalone, containerized microservice for intelligent chat functionality, designed for easy integration into larger systems with Docker deployment.

PythonDockerFastAPISQLite

DEFMA WeaponWise

Comprehensive logistics platform for Indian Army fleet and armament management with real-time GPS tracking, predictive maintenance, and digital KOTE system.

ReactJSPythonFlaskMySQLReal-time Tracking

Serverless CSV Data Visualizer

Fully serverless web application that processes uploaded CSV data and visualizes results as interactive bar charts using AWS cloud services.

AWS LambdaAPI GatewayS3PythonPandas

Machine Learning Model Collection

Comprehensive collection of ML models including predictive analytics, classification systems, and data processing pipelines for various real-world applications.

PythonScikit-learnTensorFlowPandasNumPy

Get in Touch

Currently seeking full-time opportunities to leverage my skills in AI/ML engineering, conversational AI, and voice agent development. Let's connect!