Arjun Gupta
AI Engineer · Agentic Systems · Knowledge Graphs · Async Data Pipelines · Voice AI
Building production agentic systems end-to-end: federated multi-tenant orchestration, hybrid retrieval over knowledge graphs, and async Python infrastructure for real-time pipelines.
About
AI Engineer focused on shipping production agentic systems end-to-end. I architect federated multi-tenant platforms with stateful LangGraph orchestrators and custom MCP servers, design async-first Python infrastructure (FastAPI, asyncpg) with event-driven message buses, and build hybrid retrieval engines combining PostgreSQL pgvector with Neo4j knowledge graphs and semantic reranking. Previously built low-latency voice AI systems with LiveKit, PipeCat, and Deepgram.
Experience
AI Engineer Intern
Oct 2025 – PresentStealth AI Startup
- •
Owned three production AI codebases end-to-end as sole full-stack engineer and product owner, managing async backends and frontend deployments (Render, Vercel), enabling the team to ship the v0 pilot to live users without an additional engineering hire.
- •
Architected a federated multi-tenant agentic platform (Hub/Spoke model) with A2A communication and stateful LangGraph orchestrators backed by a custom 14-tool MCP server, enabling spoke users to query a hub's relationship graph through agent-mediated consent without exposing private signals.
- •
Engineered an async-first Python stack (FastAPI, asyncpg) with an event-driven message bus and deterministic dedup, handling real-time WhatsApp WebSocket ingest and Meta Cloud API delivery at sub-second consumer lag with at-least-once guarantees.
- •
Designed a 4-way hybrid retrieval engine fusing PostgreSQL pgvector (HNSW + FTS) with a perception-aware Neo4j knowledge graph and HyDE + Cohere reranking, improving top-3 candidate recall over baseline vector-only search on internal eval queries.
- •
Built stateful GPT-4o workflows with strict-JSON tool use and an LLM-as-judge (PageIndex tree) for intent classification and hallucination control; enforced multi-tenant data privacy via Clerk JWT auth and workspace-scoped SQL filters to prevent cross-tenant leakage by design.
AI Engineer Intern
Mar 2025 – Jul 2025ZuduAI
- •
Shipped production AI agents on LangChain + LangGraph in a microservices setup, delivering RAG workflows that served live customer queries across enterprise deployments.
- •
Designed and deployed low-latency voice AI systems on LiveKit + PipeCat integrating Deepgram STT and multilingual TTS, achieving real-time response latency suitable for natural conversational interaction.
- •
Built post-call NLP pipelines (sentiment, entity extraction, intent) on Azure Cognitive Services orchestrated via Apache Airflow + n8n, generating structured insights from raw call transcripts to feed downstream analytics.
Skills
AI / ML
Languages
Backend / Infra
Audio / Voice
Achievements
Top 5 in Fetch-a-thon (AI-based EV Trip Planner)
Finalist in Code For Bharat Season 1 @ Microsoft
Global Rank 2567 in LeetCode Biweekly 144
Top 10 in MLSA MIET Hackathon
Selected Work
A collection of projects showcasing expertise in AI/ML, voice agents, and full-stack development.
Real-time conversational AI voice agent with RAG pipeline using LlamaIndex and LiveKit. Integrated OpenAI GPT-4o, Deepgram for STT, and ElevenLabs for multilingual TTS.
Multi-faceted AI platform providing comprehensive support to farmers. Features RAG-powered chatbot for government schemes, ML-based crop recommendation engine, and real-time voice AI agent.
Autonomous AI agent system to discover, process, and provide structured information about sports tournaments using agentic workflows and dynamic query generation.
Modern, integrated environment for coding-related activities with TypeScript frontend for responsive UX and robust Python backend for server-side logic.
Standalone, containerized microservice for intelligent chat functionality, designed for easy integration into larger systems with Docker deployment.
Comprehensive logistics platform for Indian Army fleet and armament management with real-time GPS tracking, predictive maintenance, and digital KOTE system.
Fully serverless web application that processes uploaded CSV data and visualizes results as interactive bar charts using AWS cloud services.
Get in Touch
Currently seeking full-time opportunities to leverage my skills in AI/ML engineering, conversational AI, and voice agent development. Let's connect!