Hello I'm
Shubham Gore

MS CS (AI/ML) @ USC — ML Systems · LLM Infrastructure · Full-Stack AI

Shubham Gore

About me

I'm a Master's student in Computer Science (AI/ML) at USC, graduating December 2025. I work at the intersection of ML systems and backend engineering — building custom CUDA/Triton kernels, LLM training and inference pipelines, and full-stack agentic AI applications.

My recent work spans vision-language model training, reinforcement learning for retrieval-augmented QA, and production-grade AI agents deployed on GCP. Previously, I built distributed event-streaming systems at Avaya processing 1M+ messages/day and conducted ML research on traffic prediction at CDAC. I'm passionate about making AI systems that are fast, reliable, and actually useful.

My education & experience

My projects

AI HR Hiring Agent

Full-stack AI hiring assistant with LangGraph agentic workflow, Gemini LLM, FastAPI on GCP Cloud Run, Next.js 15 frontend on Vercel, and PostgreSQL with pgvector on Cloud SQL. Features candidate scoring...

  • LangGraph
  • FastAPI
  • GCP
  • Next.js
  • pgvector
  • Gemini
  • PostgreSQL

Vision-Language Model (VLM) Training

Trained a custom VLM combining SigLIP vision encoder, MLP projector, and Qwen2.5-0.5B language model on LLaVA-Instruct-150K dataset. Implemented KV-cache optimization, INT4/INT8 quantization, continuous batching, and custom Triton kernel fusion for...

  • PyTorch
  • Triton
  • CUDA
  • Qwen2.5
  • SigLIP
  • Quantization

Write-in-Margins (WiM) RL System

Retrieval-augmented QA system for HotpotQA where an LLM generates margin notes on document chunks before synthesizing answers. Trained with PPO on Qwen2.5-3B using NF4 quantization, LoRA/PEFT, and PagedAdamW8bit. Extended the...

  • PPO
  • LoRA
  • Qwen2.5
  • RAG
  • HuggingFace
  • W&B

VC Deal Sourcing Agent

LangGraph-based agentic system for venture capital deal sourcing with Anthropic tool-calling, Pydantic v2 anti-hallucination validators, and multi-source search router (Exa, Tavily, TechCrunch RSS, ProductHunt). Uses SqliteSaver persistence and source-weighted result...

  • LangGraph
  • Anthropic API
  • Pydantic
  • Exa
  • Tavily

GraphRAG Knowledge Retrieval

Built a hybrid retrieval system using Neo4j knowledge graphs combined with vector similarity search and cross-encoder reranking. Converts e-commerce product reviews into structured graph representations enabling both graph traversal and...

  • Neo4j
  • LangChain
  • RAG
  • OpenAI
  • Python

Plant Traits Prediction — Kaggle (Top 1%, Top 50/3000+)

Achieved Top 1% ranking (Top 50 out of 3,000+ participants) using Swin Transformer, ConvNeXT, and ViT models with augmented datasets and crowd-sourced data integration for plant trait prediction.

  • PyTorch
  • Swin Transformer
  • ViT
  • Kaggle
  • Computer Vision

My skills

Contact me

Please contact me directly at spgore@usc.edu