Open to opportunities · Jersey City, NJ

Bhakti Shelke Senior AI Engineer

5+ years building production-grade LLM systems, RAG architectures, and agentic AI pipelines. Currently at HCZ, New York — cutting hallucinations by 35%, analyst workload by 60%, and model iteration cycles from 2 days to under 4 hours.

5+
Years Python
35%
Hallucination Cut
60%
Research Time Saved
3.60
MS GPA · Pace
// about me

Building AI that
ships and scales

I'm a Senior Python Developer and AI Engineer with 5+ years delivering scalable data ingestion pipelines, LLM-powered classification systems, and production-grade RAG architectures.

At Harlem Children's Zone in New York, I architect agentic AI systems using LangGraph, AWS Bedrock, and Llama — building orchestration layers, vector search pipelines, and microservices that connect LLMs to enterprise systems at real scale.

Previously at ZiSystech (FinTech) and JSW Steel. MS in Data Science, Pace University (GPA 3.60). BE in Computer Science, University of Mumbai (GPA 3.80).

Python & Backend
Python (5+ yrs)FastAPIREST APIsMicroservicesDockerCI/CDGitHub Actions
LLMs & AI
LangChainLangGraphAWS BedrockLlamaHugging FacePrompt EngineeringLLMOps
RAG & Vectors
PineconeFAISSChromaEmbeddingsRAG ArchitecturesEval Frameworks
Big Data & Cloud
PySparkHadoopHiveDelta LakeSnowflakeAWS S3/LambdaPostgreSQL
Languages
PythonSQLPySparkJavaScriptC++R
// selected work

Projects

Production systems and independent builds — real numbers attached.

🔍
Modular RAG Pipeline
Pinecone/FAISS vector search pipeline at HCZ that cut analyst manual research time by 40–60%, eliminating a full sprint of manual work per project cycle.
40–60% reduction in manual research time
Pinecone + FAISS hybrid retrieval
PineconeFAISSPythonLangChainAWS Bedrock
⚙️
LangGraph Agentic Orchestration Layer
State-machine orchestration for deterministic multi-step agent execution with automatic error recovery — reducing failure-driven reruns by ~50% in production.
~50% fewer agent failure reruns
Deterministic multi-step execution
LangGraphPythonAWS LambdaLlama
📊
LLMOps Prompt Evaluation Harness
Reusable evaluation framework with rubric scoring and retrieval recall metrics that slashed model iteration from 2 days to under 4 hours. Adopted across 3 internal AI products.
2 days to under 4 hours per iteration
Adopted across 3 internal products
PythonLLMOpsRubric ScoringEval Frameworks
PySpark ETL / ELT Pipeline
ETL/ELT pipelines across 4 source systems using PySpark and SQL at ZiSystech FinTech, reducing query latency by 40% and enabling real-time reporting.
40% query latency reduction
4 source systems integrated
PySparkSQLHiveDelta LakeSnowflake
// career

Experience

Where I've built, shipped, and driven measurable impact.

Harlem Children's Zone (HCZ)
AI Engineer / Senior Python Developer · New York, NY
Apr 2024 – Present
  • Architected Python-based data ingestion pipelines integrating structured and unstructured data into LLM workflows, enabling scalable classification and summarization across enterprise datasets.
  • Implemented Llama via AWS Bedrock for text classification and summarization — reduced hallucination rate by 35%+ through rubric scoring and retrieval recall evaluation frameworks.
  • Built modular RAG pipelines (Pinecone/FAISS) cutting analyst manual research time by 40–60%, eliminating a full sprint of manual work per project cycle.
  • Engineered LangGraph state-machine orchestration for deterministic multi-step agent execution with automatic error recovery — reduced failure-driven reruns by ~50%.
  • Shipped Python microservices and REST APIs connecting LLM agents to 5+ enterprise systems (AWS Lambda, S3, Bedrock, Snowflake) with zero-downtime, auditable handoffs.
  • Built reusable prompt evaluation harness (LLMOps) cutting model iteration from 2 days to under 4 hours; adopted across 3 internal AI products.
ZiSystech Pvt Ltd (FinTech)
Senior Software Engineer · India
Jun 2021 – Jan 2023
  • Built ML-integrated Python pipelines automating 3 manual data quality workflows and eliminating ~15 hours/week of analyst review.
  • Designed ETL/ELT pipelines across 4 source systems using PySpark and SQL — reduced query latency by 40% and enabled real-time reporting.
  • Developed RESTful microservices with zero post-release critical bugs.
  • Built model evaluation dashboards reducing review cycles by 2 rounds per release.
JSW Steel Pvt Ltd
Software Developer · India
Jan 2018 – Jan 2021
  • Refactored legacy C++ backend bottlenecks achieving 30–35% reduction in system response time.
  • Built 3 JavaScript UI modules adopted by 2 product teams; raised test coverage from ~10% to 65%.
// writing

On Medium

Sharing what I've learned about AI engineering, the job hunt, and building in public.

Artificial Intelligence in Plain English
From Data Science Grad to Gen AI Engineer: How I Prepped for My First Major Interview
A candid, practical account of how I went from MS Data Science graduate to landing a Gen AI engineering role — the prep strategies, the mindset, and what actually worked.
Coming soon
More on RAG, LangGraph agents & LLMOps in production
Follow along on Medium for upcoming articles on RAG architectures, LangGraph state machines, LLMOps practices, and lessons from building AI systems in production at scale.
View all articles on Medium ↗
// education

Education

Master of Science, Data Science
Pace University · New York, NY
Graduated Dec 2024
GPA 3.60 / 4.0
Bachelor of Engineering, Computer Science
University of Mumbai · India
Graduated May 2021
GPA 3.80 / 4.0

Let's build something
remarkable together.

Open to Senior Python, LLM Engineer, and AI Engineer roles. Based in Jersey City, NJ — open to NYC and remote.

Ask Bhakti's AI
● Knows everything about Bhakti
Hi! I'm Bhakti's portfolio AI. Ask me anything about her experience, projects, skills, or how to contact her. 👋
Projects Skills Current role Contact Education