Rajeev Kumar

Build Agentic AI systems and GenAI pipelines for fintech & enterprise clients. GenAI | Agentic AI | LLMs | RAG | Python | FastAPI | LangChain | LangGraph | Vector Databases | Multi-agent Systems

Mant, Uttar Pradesh, India

About

I build production-grade AI systems focused on Agentic AI, RAG pipelines, LLM infrastructure, and scalable GenAI applications.

My work revolves around designing intelligent systems that combine:

  • Large Language Models (LLMs)
  • Retrieval-Augmented Generation (RAG)
  • Vector Search & Embeddings
  • AI Agents & Tool Calling
  • Rust + Python backend systems
  • AI inference pipelines
  • Semantic search architectures

I actively work with technologies and ecosystems including:

  • Rust
  • Python
  • LangChain
  • Qdrant / pgvector
  • FastAPI / Axum
  • ONNX / Ollama
  • WrenAI
  • MindsDB
  • Hugging Face
  • LLM APIs & local inference

Currently exploring:

  → Agentic workflows
  → Multi-agent systems
  → AI search infrastructure
  → Low-latency inference systems
  → Streaming AI architectures
  → Scalable RAG pipelines

I’m especially interested in building AI systems that are:

  • Fast
  • Reliable
  • Production-ready
  • Memory efficient
  • Scalable at high concurrency

My focus is not just on using AI APIs, but on engineering the infrastructure and systems behind modern AI products.

Open to collaborations, AI engineering opportunities, and building impactful GenAI products.

Experience

  • Sr. Consultant at Deloitte
    Jul 2025 - Present · 1 yr

  • Python Developer at Innefu Labs Pvt. Ltd.
    Apr 2021 - Jun 2025 · 4 yrs 3 mos

  • Sr. Python Developer at Nexensus
    Jan 2020 - Apr 2021 · 1 yr 4 mos

  • Python Developer at Adiroot Technologies
    Feb 2018 - Dec 2019 · 1 yr 11 mos