RAG Systems
Production retrieval built for real users
Grounded assistants with retrieval, source-aware answers, and architectures designed for reliability instead of demos.
Generative AI Engineer
I work across RAG, tool-connected assistants, evaluation, contextual memory, and observability to build AI products that are useful, measurable, and reliable in real environments.
AI Twin
A conversational layer for discussing architecture, evaluation, systems, and project experience.
How was the RAG app evaluated?
What guardrails did you add?
How do you approach production AI?
About
My background combines production software engineering with recent deep work in Generative AI. I have built systems across agritech, geospatial, construction, and AI product workflows, with a focus on turning technical capability into practical, measurable delivery.
RAG Systems
Grounded assistants with retrieval, source-aware answers, and architectures designed for reliability instead of demos.
Evaluation
Offline datasets, baseline scoring, regression checks, and practical feedback loops for improving output quality over time.
Observability
Tracing, structured logs, metrics, and operational thinking that make debugging and iteration faster in production.
Experience
2026 - Present
Building production-style GenAI systems across RAG, evaluation, guardrails, contextual memory, and observability.
2024 - 2025
Shipped AI-powered geospatial product features for agricultural monitoring workflows used across multiple farms and clients.
2022 - 2024
Delivered data-heavy interfaces, upload workflows, and cloud-connected spatial products across construction and real estate.
Signals
0.78 faithfulness baseline established through RAG evaluation
50% longer supported conversations via context management and memory strategies
45% faster map rendering by moving to COG tile workflows
Full-stack delivery across AI, geospatial, cloud, and product-facing systems
Projects
These projects are presented as engineering case studies rather than generic portfolio cards. The focus is on architecture, evaluation, observability, product usefulness, and what was actually built.
FastAPI, Pinecone, OpenAI, Ragas, AWS
A production-oriented retrieval system with offline evaluation, retrieval-vs-generation benchmarking, Prometheus and Grafana observability, and grounded answer generation.
Built to show what a production-minded RAG stack looks like beyond a demo, with measurable quality and operational visibility.
LangGraph, FastAPI, Postgres, Pinecone, Phoenix
A production-style multi-agent backend for legal review and workflow automation, combining retrieval, orchestration, persistence, human approval, and offline evaluation.
Built to demonstrate a more realistic agentic backend with traceability, workflow state, review lifecycle persistence, and evaluation support.
Portfolio Expansion
This portfolio structure is set up to expand with more GenAI, evaluation, and internal tooling work as additional systems are published.
The goal is to turn this site into a stronger technical portfolio rather than a generic personal landing page.
Blogs
This area can evolve into articles, technical notes, or architecture breakdowns. I have added professional placeholders so the structure is already there as you publish more work.
Thoughts on grounding, evaluation, and why a working prototype is not the same thing as a production-ready assistant.
Drafting / Coming soonHow memory, trimming, and compaction patterns change the usability of assistants over longer conversations.
Drafting / Coming soonA practical view on evals, observability, and the signals that matter when you want AI systems to be trusted.
Drafting / Coming soon