Open source

Projects

Things I've built in agentic AI and LLM engineering — from reinforcement learning for reasoning agents to the production runtime, caching, and evaluation tooling that agents need to ship.

AgentFlow-Pro · Agentic RL Research

+5.0 pts

Process-supervised RL that taught an 8B model to reason better — and the gain transferred to a domain it never trained on.

PyTorch, TRL, DAPO, PRM, PEFT / LoRA

Read the case study →

GuardLoop · Production Agent Runtime

A guardrail runtime for async agents: pre-flight cost/token/time budgets, per-tool circuit breakers, and OpenTelemetry spans — no agent rewrite.

OpenAI SDK, Anthropic SDK, LangGraph, OpenTelemetry

Read the case study →
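To make the per-tool circuit breaker idea concrete, here is a minimal sketch. It is not GuardLoop's actual API: `ToolBreaker`, `CircuitOpenError`, `max_failures`, and `cooldown_s` are hypothetical names chosen for illustration, and the sketch is synchronous for brevity, whereas the runtime described above targets async agents.

```python
import time
from dataclasses import dataclass
from typing import Any, Callable, Optional


class CircuitOpenError(RuntimeError):
    """Raised when a tool is called while its breaker is open."""


@dataclass
class ToolBreaker:
    """Hypothetical per-tool breaker: opens after N consecutive failures."""

    max_failures: int = 3                        # consecutive failures before opening
    cooldown_s: float = 30.0                     # seconds the breaker stays open
    clock: Callable[[], float] = time.monotonic  # injectable for testing
    failures: int = 0
    opened_at: Optional[float] = None

    def call(self, tool: Callable[..., Any], *args: Any, **kwargs: Any) -> Any:
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.cooldown_s:
                # Still cooling down: fail fast without touching the tool.
                raise CircuitOpenError("tool temporarily disabled")
            # Cooldown elapsed: allow one trial call (half-open state).
            # A single further failure re-opens the breaker.
            self.opened_at = None
            self.failures = self.max_failures - 1
        try:
            result = tool(*args, **kwargs)
        except Exception:
            self.failures += 1
            if self.failures >= self.max_failures:
                self.opened_at = self.clock()
            raise
        self.failures = 0  # any success resets the failure streak
        return result
```

Keeping one breaker instance per tool means a single failing integration is cut off without affecting the agent's other tools.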

SmartMemo · Semantic LLM Cache

+30 pts

A semantic cache for LLM agents where a learned classifier — not raw cosine similarity — decides when a cached answer is safe to reuse.

FAISS, SentenceTransformers, PyTorch, SQLite, Pydantic

Read the case study →

Orchflow · Agent Orchestration Framework

0 deps

A dependency-free Python framework for readable multi-agent pipelines: sequential, parallel, conditional, and resumable flows.

AsyncIO, LiteLLM, Pydantic

Read the case study →

agenteval · LLM Evaluation Tooling

pass-rate

Behavioral eval for agents: replaces brittle exact-match asserts with repeated-run pass-rate scoring for CI gates.

AsyncIO, OpenAI SDK, Anthropic SDK, LangChain, Typer

Read the case study →
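As a rough illustration of repeated-run pass-rate scoring, the sketch below runs an agent several times on the same prompt and gates on the fraction of runs that satisfy a behavioral check. It is not agenteval's actual interface: `pass_rate`, `passes_gate`, and the agent/check callables are hypothetical names assumed for this example.

```python
import asyncio
from typing import Awaitable, Callable


async def pass_rate(
    agent: Callable[[str], Awaitable[str]],  # hypothetical async agent: prompt -> output
    check: Callable[[str], bool],            # behavioral check on a single output
    prompt: str,
    runs: int = 10,
) -> float:
    """Run the agent `runs` times concurrently; return the fraction that pass."""
    if runs <= 0:
        raise ValueError("runs must be positive")
    outputs = await asyncio.gather(*(agent(prompt) for _ in range(runs)))
    return sum(1 for out in outputs if check(out)) / runs


def passes_gate(rate: float, threshold: float = 0.8) -> bool:
    """CI gate: succeed only if the observed pass rate meets the threshold."""
    return rate >= threshold
```

In a CI job, the check would typically assert a property of the output (for example, that it contains the right answer) rather than an exact string, so harmless wording changes between runs do not fail the build.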