mohammed@portfolio:~/projects$ ls --detail · 6 entries · sorted by year
~/projects/trace2025 · ★ active
TRACE

agentic qa + observability for ai agents — runs them through realistic tool / rag workflows, verifies outcomes, isolates where the workflow became unrecoverable, and turns failures into regression tests. built at wat.ai w/ composio + magic hour. catching agents when they hallucinate.

AI AgentsLLM EvalsRAGObservabilityPython
~/projects/meta-harness2025 · ★ active
Meta-Harness

stanford's meta-harness paper had a linear loop — i mapped it onto langgraph and made it a tree. two state machines, postgres-backed checkpointing, time-travel forking, cross-run memory. self-improving agent harnesses, by construction.

LangGraphPostgresFastAPINext.jsPython
────────────────────────────────────────────────archive────────────────────────────────────────────────
paybridge2022Founder, learned americans don't have etransfer
scam-mah2024real-time spam detection · 3rd place @ newhacks 2024
focusforge2024excel/vba time-management suite w/ gemini-powered insights
financial-planning-tool2023student budget forecasting in excel/vba · used by 100+ students

more on github.