Projects
Selected work across data engineering, AI systems, and enterprise platforms.
Finance RAG
Production-grade retrieval-augmented generation over SEC filings. Ask plain-English questions about 10-K and 10-Q disclosures and get answers grounded in cited source documents.
- ~2.1s median latency
- ~$0.02 per query
- ≥0.85 RAGAS faithfulness
- 12 SEC filings indexed
- RAG
- LLM
- FAISS
- OpenAI
- FastAPI
- Python
- SEC EDGAR
- Cohere
- Langfuse
Real-Time Enterprise Data Platform
End-to-end streaming platform on Apache Kafka and Azure Event Hubs, processing 50M+ events per day across distributed microservices with sub-100ms end-to-end latency.
- 50M+ events processed daily
- Sub-100ms p99 latency
- 12 source systems integrated
- Apache Kafka
- Azure Event Hubs
- Apache Spark
- Delta Lake
- dbt
- Azure