Optimizing Context Windows for Financial Report Analysis
How we reduced hallucination rates by 40% using hybrid search (Keyword + Vector) and metadata filtering in Pinecone.
Thoughts on Data Engineering patterns, AI system architecture, and automation workflows.
A step-by-step guide to automating Excel reporting using Python, Pandas, and AWS Lambda, saving 15 hours per week.
Why the monolithic warehouse is struggling and how a domain-oriented Data Mesh approach improves velocity.
Exploring memory safety and speed by implementing a basic HNSW index from scratch in Rust.
Comparing the DAG-based approach of Airflow with the modern, code-first philosophy of Prefect 2.0.
Using QLoRA to fine-tune a 7B-parameter model on a single RTX 4090 for SQL generation tasks.