The First Decade as Faculty - Looking Back

Aditya reflects on his greatest hits from the first decade of facultyhood.

Low-Cost LLM-Powered Data Processing with BARGAIN

BARGAIN reduces cost of data processing with LLMs while providing statistical guarantees on output quality

Liberating Structured Data from PDF Prisons

TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inferring the shared underlying visual template across documents

Interactive LLM-Powered Data Processing with DocWrangler

DocWrangler is an IDE that provides instant feedback, visual exploration tools, and AI assistance for building and iterating on LLM-powered data processing pipelines