Dual-Model NL-to-SQL Translation for MIMIC-IV

A healthcare-focused NL-to-SQL system that converts clinical questions into executable database queries.

Built a two-stage NL-to-SQL pipeline that pairs a fine-tuned Qwen 2.5 generator with a Phi-4 validator, and benchmarked 9 models across prompting strategies on EHRSQL 2024 to improve clinical query reliability.
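The two-stage flow described above can be sketched roughly as follows. This is a minimal illustration, not the thesis implementation: the names `run_pipeline`, `PipelineResult`, `stub_generate`, and `stub_validate` are hypothetical, the two stubs stand in for the Qwen 2.5 generator and the Phi-4 validator, and returning no SQL when the validator rejects a query is an assumption about how rejection is handled.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class PipelineResult:
    question: str
    sql: Optional[str]  # None when the validator rejects the candidate (assumed behavior)
    accepted: bool


def run_pipeline(
    question: str,
    generate: Callable[[str], str],
    validate: Callable[[str, str], bool],
) -> PipelineResult:
    """Stage 1 drafts a SQL query; stage 2 decides whether to keep it."""
    candidate = generate(question)
    accepted = validate(question, candidate)
    return PipelineResult(question, candidate if accepted else None, accepted)


# Hypothetical stand-ins for the two models, so the sketch runs without any model.
def stub_generate(question: str) -> str:
    # A real generator would be a call to the fine-tuned model.
    return "SELECT COUNT(*) FROM patients"


def stub_validate(question: str, sql: str) -> bool:
    # A real validator would be a second model call; this only checks the statement type.
    return sql.strip().upper().startswith("SELECT")


result = run_pipeline("How many patients are there?", stub_generate, stub_validate)
print(result.accepted, result.sql)
```

Passing the two stages in as callables keeps the sketch independent of any particular model-serving library, so either stage could be swapped for another model when comparing candidates.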

Completed as part of the Capstone Thesis under Prof. Lipika Dey, Spring 2025.

Thesis titled: “Dual-Model Framework for Natural Language to SQL Translation in MIMIC-IV Healthcare Databases”