About me
I’m a professor at NYU’s Tandon School of Engineering, with appointments in Computer Science and Data Science, and I co-direct the VIDA Center. My research focuses on making data more accessible and trustworthy — spanning data analysis and integration, visualization, AI/ML, and information discovery, with applications in urban analytics, computational reproducibility, biomedical research, and online crime.
News
- Our NSF-funded project that aims to combat wildlife trafficking in online marketplaces was featured in Forbes.
- We will present two demos at SIGMOD 2026
- BDI-Kit Demo: A Toolkit for Programmable and Conversational Data Harmonization
- BDIViz in Action: Interactive Curation and Benchmarking for Schema Matching Methods
- Our paper describing AutoDDG has been accepted to ACM SIGMOD 2026. AutoDDG combines traditional profiling and LLMs to automatically generate textual descriptions of structured data. The descriptions not only improve dataset findability, but also make it possible for LLMs to understand datasets. Here’s the preprint: https://arxiv.org/pdf/2502.01050. The system is open source and available at https://github.com/VIDA-NYU/AutoDDG
