About me
Hi! I’m Lavender Jiang (蒋遥). I am a fifth year Data Science PhD student at New York University, co-advised by Eric Oermann and Kyunghyun Cho.
My research focuses on the science of training domain-specific models, building AI that bridges the gap between Machine Learning theory and real-world clinical deployment. My passion for healthcare was motivated by seeing my grandparents suffer from cancer, which inspires me to make a real-world impact in the field.
⭐🗓️ I’m at Neurips San Diego Dec 1 - Dec 7 and feel free to schedule an in-person meeting wth me here!
Affiliations and Honors
- Labs: OLAB and ML2
- Fellowships:
- Apple AIML PhD fellowship
- NYU Center for Data Science Fellowship
- NYU CDS Meidcal Fellowship.
- Grants:
- Seed Grant Funding Award (Apple Workshop on ML for Health 2024, $25,000). Kyunghyun Cho, Eric Oermann, Lavender Jiang.
Educational Background
09/2021 - present New York University, New York, NY PhD student in Data Science (GPA 3.979) Selected courseworks: Big Data, Machine Learning, Inference and Representation
09/2017 - 05/2021 Carnegie Mellon University, Pittsburgh, PA BSc (Honor): Electrical and Computer Engineering with an additional major in Mathematical Sciences (GPA 3.55) Selected courseworks: Probabilities and Statistics, Linear Algebra, Real Analysis, Digital Systems, Computer Systems, Signal Processing, Software Engineering, Machine Learning
Selected Publications
Health system-scale language models are all-purpose prediction engines. Lavender Yao Jiang, Chris Liu, Mustafa Nasir-Moin, Nima Pour Nejatian, Duo Wang, Anas Abidin, Howard Riina, Ilya Laufer, Paawan Punjabi, Kevin Eaton, Madeline Miceli, Nora C. Kim, Cordelia Orillac, Zane Schnurman, Christopher Livia, Hannah Weiss, David Kurland, Sean Neifert, Yosef Dastagirzada, Douglas Kondziolka, Alexander M Cheung, Grace Yang, Ming Cao, Mona Flores, Anthony B. Costa, Yindalon Aphinyanaphongs, Kyunghyun Cho and Eric Karl Oermann. (Nature)
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations. Lavender Y. Jiang, Angelica Chen, Xu Han, Xujin Chris Liu, Radhika Dua, Kevin Eaton, Frederick Wolff, Robert Steele, Jeff Zhang, Anton Alyakin, Qingkai Pan, Yanbing Chen, Karl L. Sangwon, Daniel A. Alber, Jaden Stryker, Jin Vivian Lee, Yindalon Aphinyanaphongs, Kyunghyun Cho, Eric Karl Oermann. In preparation.
Language Models Can Guess Your Identities from De-identified Clinical Notes. Lavender Yao Jiang, Daniel Alexander Alber, Zihao Yang, Karl L. Sangwon, Xujin Chris Liu, Kyunghyun Cho, Eric Karl Oermann. In resubmission.
Language Model Classifier Aligns Better with Physician Word Sensitivity than XGBoost on Readmission Prediction. Grace Yang, Ming Cao, Lavender Y. Jiang, Xujin C. Liu, Alexander T.M. Cheung, Hannah Weiss, David Kurland, Kyunghyun Cho, Eric K. Oermann. (ML4H 2022)
Generalization in Healthcare AI: Evaluation of a Clinical Large Language Model. Salman Rahman, Lavender Yao Jiang, Saadia Gabriel, Yindalon Aphinyanaphongs, Eric Karl Oermann, Rumi Chunara.
MedG-KRP: Medical Graph Knowledge Representation Probing. Gabriel R. Rosenbaum, Lavender Y. Jiang, Ivaxi Sheth, Jaden Stryker, Anton Alyakin, Daniel Alexander Alber, Nicolas K. Goff, Young Joon Fred Kwon, John Markert, Mustafa Nasir-Moin, Jan Moritz Niehues, Karl L. Sangwon, Eunice Yang, Eric Karl Oermann. ML4H 2024.
Automated, Scalable and Generalizable Deep Learning for Tracking Cortical Spreading Depression Using EEG. Alireza Chamanzar¹, Xujin Liu¹, Lavender Y. Jiang, Kimon A. Vogt, José M. F. Moura, Pulkit Grover. (International IEEE/EMBS Conference on Neural Engineering, 2021.) Patent: System and method for deep learning for tracking cortical spreading depression using eeg (WO2022235467A1).
- Graph Signal Processing and Deep Learning, Mark Cheung, John Shi, Yao Jiang, Oren Wright and José Moura. (IEEE Signal Processing Magazine Special Issue on Graph Signal Processing)
Services
Reviewer for IEEE TNNLS, Scientific Reports, BMC Health Services Research, ICLR DMLR Workshop 2024, ACL Student Research Workshop 2023, ACL 2023, All Things Attention.
Co-organizer for Global AI Frontier Lab Workshop 2025, NYU AI School 2022
Teaching
06/2022 - 05/2025: Research mentor
I scoped and led research projects for 12 students ranging from high school interns to graduate students, guiding them to 4 accepted publications at ACL and ML4H workshops.
Clinical Model Evaluation: Oversaw the development of new benchmarks for numerical lab measurements and knowledge graph understanding in LLMs, directly addressing gaps in current clinical AI capabilities. Analyzed the differential predictive power of clinical note types and sections to inform data prioritization.
Model Interpretability: Led research into evaluating the alignment between Transformer sensitivity patterns with human physicians and utilizing correlation priors for automated ICD coding.
Infrastructure & Efficiency: Guided experiments on pretraining dynamics, specifically analyzing the impact of data packing and shuffling strategies on model loss.
03/2023 - 05/2025 Teaching Assistant
Taught, graded or tutored Machine Learning (Taught by Kyunghyun Cho, Rajesh Ranganath, Mengye Ren, Ravid Shwartz-Ziv) and NLP (Profs. He He).
Curriculum Development: Designed Python assignments bridging theoretical proofs with implementation, covering fundamental concepts (classification, backpropagation) and modern NLP (prompt engineering).
Teaching: Delivered recitations on Transformers and ML fundamentals. Achieved top-tier student evaluations for subject mastery, organization, and clarity in answering complex technical questions.
Mentorship: Guided 8 graduate teams through applied NLP capstone projects and supported students with proof-based theoretical concepts.
09/2020 - 12/2020 Section leader and grader for 21-260 (differential equations)
Students
| Student Name | Project Year | Affiliation |
|---|---|---|
| Luca Ruilin Wang | 2025 | MLE @ Pieta, NYU DS MS ‘25 |
| Cynthia Yanbing Chen | 2025 | Penn State Biostatistics PhD ‘29, NYU Biostatistics MS ‘25 |
| Gabriel R. Rosenbaum | 2024 | UChicago CS BS ‘29, Packer Collegiate Institute ‘25 |
| Kelly Ruiqi Deng | 2024 | Cornell Tech Health Tech MS ‘26, NYU CS BA ‘24 |
| Avery Chi Hang | 2024 | Yale Statistics MS ‘26, NYU DS + Math BA ‘24 |
| Hongyi Zheng | 2023 | Quantitative Strategist @ Akuna Capital, NYU Math+CS+DS BA ‘23 |
| Tracy Zhu | 2023 | UChicago Statistics MS ‘25, NYU DS + Math BA ‘23 |
| Gavin Zihao Yang | 2023 | Northeastern CS PhD ‘29, NYU CS+DS BA ‘24 |
| Lucy Wu | 2023 | Data Scientist @ Microsoft, Columbia Data Science MS ‘25, NYU DS+CS BA ‘23 |
| Stephen Zhang | 2023 | SDE @ Apple, UPenn Data Science MS ‘25, NYU CS+DS BA ‘23 |
| Grace Ge’er Yang | 2022 | Data Scientist @ Databricks, Stanford Data Science MS ‘25, NYU Math+DS BA ‘23 |
| Ming Cao | 2022 | UPenn Data Science MS ‘25, NYU DS+CS BA ‘23 |
