
Tianxi Cai, ScD
Professor of Biomedical Informatics, Harvard Medical School
John Rock Professor of Population and Translational Data Sciences, Harvard T.H. Chan School of Public Health
Director, Translational Data Science Center for a Learning Health System (CELEHS)
Tianxi Cai is a major player in developing analytical tools for mining EHR data and for predictive modeling with biomedical data. She provides statistical leadership on several large-scale projects, including the NIH-funded Undiagnosed Diseases Network at DBMI. Cai's research lab develops novel statistical and machine learning methods for several areas, including clinical trials, real-world evidence, and personalized medicine using genomic and phenomic data. Cai received her ScD in Biostatistics from Harvard and was an assistant professor at the University of Washington before returning to Harvard as a faculty member in 2002.
DBMI Research Areas
Semi-supervised estimation of covariance with application to phenome-wide association studies with electronic medical records data.
Authors: Chan SF, Hejblum BP, Chakrabortty A, Cai T.
Stat Methods Med Res
View full abstract on Pubmed
Large-Scale Simultaneous Testing of Cross-Covariance Matrices with Applications to PheWAS.
Associations of antepartum suicidal behaviour with adverse infant and obstetric outcomes.
Authors: Zhong QY, Gelaye B, Karlson EW, Avillach P, Smoller JW, Cai T, Williams MA.
Paediatr Perinat Epidemiol
View full abstract on Pubmed
Automated feature selection of predictors in electronic medical records data.
Feature Extraction for Phenotyping from Semantic and Knowledge Resources.
Authors: Ning W, Chan S, Beam A, Yu M, Geva A, Liao K, Mullen M, Mandl KD, Kohane I, Cai T, Yu S.
J Biomed Inform
View full abstract on Pubmed
Use of natural language processing in electronic medical records to identify pregnant women with suicidal behavior: towards a solution to the complex classification problem.
Authors: Zhong QY, Mittal LP, Nathan MD, Brown KM, Knudson González D, Cai T, Finan S, Gelaye B, Avillach P, Smoller JW, Karlson EW, Cai T, Williams MA.
Eur J Epidemiol
View full abstract on Pubmed
Variance components genetic association test for zero-inflated count outcomes.
Probabilistic record linkage of de-identified research datasets with discrepancies using diagnosis codes.
Authors: Hejblum BP, Weber GM, Liao KP, Palmer NP, Churchill S, Shadick NA, Szolovits P, Murphy SN, Kohane IS, Cai T.
Sci Data
View full abstract on Pubmed
Identifying lupus patients in electronic health records: Development and validation of machine learning algorithms and application of rule-based algorithms.
Authors: Jorge A, Castro VM, Barnado A, Gainer V, Hong C, Cai T, Cai T, Carroll R, Denny JC, Crofford L, Costenbader KH, Liao KP, Karlson EW, Feldman CH.
Semin Arthritis Rheum
View full abstract on Pubmed
Distinct Proteomic Signatures in 16 HDL (High-Density Lipoprotein) Subspecies.
Authors: Furtado JD, Yamamoto R, Melchior JT, Andraski AB, Gamez-Guerrero M, Mulcahy P, He Z, Cai T, Davidson WS, Sacks FM.
Arterioscler Thromb Vasc Biol
View full abstract on Pubmed