Project Summary/Abstract We identify critical weaknesses with Classi?cation and Regression Trees (CART), a widely used base learner for machine learning of big omic-data analysis, and propose to replace these with a fundamentally different type of base learner we call super greedy trees (SGT's). SGT's cut the space in a fundamentally different manner, resulting in a richer partition structure with provable consistency and superior empirical performance. The project will develop a uni?ed SGT framework for big data analysis using machine learning including the treatment of time varying covariate survival analysis, unsupervised learning, highly imbalanced data and multivariate regression. The SGT framework will be deployed within scalable and extensible open source software that will allow NIGMS researchers to deploy them to deal with their challenging big data problems.
|Effective start/end date||6/1/21 → 5/31/22|
- National Institute of General Medical Sciences: $415,245.00
Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.