Trang Le
#math graduate. Postdoc fellow with Jason Moore.
University of Pennsylvania
RLadies Johannesburg 2021-09-14
@trang1618
🌳
🔥
Terry Therneau, Beth Atkinson, Brian Ripley
Almende B.V., Benoit Thieurmel , Titouan Robert
Torsten Hothorn, Heidi Seibold, Achim Zeileis
Martin Borkovec, Niyaz Madin, et al.
Terence Parr, Prince Grover
target
features
dependent variable/outcome/phenotype
predictors/variables
treeheatr::heat_tree(x)
can be of object
heat_tree(x, target_lab = 'Outcome')
351 blood samples (January 10 - February 18)
3 features:
heat_tree(x = covid, target_lab = 'Outcome')
data.frame
heat_tree(covid, target_lab = 'Outcome',
feats = NA)
split_ldh <- partysplit(1L, breaks = 365)
split_crp <- partysplit(2L, breaks = 41.2)
split_lymp <- partysplit(3L, breaks = 14.7)
custom_tree <- partynode(1L, split = split_ldh , kids = list(
partynode(2L, split = split_crp, kids = list(
partynode(3L, info = 'Survival'),
partynode(4L, split = split_lymp, kids = list(
partynode(5L, info = 'Death'),
partynode(6L, info = 'Survival'))))),
partynode(7L, info = 'Death')))
heat_tree(x = custom_tree, data_test = covid, target_lab = 'Outcome')
a standard practice to illustrate the strengths and weaknesses of algorithms with regards to different problem characteristics
target
features
dependent variable/outcome/phenotype
predictors/variables
pmlbr::fetch_data(x)
character object
name of the dataset to fetch from PMLB
fetch_data('wine_quality_red')
fixed.acidity | volatile.acidity | ... | target |
---|---|---|---|
7.4 | 0.700 | 5 | |
7.8 | 0.880 | 5 | |
7.8 | 0.760 | 5 | |
11.2 | 0.280 | 6 | |
7.4 | 0.700 | 5 | |
7.4 | 0.660 | 5 | |
7.9 | 0.600 | 5 |
fetch_data('wine_quality_red')
Please save a copy of the notebook in *your own* Google Drive so you can start editing.
@trang1618
Funding
NIH LM010098
NIH AI116794
R packages
ggplot2
partykit
ggparty
heatmaply
People
Jason Moore
Anonymous reviewers
By Trang Le
Workshop at RLadies Miami, Nov 19 2020 https://trang1618.github.io/treeheatr https://epistasislab.github.io/pmlb/