Final_NW

650 BCE

1792

1890

1943

1956

2006

2011

2020

2021

2022

Babylonian astrology for prediction

National Weather Service

John McCarthy coins "Artificial Intelligence."

IBM's Watson wins Jeopardy!

AI begins to outperform in healthcare diagnostics

Old Farmer's Almanac first published

McCulloch Pitts' neural network

Deep learning by Geoffrey Hinton

GPT-3

GPT-4 | Dall-e | AI reaching critical mass

ishanu chattopadhyay

Universal screening for complex diseases

Predictive intelligence for security

Digital Twins for complex systems

Zero-knowledge Discovery

Biology, Medicine and Social Systems

University of Chicago Medicine

650 BCE

1792

1890

1943

1956

2006

2011

2020

2021

2022

Babylonians use astrology for prediction

National Weather Service established

John McCarthy coins "Artificial Intelligence."

IBM's Watson wins Jeopardy!

AI begins to outperform in healthcare diagnostics

Old Farmer's Almanac first published

McCulloch Pitts' neural network

Deep learning by Geoffrey Hinton

GPT-3

GPT-4 | Dall-e | AI reaching critical mass

mathematics

computer science

social science

medicine

AI/ML learning theory and applications

Complex systems

Implication of AI in Future of Societay

University of Chicago Medicine

The Laboratory for Zero Knowledge Discovery

collaborators

Alex Leow

Psychiatry UIC

Anna Podolanczuk, Pulmonary Care, Weill Cornell

Gary Hunninghake, Pulmonary C, Harvard

Robert Gibbons, Bio-statistics

Daniel Rubins, Anesthesia and Critical Care

Peter Smith, Pediatrics

Michael Msall Pediatrics

Fernando Martinez, Pulmonary Critical Care, Weill Cornell

James Mastrianni, Neurology

James Evans, sociology

Erika Claud, Pediatrics

Aaron Esser-Kahn Molecular Engineering

David Llewellyn

University of Exeter

Kenneth Rockwood

Dalhousie University

Andrew Limper Mayo Clinic

zed.uchicago.edu

Dr. Shahab Asoodeh
Dr. Yi Huang
Dmytro Onishenko
Victor Rotaru
Jin Li
Ruolin Zhang
David Yang

Dr. Nicholas Sizemore
Drew Vlasnik
Lucas Mantovani
Jaydeep Dhanoa
Jasmine Mithani
Angela Zhang
Warren Mo

people

zed.uchicago.edu

Department of Pediatrics

UChicago

Department of Neurology & The Memory Center

UChicago

Department of Psychiatry

UChicago

Pulmonary Critical Care, Weill Cornell

Department of Anesthesia and Critical Care

UChicago

Center for Health Statistics

UChicago

Pulmonary Critical Care, Harvard Medical School

Department of Psychiatry

UIC

Demon Network, Exeter, Alan Turing Institute, UK

Dalhousie University, Canada

Pritzker School of Molecular ENgineering

Social Science

UChicago

zed.uchicago.edu

D3M (I2O)

PAI (DSO)

PREEMPT (BTO)

YFA (DSO)

NIA

Problem: Late or missed diagnosis of serious illnesses

Can we use existing EHR to reliably screen for complex diseases such as pulmonary fibrosis, dementia and rare cancers?

Electronic Healthcare Record

IPF

ASD

ADRD

Onishchenko, Dmytro, Robert J. Marlowe, Che G. Ngufor, Louis J. Faust, Andrew H. Limper, Gary M. Hunninghake, Fernando J. Martinez, and Ishanu Chattopadhyay. "Screening for idiopathic pulmonary fibrosis using comorbidity signatures in electronic health records." Nature Medicine 28, no. 10 (2022): 2107-2116.

Universal screening for complex diseases

Problem: Event-level prediction in social systems,

e.g. predicting crime before it happens

Predictive intelligence for security

Can we predict complex spatio-temporal stochastic processes?

Rotaru, Victor, Yi Huang, Timmy Li, James Evans, and Ishanu Chattopadhyay. "Event-level prediction of urban crime reveals a signature of enforcement bias in US cities." Nature human behaviour 6, no. 8 (2022): 1056-1068.

Problem: Can we predict the next pandemic?

Can we predict future mutations? Can we define the "edge of emergence"?

Digital Twins for complex systems

Chattopadhyay, Ishanu, Kevin Wu, Jin Li, and Aaron Esser-Kahn. "Emergenet: Fast Scalable Pandemic Risk Assessment of Influenza A Strains Circulating In Non-human Hosts." (2023). Under Review in Nature

PREEMPT

Problem: Can AI predict how we think and interact?

Can we predict how opinions evolve?

Digital Twins for complex systems

YFA 2020

Can an AI tell if you are lying?

Can an AI tell how you are going to vote?

Yang, David, James EVans, and Ishanu Chattopadhyay. "‘Its the Economy Stupid’: Predictive Theory of Belief Shift Connecting Economic Stress to Societal Polarization." (2023).

Problem: Late or missed diagnosis of serious illnesses

Can we use existing EHR to reliably screen for complex diseases such as pulmonary fibrosis, dementia and rare cancers?

Electronic Healthcare Record

IPF

ASD

ADRD

Universal screening for complex diseases

The need for Universal Screening

Often the problem is not that diseases cannot be diagnosed by physicians, but one of missed or late diagnoses in the primary care workflow

Takes too long,

not supported by insurance,

"gut feeling" / "wait & see" common

Universal screening for many diseases are non-existant

Is AI/ML adding anything of relevance?

"predicting" autism > 3yrs

"diagnosing" fibrosis from lung imaging

"diagnosing" dementia from brain scan

Rapid Universal Point-of-care Screening for ILD/IPF Using Comorbidity Signatures in Electronic Health Records

Flag patients before they (or doctors) suspect

Primary Care

Pulmonologist

Zero-burden Co-morbid Risk Score (ZCoR)

shortness of breath

dry cough

doctor can hear velcro crackles

Common Symptoms

>50 years old

more men than women

IPF

Rare disease

~5 in 10,000

Post-Dx

Survival

~4 years

At least one misdiagnosis

~55%

Two or more misdiagnoses

38%

Initially attributed to age- related symptoms:

72%

Cannot always be seen on CXR

Non-specific symptoms

PCP workflow demands

Initial midiagnoses

~ 4yrs

current

post-Dx survival ~4yrs

~ 4yrs

current clinical DX

ZCoR screening

Onishchenko, D., Marlowe, R.J., Ngufor, C.G. et al. Screening for idiopathic pulmonary fibrosis using comorbidity signatures in electronic health records. Nat Med 28, 2107–2116 (2022). https://doi.org/10.1038/s41591-022-02010-y

n=~3M

AUC~90%

Likelihood ratio ~30

Conventional AI/ML attempts to model the physician

AI in IPF Research

Co-morbidity Patterns
No data demands
Use whatever data is already in patient file

ICD administrative codes

IPF

ILD

target codes appear

Past medical history

No target codes appear

case

control

2yrs

1YR

Use ICD codes to determine cases and controls

IPF drugs prescribed

Signature of IPF diagnostic sequence

pirfenidone or nintedanib

age > 50 years
at least two IPF target codes identified at least 1 month apart
chest CT procedure (ICD-9-CM 87.41 and Current Procedural Terminology, 4th Edition, codes 71250, 71260 and 71270) before the first diagnostic claim for IPF
no claims for alternative ILD codes occurring on or after the first IPF claim

target codes appear

Past medical history

No target codes appear

case

control

2yrs

1YR

ICD Codes can be noisy

"cases" are not always true IPF

Truven MarketScan (IBM)
Commerical Claims & Encounters Database
2003-2018

>100M patients visible

>7B individual claims

>87K unique diagnostic codes

>7% Medicare data present

2,053,277 patients included in study

University of Chicago Medical Center 
2012-2021

68,658 patients

Random sample from Optumlabs Data Warehouse courtsey Mayo Clinic

861,280 patients

2,983,215 patients

very likelihood ratios achieved irrespective of subgroup

performance tables

Out-of-sample Results

specificity ~99%

NPV >99.9%

IPF

ILD

Comorbidity Spectra

patient A

patient B

patient C

lesson 1

Beyond "risk factors" to personalized risk patterns

False Positives:

Heathcare Capacity

Ethics:

Risk from Imaging Tests

For every 20-30 flags,

1 is positive

General likelihood ratio 60-80
PPV 3.5-5%

Notifying patients 4 years early?

No cure, why screen

minimal

acceptable?

Better outcomes

Collard, Harold R., Alex J. Ward, Stephan Lanes, D. Cortney Hayflinger, Daniel M. Rosenberg, and Elke Hunsche. "Burden of illness in idiopathic pulmonary fibrosis." Journal of medical economics 15, no. 5 (2012): 829-835.

Early anti-fibrotic therapy seems increasingly promising

Better shot at lung transplant

Early dx reduces hospital-izations by a factor of 1-3

Clinical Trial Cohort Selection

Current screen failure rate ~50-60%

ZCoR boosted screen failure rate ~20%

Longitudinal history is important

lesson 2

Off-the-shelf AI does not suffice

lesson 3

Leveraging Longitudinal Patterns

Specialized HMM models from code sequences

Model control and case cohorts seprately

given a new test case, compute likelihood of sample arising from case models vs control models

sequence likelihood defect

Future

ZCoR 2.0

Deploy as an Epic App

primary care

secondary care

ZCoR

clinical notes

imaging analytics

Optimize implementation

ZeD Lab: Predictive Screening from Comorbidity Footprints

Nature Medicine

JAHA

CELL Reports

Science Adv.

The ZCoR Approch: Rapidly Re-targettable

	ZED performance	Competition
Autism	>80% AUC at 2 yrs	"obvious"
Alzheimer's Disease	~90% AUC	60-70% AUC
Idiopathic Pulmonary Fibrosis	~90% AUC	NA
MACE	~80% AUC	~70% AUC
Bipolar Disorder	~85% AUC	NA
CKD	~85% AUC	NA
Cancers (Prostate, Bladder, Uterus, Skin)	~75-80% AUC	Low

Deploy all/many/most of these!

1 in 59

Autism Spectrum Disorder

national median diagnosis age > 4yrs

MCHAT/F

questionnare completed in wellness-visits

poor sensitivity

poor specificity

ACoR

<3 yrs

Data: Onishchenko etal. Science Advances 2021

Children with ASD experience higher co-morbidities

Can we exploit these patterns to predict diagnosis?

Common Knowledge: Comorbidties Exist

source: IBM Marketscan data

Autism Co-morbid Risk (ACoR) Score

Joint Operation with MCHAT/F

reduce false positives by 50%

boost sensitivity by 100%

Data: Onishchenko etal. Science Advances 2021

MCHAT/F

standalone operation

Alzheimer's Disease and Related Dementia*

* in press

>5 Million in US. >13 Million in next 10 years

Alzheimer's Disease and Related Dimentia

MOCA, Blood Tests

Current Practice:

state of art with EHR:

~67% AUC*

ZCoR: ~87%

Alzheimer's Disease and Related Dimentia

state of art with EHR:

~67% AUC*

ZCoR: ~87%

Preempting ADRD accurately upto a decade in future

Applicable To Screening for Mild Cognitive Impairment

Clinical Trial Participant Selection

Current screen-failure rate: 80-90%

Estimated rate with ZCoR:

40%

Application to Suicide Attempts and Ideation (SISA):

Surprising connection between mood disorders and physiological comorbidities

Gibbons RD, Kupfer D, Frank E, Moore T, Beiser DG, Boudreaux ED. Development of a Computerized Adaptive Test Suicide Scale-The CAT-SS. J Clin Psychiatry. 2017 Nov/Dec;78(9):1376-1382. doi: 10.4088/JCP.16m10922. PMID: 28493655.

* in press

Application to Malignant Neoplasms*

Melanoma

Melanoma has a high survival rate of over 90% when treated early. But if it progresses to later stages, the survival rate drops significantly. Identifying potentially life-threatening melanomas is crucial.

* in press

Clip from Joe Rogan's Podcast aired July 2022 for educational/non-commercial purposes under fair use guidelines.

The Problem of Free Will

social

behavior

prediction

Predict the spatio-temporal event risk
NOT individual people
CANNOT be used for individual "pre-arrests"

No manual selection of factors
No "lists"
Uses only de-identified data

Can use the models to "audit" the state
Identify and reveal enforcement biases
Democratization of AI

The Underlying Math

Hotspots?

Very different from past efforts

Not based on standard "Deep Learning"

The Underlying Math

Not based on standard "Deep Learning"

Forecasting rare events in multi-variable stochastic evolution requires new modeling architecture

Learn local "activation functions" as symbolic probabilistic transducers

Assemble these local predictors into a "fractal net"

Applies to any rare/extreme event phenomena

Ishanu Chattopadhyay, Yi Huang, James Evans et al. Deep Learning Without Neural Networks: Fractal-nets for Rare Event Modeling, 26 October 2020, PREPRINT (Version https://doi.org/10.21203/rs.3.rs-86045/v1

93% accuracy

87% AUC

~70% specificity at ~80% sensitivity

Chicago Predictive Performance

10 actual crimes:

11 predicted:

8 correct:

2 missed:

3 false alarms

1 Week in advance

Within ~2 city blocks

ONLY Past eventlog as input

Triangles: actual events

heatmap: predicted risk 3 days ahead

Jan 1 2016 - April 1 2019

Philadelphia

100 crimes

Raise 103 flags

90 correct flags

13 false positives

10 missed

Could we have predicted this?

Double homicide

Jan 7 2019

Triple homicide incident

Jan 7 2019

https://www.inquirer.com/crime/kensington-triple-shooting-homicide-philadelphia-police-20190107.html

Triangles: actual events

heatmap: predicted risk 3 days ahead

Not just a predictor

Digital Twin of social interactions

Predict policy effects

Precise predictation

Problem: Can we predict the next pandemic?

Can we predict future mutations? Can we define the "edge of emergence"?

Digital Twins for complex systems

PREEMPT

\Phi_i:\prod_{j \neq i} \Sigma_j \rightarrow \mathcal{D}(\Sigma_i)

Q-Net

recursive forest

$$J \textrm{ is the Jensen-Shannon divergence }$$

Theorem

Sanov's Theorem & Pinsker's Inequality

\left \vert \ln \frac{Pr(x \rightarrow y ) }{Pr( y \rightarrow y)} \right \vert \leqq \beta \theta(x,y)

{\theta(x,y) \triangleq \mathbf{E}_i \left ( \mathbb{J}^{\frac{1}{2}} \left (\Phi_i^P(x_{-i}) , \Phi_i^Q(y_{-i})\right ) \right )}\\

q-distance

smaller $\theta$ implies higher risk

Influenza Risk Assessment Tool (IRAT) scoring for animal strains

Can we replicate IRAT scores*?

slow (months), quasi-subjective, expensive

*https://www.cdc.gov/flu/pandemic-resources/monitoring/irat-virus-summaries.htm

Emergenet: finding emergence risk of animal strains

CDC published 24 scores in 10 years

Emergenet time: 6 seconds

IRAT: months to compute 1 score

BioNorad

Problem: Can AI predict how we think and interact?

Can we predict how opinions evolve?

Digital Twins for complex systems

YFA 2020

Can an AI tell if you are lying?

Can an AI tell how you are going to vote?

Yang, David, James EVans, and Ishanu Chattopadhyay. "‘Its the Economy Stupid’: Predictive Theory of Belief Shift Connecting Economic Stress to Societal Polarization." (2023).

Modeling Responses to PTSD Evaluation

The Cognet Framework

Digital Twin of Opinion dynamics

predict worldviews from incomplete data

Identify malingering in psychiatric diagnoses

Future

Grants Pending

Vision

ZCoR-IPF 2.0 (CDMRP, PRMP)
Continuous monitoring of psychological health (CDMRP, TBIPH)
Reconfigurable Universal Screening (PCORI)
Early SCreening for ADRD (CDMRP, PRARP)
Universal EHR-integrated SCreening fro Autism (NIH, CDMRP)
AI-enabled Primary-care Screening for Aggressive Melanoma (CDMRP)
PIPP Phase 2 NSF
ARPA-H (multiple)

Transform bio-surveillance

Transform modeling of complex systems

Transform early diagnosis

Democratize AI unleashing its power for social good