Yofre H. Garcia, UNACH
Saúl Diaz-Infante Velasco
Jesús Adolfo Minjárez Sosa, UNISON
saul.diazinfante@unison.mx




https://slides.com/sauldiazinfantevelasco/sym_psp_xv


Argument. When there is a shortage of vaccines, sometimes the best response is to refrain from vaccination, at least for a while.

Hypothesis. Under these conditions, inventory management suffers significant random fluctuations
Objective. Optimize the management of vaccine inventory and its effect on a vaccination campaign

On October 13 2020, the Mexican government announced a vaccine delivery plan from Pfizer-BioNTech and other companies as part of the COVID-19 vaccination campaign.

Methods. Given a vaccine shipping schedule, we describe stock management with a backup protocol and quantify the random fluctuations due to a program under high uncertainty.
Then, we incorporate this dynamic into a system of ODE that describes the disease and evaluate its response.

Agent
action
state
reward


The effort invested in preventing or mitigating an epidemic through vaccination is proportional to the vaccination rate

Let us assume at the beginning of the outbreak:
Then we estimate the number of vaccines with
Then, for a vaccination campaign, let:

Then we estimate the number of vaccines with
Then, for a vaccination campaign, let:
Estimated population of Hermosillo, Sonora in 2024 is 930,000.
So to vaccinate 70% of this population in one year:









Base Model




After discretizise with a NSFDS



Stock degradation due to Temperature
Sources of uncertainty
- Random fluctuations in the stock replenishment
- Stochastic degradation due to temperature


Replenishment random perturbation
- order size
- time-delivery
Stock degradation due to Temperature





DALY (Disability-Adjusted Life Years)
For a given cause (c), sex (s), age (a), and year (t).
🩸 YLL — Years of Life Lost
Measures years of life lost due to premature death.
- N(c, s, a, t) — number of deaths due to cause c
- L(s, a) — standard life expectancy at age a for sex s
♿ YLD — Years Lived with Disability
Measures years lived with disease or disability.
- I(c, s, a, t) — incident cases for cause c
- DW(c, s, a) — disability weight for cause c
- L(c, s, a, t) — average duration of the case
DALY measures the total burden of disease, combining:
- 🕯️ Mortality (YLL) — early death
- 🧩 Morbidity (YLD) — time lived with illness or disability













Q-learning for multiple stages and uncertainty in policies and other terms related to epidemics.
Move to Reinforcement LearningOnésimo Hernández-Lerma, Leonardo R. Laura-Guarachi, Saul Mendoza-alacios, David González-Sánchez
An Introduction to Optimal Control Theory
The Dynamic Programming Approach
978-3-031-21138-6
Reinforcement learning and optimal control
Athena Sci. Optim. Comput. Ser.
Athena Scientific, Belmont, MA, 2019, xiv+373 pp.
ISBN: 978-1-886529-39-7


Powell, Warren B.
Reinforcement Learning and Stochastic Optimization: A Unified Framework for Sequential Decisions. United Kingdom: Wiley, 2022.
ISBN: 978-1-119-81505-1

Reinforcement Learning: An Introduction
Richard S. Sutton and Andrew G. Barto
Publisher: MIT Press
Year: 2018 (Second Edition)
ISBN: 978-0-262-03924-6




https://github.com/SaulDiazInfante/rl_vac.jl
https://slides.com/sauldiazinfantevelasco/sym_psp_xv


GRACIAS!!
XV Symposium of probability and stochastic processes
By Saul Diaz Infante Velasco
XV Symposium of probability and stochastic processes
Explore the dynamic programming and reinforcement learning through the insights of Dimitri P. Bertsekas. Discover techniques like HJB and rollout methods that can enhance your understanding and application in optimization!
- 43