Vittorio Iocco
Sheng Long
Mara Ulloa
Connor Mayberry
TK Kwon
How Did COVID-19 Affect CTA Ridership Across Chicagoland?
Domain Task
Q: Is there any correlation between ridership changes during COVID19 and socioeconomic status?
Communication Goal
\(\to\) Demonstrate visually that the impact of COVID-19, measured by changes in CTA ridership between 2019, 2020, and 2021, is not felt the same across stations with different socioeconomic backgrounds
Changes in aggregate weekly ridership across all stations
Changes in aggregate yearly ridership per station
Changes in aggregate weekly ridership across all stations
Changes in aggregate yearly ridership per station
- Line graph 📉 to show substantial decrease
- Animation (🔎) to emphasize the scale of events
- Scatter plot
- Each dot \(\to\) 🚉
- X-axis: income, % white population, % bachelor's degree
- Y-axis: % change in aggregate ridership
- color: the CTA line they belong to
Line graph
2019-1-02 to 2021-12-01
2011-01-02 to 2021-12-01
Scatter plot
Narration
-
Martini glass
-
answer question with graphs
-
user exploration in the end
-
-
Scrolly-telling
-
Animation allows us to show the sheer amount of stations
-
Thank you!
Now, demo time!
The general model
A collection of \(n\) items
-
For each item \(i\in [n]\), there is
an associated score \(\tau_i\)
an associated quality \(\omega_i \in \mathbb{R}\)
-
\(\pi: [n] \to [n]\) denotes a permutation;
\(\pi(i)\) denotes the item with the \(i\)th largest score
The general model cont'd
-
\(M_{ij}\in (0,1)\): the probability that item \(i\) wins the comparison against item \(j\)
"noise"
usually takes the form of a function \(\Phi\) (or \(F\)) in parametric models
could also be expressed as \(p(i, j)\)
-
By definition,
\(M_{ij}+M_{ji}=1\) -- every comparison has one winner
\(M_{ii} = 1/2\) -- completeness
Scores
-
Borda score \[\tau_i := \frac{1}{n}\sum_{j=1}^n M_{ij}\]
Interpretation: ...
Bradley-Terry-Luce (BTL) \[\tau_i := \exp(\omega_i)\]
Noise and Comparison
-
Possible extension of \(M_{ij}\): (if \(i\) ranks higher than \(j\))
\(M_{ij} = 1/2 + \gamma, \gamma \in (0, 1/2]\)
-
Comparison pairs can be chosen non-adaptively (offline) or adaptively (online)
"Adaptive" in the sense that pairs chosen for comparison is based on previous results
Non-adaptive example: uniform random
Adaptive example: Knockout tournament
A graphical illustration
- complete rank
- top \(k\) items / approximate rank
quality \(\omega_i\)
1
2
\(n\)
...
adaptive
non-adaptive
pairwise comparisons
noise \(M_{ij}\)
score \(\tau_i\)
e.g. Borda score
e.g. difficulty level of question answered
\(={n \choose 2}\)
\(<{n \choose 2}\)
Parametric Models
Requires \(M_{ij}\) to be a specific function of the difference between item \(i\) and item \(j\)'s qualities
-
That is, \[M_{ij} = F(\omega_i - \omega_j)\;\; \forall i, j \in [n]\] such that the function \(F:\mathbb{R} \to [0, 1]\) is a strictly increasing and continuous CDF
We also use \(\Phi\) sometimes to represent \(F\)
\(F\) is typically assumed to be known
Parametric Model Properties
-
By construction, any parametric model has the following property:
If \(\omega_i > \omega_j\) for some pair of items \((i,j)\), then we are also guaranteed that \(M_{il} > M_{jl}\) for every item \(l\)
From this property, we have the ranking induced by the scores \(\{\tau_i\}_{i=1}^n\) is equivalent to that induced by \(\{\omega_i\}_{i=1}^n\)
Bradley-Terry-Luce (BTL)
A specific case of the parametric model
Recall that the probability that \(i\) wins over \(j\) is given by \[P(\tau_i>\tau_j) = \frac{\tau_i}{\tau_i+\tau_j}\]
If we define the scoring function to be \(\tau_i = \exp(\omega_i)\), then \[M_{ij}=P(\tau_i>\tau_j)=\frac{e^{\omega_i}}{e^{\omega_i}+e^{\omega_j}}=\frac{1}{1+e^{-(\omega_i-\omega_j)}}\]\[=F(\omega_i - \omega_j)\]
TrueSkill / ELO (chess)
A more concrete example
Each player has skill \(s_i\) (quality \(\omega_i\))
-
Each player has performance \(p_i\) (score \(\tau_i\))
\(p_i \sim \mathcal{N}(s_i, \beta^2)\)
Probability that player \(i\) wins player \(j\): \[P(p_i > p_j|s_i, s_j) = \Phi\left(\frac{s_i - s_j}{\sqrt{2}\beta}\right) (M_{ij})\]
\(\Phi\) is CDF of \(\mathcal{N}(0, 1)\)
Mallows
\(F\) is now the Gaussian CDF
Non-parametric Model
Strong Stochastic Transitivity (SST)
A superset of parametric models
Does not assume the existence of a quality vector \(\omega \in \mathbb{R}^n\)
Does not assume the existence of any specific form for \(M_{ij}\)
Assumes the existence of the total ordering of \(n\) items
Strong Stochastic Transitivity (SST) cont'd
Assumes \(M_{il} \geq M_{jl}\) for every pair \((i, j)\) in which \(i\) is ranked above \(j\) in the total ordering and every item \(l\) (Shah and Wainwright 2018)
-
Alternative definition: (Falahatgar et al. 2017)
\(M_{il} \geq \max (M_{ij}, M_{jl})\)
Exact recovery of top \(k\) items
depends on the degree of separation between \(k\) ranked and \((k+1)\) ranked item
-
Define \(k\)-separation threshold:
\(\Delta_k(M) := \tau_{(k)}(M) - \tau_{(k+1)}(M) \)
Approximate ranking
Approximate in the sense that the top \(k\) items obtained has a Hamming error between itself and the true ordering
Hamming distance / error between two sets \(A\) and \(B\) is defined as \[D_H=|(S \cup S^*) \setminus (S \cap S^*)|\]
Separation threshold between \(k\)-th ranked item and \((k+1)\)-th ranked item
Approach I'd like to take
Assume parametric model -- not too sure
Use Borda score
Assume adaptive comparison pair selection ? maybe start with non-adaptive, uniform randomly chosen
end goal: learn the best possible rank with the fewest possible sample complexity
CS Interactive Info Vis project
By Sheng Long
CS Interactive Info Vis project
- 51