|
Alistair Wilson Caltech Experimental Summer School June 2025 |
David
Danz
Pittsburgh
Lise
Vesterlund
Pittsburgh
Different types of beliefs we might care about:
Measured belief here is used in a regression:
See Benjamin (2019) for a detailed literature review
Measured belief used in a Grether regression:
The main challenge in belief elicitation is that we need to be correct in our theoretical assumptions over the mechanism, at the individual level
Let's look at eliciting a probability of an event \(E\) (a binary outcome)
Score on \(E\)
Score on \(E^c\)
\(s_E(\theta)\)
\(s_{E^c}(\theta)\)
\(U(\mathcal{L}_\theta(q)|\theta)\)
\(U(\mathcal{L}_\theta(\theta)|\theta)\)
| Rule | ||
|---|---|---|
| Quadratic | ||
| Spherical | ||
| Logarithmic |
\(s_E(q)\)
\(s_{E^c}(q)\)
\(1-(1-q)^2\)
\(1-(0-q)^2\)
\(\frac{q}{\sqrt{q^2+(1-q)^2}}\)
\(\frac{1-q}{\sqrt{q^2+(1-q)^2}}\)
\(\ln(q)\)
\(\ln(1-q)\)
| Row # | Option A | Option B |
|---|---|---|
| 0 | $X on E | $X with 0% chance |
| 1 | $X on E | $X with 1% chance |
| 2 | $X on E | $X with 2% chance |
| 3 | $X on E | $X with 3% chance |
| 99 | $X on E | $X with 99% chance |
| 100 | $X on E | $X with 100% chance |
\(\vdots\)
\(\vdots\)
\(\vdots\)
| Row # | Option A | Option B |
|---|---|---|
| 0 | $X on E | $X with 0% chance |
| 1 | $X on E | $X with 1% chance |
| 2 | $X on E | $X with 2% chance |
| 3 | $X on E | $X with 3% chance |
| 99 | $X on E | $X with 99% chance |
| 100 | $X on E | $X with 100% chance |
\(\vdots\)
\(\vdots\)
\(\vdots\)
| Row # | Option A | Option B |
|---|---|---|
| 0 | $X on E | $X with 0% chance |
| 1 | $X on E | $X with 10% chance |
| 2 | $X on E | $X with 20% chance |
| 3 | $X on E | $X with 30% chance |
| 99 | $X on E | $X with 90% chance |
| 100 | $X on E | $X with 100% chance |
\(\vdots\)
\(\vdots\)
\(\vdots\)
Fraction not reporting induced prior
Expected deviation by induced prior
Reduced distortions, and no differential effect across risk aversion (here using a meta-study dataset from Danz et al. 2024)
Each belief scenario consists of three seperate elicitations.
| Property | Information |
|---|---|
| Dominant Strategy | |
| Payoff Description | |
| Payoff Slider | |
| Feedback |
✅
✅
✅
✅
✅
✅
✅
✅
Instructions:
The payment rule is designed so that you can secure the largest chance of winning the prize by reporting your most-accurate guess.
Slide summarizing instructions:
(literally the last thing they see
before they begin making decisions)
At the end of each round:
Only 15 percent of participants consistently report induced prior
Only 15 percent of participants consistently report induced prior
84% chance
83% chance
| Stated Belief on Red | Chance to Win if Red | Chance to Win if Blue |
|---|---|---|
| 1 | 100% | 0% |
| 0.9 | 99% | 19% |
| 0.8 | 96% | 36% |
| 0.7 | 91% | 51% |
| 0.6 | 84% | 64% |
| 0.5 | 75% | 75% |
| Reporting rule | Earnings |
|---|---|
| Truthful | $6.27 |
| Middle | $6.00 |
| Random | $5.33 |
| Minimizing | $2.88 |
Near-extreme
Center
Distant-extreme
Proportion of non-centered reports in each bin:
Incentives
Failure to reduce compound lottery
flatness
asymmetry
Vary information on incentives:
RCL-calculator : Aid reduction of compound lottery
No-Information: Eliminate quantitative information on incentives
✅
✅
✅
✅
❌
✅
✅
✅
✅
❌
| Property | Information | RCL | No-Information |
|---|---|---|---|
| Dominant Strategy | | ||
| Payoff Description | | ||
| Payoff Slider | | ||
| Feedback | | ||
| RCL calculator | |
✅
❌
❌
❌
❌
✅
✅
✅
✅
✅
| Treatment | Source of false reports |
|---|---|
| Information | confusion, BSR incentives, failed RCL |
| RCL | confusion, BSR incentives |
| No Information | confusion |
Truthful reporting greatest w/o incentive information
Near-extreme
Center
Distant-extreme
Proportion of non-centered reports in each bin:
Inf:
RCL:
NoInf:
To understand the effects on inference we use a simple model of the center-bias distortions
Observed belief is:
Regression model where X is a binary treatment indicator:
What happens when distorted beliefs are used?
Observed belief is:
Left-hand-side effect is clear:
Observed belief is:
Right-hand-side treatment effect will depend on unknowns:
We return to the Niederle & Vesterlund study:
Run this study twice:
LHS: Confidence difference between men and women:
RHS: Competition difference for men and women after controlling for confidence:
Information predicted to attenuate gender confidence difference
Information predicted to make the gender-gap in tournament-entry larger (after controlling for confidence
Original finding is that:
0
100
20
80
Suppose that forming a clearer belief requires effort
0
100
20
80
Supposing that forming a clearer belief requires some effort
Ans: 56.25%
Working project with Brandon Williams:
Ten rounds: Elicit cost for an easy task, or a Hard one plus amount \(\$X\)
LHS:
Constant
Difficulty
RHS:
Varying
Difficulty
Always
Pays $.50
If Correct
$X
If Correct
Choose
$X threshold
\(\text{OLS }(\text{Effort})\)
\(\text{Tobit}(\text{WTA})\)
\(\text{Logit }(\text{Correct})\)
Output
Effort
WTA
Our calibration demonstrate that the task responds to effort, is costly, and in a set of other experiments, we obtain their low-effort guesses
After 15 second
After 45 second
Use four incentives to ask about beliefs in ten different urns:
Model:
Participants told:
Are then asked for the proportion of blue balls given a dot:
We then elicit the costs for performing the Bayesian update calculation:
Calc or Count
Calc only
Proportion of Blue dotted tokens
Can they perform each task?
Elicit how much they require to do the task:
For those who can do the calculation
For those who can't
Avg: $0.82
Avg: $0.99
For beliefs about a number, avoid eliciting the mean consider eliciting the mode (or modal interval) instead
For beliefs about a frequency, consider eliciting the probability of a single, randomly chosen outcome
Consider coarser elicitations
Use a pricelist for probabilities; however, if committed to a scoring rule use the BQSR