Teleconferencing in Spatial Audio
A thesis by Jackson Goode
Teleconferencing
is here to stay
But it's often a tiring affair...
- "Zoom fatigue" is real
- Reduced dimensionality
- Poor quality
Latency, network reliability, visual and audio fidelity can all contribute to a fatiguing experience
But software can also play a major role
Issues in Teleconferencing
Don't reinvent the wheel - instead focus on one critical component
How can the treatment of audio bring us to the goal of telepresence, and closer to realistic conversations
Replicate how real interactions take place
-
Spatial model, spatial audio
- Visual-aural coherency
- Binaural audio
-
Benefits from the literature
- Lateralizing audio can improve intelligibility
- Disentangle double-talk
- And more: reduce cognitive load, improve comprehension, and is generally more favorable
Jitsi Meet
An open source video conferencing platform
WebAudio
An API in Javascript for working with audio
Implementation
- Capturing participants' audio streams (WebRTC)
- Head-related transfer functions via PannerNode
- Dynamic processing of participants
A short demonstration
Validating with
a user study
There was support for four hypothesis of perceived metrics:
- Decreased cognitive effort
- Increased social presence
- Increased vocal intelligibility
- Increase in opinion score
And the future?
Toward a Telepresence of Sound
By jacksongoode
Toward a Telepresence of Sound
- 43