Teleconferencing in Spatial Audio

A thesis by Jackson Goode

Teleconferencing is here to stay

But it's often a tiring affair...

  • "Zoom fatigue" is real
  • Reduced dimensionality
  • Poor quality

 

Latency, network reliability, and visual and audio fidelity can all contribute to a fatiguing experience

 

But software can also play a major role

Issues in Teleconferencing

Don't reinvent the wheel; instead, focus on one critical component


How can the treatment of audio bring us closer to the goal of telepresence and to realistic conversations?

Replicate how real interactions take place

  • Spatial model, spatial audio
    • Visual-aural coherency
    • Binaural audio
  • Benefits from the literature
    • Lateralizing audio can improve intelligibility
    • It can help disentangle double-talk
    • And more: it reduces cognitive load, improves comprehension, and is generally viewed more favorably

Jitsi Meet

An open source video conferencing platform

WebAudio

A JavaScript API for working with audio

Implementation

 

  • Capturing participants' audio streams (WebRTC)
  • Head-related transfer functions via PannerNode
  • Dynamic processing of participants
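The three steps above can be sketched roughly as follows. This is a minimal illustration and not the thesis's actual Jitsi Meet integration: the helper names `arcPositions`, `spatialize`, and `layout`, the 120-degree frontal arc, and the unit radius are all assumptions made for the example. Only `createMediaStreamSource`, `createPanner`, and the `PannerNode` properties come from the WebAudio API itself.

```javascript
// Hypothetical helper: spread `count` participants evenly across a frontal
// arc and return a listener-relative position for each one.
// Assumes the default WebAudio listener: at the origin, facing -z, +x to the right.
function arcPositions(count, arcDegrees = 120, radius = 1) {
  const positions = [];
  for (let i = 0; i < count; i++) {
    // Fraction across the arc; a lone participant sits in the centre.
    const t = count === 1 ? 0.5 : i / (count - 1);
    const azimuth = ((t - 0.5) * arcDegrees * Math.PI) / 180;
    positions.push({
      x: radius * Math.sin(azimuth),  // negative = left, positive = right
      y: 0,                           // ear height
      z: -radius * Math.cos(azimuth), // negative = in front of the listener
    });
  }
  return positions;
}

// Route one participant's MediaStream (e.g. received over WebRTC)
// through a PannerNode that uses head-related transfer functions.
function spatialize(audioContext, stream) {
  const source = audioContext.createMediaStreamSource(stream);
  const panner = audioContext.createPanner();
  panner.panningModel = 'HRTF';
  source.connect(panner);
  panner.connect(audioContext.destination);
  return panner;
}

// Dynamic processing: recompute every panner's position so the arc
// stays evenly filled as the participant list changes.
function layout(panners) {
  const positions = arcPositions(panners.length);
  panners.forEach((panner, i) => {
    panner.positionX.value = positions[i].x;
    panner.positionY.value = positions[i].y;
    panner.positionZ.value = positions[i].z;
  });
}
```

In this sketch, `spatialize` would be called once per incoming audio stream and `layout` whenever a participant joins or leaves, so that each voice keeps a distinct direction.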

A short demonstration

Validating with a user study

There was support for four hypotheses about perceived metrics:

  • Decreased cognitive effort
  • Increased social presence
  • Increased vocal intelligibility
  • Increased opinion score

 

 

And the future?

Toward a Telepresence of Sound

By jacksongoode
