Anne next
Technical overview for non-techies
What's anne?
First, a question:
From a techie's perspective:
- Client-facing software (AnneNext, "the face")
- "The Dashboard"
- A bunch of servers
- Speech and the libraries around it
- Telemetry
- AI and user-focused machine learning
the client
...or what the users see
Built with
- Electron
- Node
- JavaScript
- ...and around ~3000 other
smaller libraries
may be the most "visible" part...
but it's only around 40% of the code
the dashboard
...or what makes anne behave one way or another
The dashboard is:
- The other client-facing part
- What makes Anne "tick"
- Allows us to be very user-centric:
Their Anne is their own, and they
get to decide how she behaves. - Anything can be made a setting,
and in time, most things will.
The dashboard HAS:
- Settings for every module
- Speech settings
- Fine-grained permissions
- Content / media control
the server bunch
...or the technical mumbo-jumbo that makes it all work together
server-side anatomy
- Amazon services - authentication, data storage, automation, settings (AKA an "API", but that's quite
innacurate here). - Miniflux - the server software we use for news.
- Nextcloud - the server software we use for calendars (and potentially much more).
more importantly...
- The thing that makes the dashboard work...
- Which in turn allows the AnneNext client to work.
- In a way, the "heart" of the operation.
the speech stack
...asr, tts, Classifiers and other big words. simplified, i promise :)
TTS - text to speech
- You send it text, it speaks it out
- It can be in various languages
- It can use various voices
- And we aim to make it all configurable
(via the dashboard)
ASR - automatic speech recognition
- AKA Speech to text (STT)
- You speak to it, and it turns that speech into text
- Also supports various languages
- We also aim to make it configurable via
the dashboard
the avatar library
- The part that ties ASR, TTS and the AnneNext client together.
- Where ASR brings recognition of phrases, the library makes the client "understand" the phrases - what should be shown on screen, what should Anne say back, etc.
- This is a part of the local AI (Anne comes with multiple AI elements).
A short recap
- AnneNext client reads settings from servers
- Dashboard writes settings to servers
- ASR recognizes speech
- TTS makes text into speech
- The Avatar library makes ASR and TTS
"talk" to the AnneNext client.
telemetry & AI
...or how we get to know whether a user likes something. and much more!
the data we collect...
- Is completely anonymized*
- We collect it to learn from it:
- How to improve Anne as a product
- How to improve Anne for the user
- Make sure the user is doing good
- Find ways to make their lives easier
the data we collect...
- Allows caregivers to track well-being
- Track trends across time
- Provide meaningful insights via our AI /
Analysis tools
the data we collect...
- Is a vital part of the global AI
(the one that runs on servers) - Will ultimately allow us to adapt
Anne to the users needs - "Seeing" a user's habits, Anne will
make herself more useful to
said user - The more they use her, the better
she gets
thank you!
QUESTIONS?
AnneNext overview
By Darko Bozhinovski
AnneNext overview
- 274