Chris Baume
Senior Research Engineer
Whittaker, Steve, et al. "SCANMail: a voicemail interface that makes speech browsable, readable and searchable." Proceedings of the SIGCHI conference on Human factors in computing systems. ACM, 2002.
Casares, Juan, et al. "Simplifying video editing with SILVER." CHI'02 Extended Abstracts on Human Factors in Computing Systems. ACM, 2002.
Whittaker, Steve, and Brian Amento. "Semantic speech editing." Proceedings of the SIGCHI conference on Human factors in computing systems. ACM, 2004.
Berthouzoz, Floraine, Wilmot Li, and Maneesh Agrawala. "Tools for placing cuts and transitions in interview video." ACM Trans. Graph. 31.4 (2012): 67-1.
Rubin, Steve, et al. "Content-based tools for editing audio stories." Proceedings of the 26th annual ACM symposium on User interface software and technology. ACM, 2013.
Sivaraman, Venkatesh, Dongwook Yoon, and Piotr Mitros. "Simplified Audio Production in Asynchronous Voice-Based Discussions." Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems. ACM, 2016.
Shin, Hijung Valentina, Wilmot Li, and Frédo Durand. "Dynamic Authoring of Audio with Linked Scripts." Proceedings of the 29th Annual Symposium on User Interface Software and Technology. ACM, 2016.
"Speech Editor Prototype"
"Discourse" / "Dialogger"
"The Magic Pen"
[
{
"start": 5.58,
"end": 5.83,
"confidence": 0.4,
"word": "hello",
"punct": "Hello"
},
{
"start": 5.85,
"end": 6.08,
"confidence": 0.49,
"word": "world",
"punct": "world."
]
JSON
<a data-start="5580" data-end="5830" data-next="5850" data-content="00:00:05">Hello </a>
<a data-start="5850" data-end="6080" data-next="6120" data-content="00:00:05">world. </a>
HTML
{
"grain_type": "event",
"source_id": "fa15e306-ede6-4f7f-8025-3ef4191c9e13",
"flow_id": "f058bf49-fc5b-4a05-86be-5fd4e0bf8b9a",
"origin_timestamp": "1471604633:632000000",
"sync_timestamp": "1468420295:0",
"creation_timestamp": "1471604633:632000000",
"event_payload": {
"type": "urn:x-ipstudio:format:event.transcript",
"topic": "/sources/64313f2c-fd6f-46a5-9f3b-1ce92a97c20f",
"path": "/segments/db40db76-532f-48e1-93ec-bf0f6a8f1730/
utterance/cc7e04cc-5aed-44c7-851a-79414aee565f",
"pre": {},
"post": {
"word": "hello",
"punctuated_word": "Hello",
"confidence": 0.4
}
}
}
JSON
Requirements:
Solution:
in no particular order...