Title:
A Novel Approach to Streaming and Client Side Rendering of Multichannel Audio with Synchronised Metadata
A Novel Approach to Streaming and Client Side Rendering of Multichannel Audio with Synchronised Metadata
dc.contributor.author | Paradis, Matthew | |
dc.contributor.author | Pike, Chris | |
dc.contributor.author | Day, Richard | |
dc.contributor.author | Melchior, Frank | |
dc.contributor.corporatename | BBC Research & Development | en_US |
dc.date.accessioned | 2016-03-24T14:41:02Z | |
dc.date.available | 2016-03-24T14:41:02Z | |
dc.date.issued | 2016-04 | |
dc.description | Presented at the 2nd Web Audio Conference (WAC), April 4-6, 2016, Atlanta, Georgia. | en_US |
dc.description.abstract | Object based audio broadcasting is an approach which combines audio with metadata that describes how the audio should be rendered. This metadata can include spatial positioning mixing parameters and descriptors to define the type of audio represented by the object. In this talk we show an approach to enabling the streaming of multichannel audio and synchronised metadata to the browser. Audio is rendered in the browser to multiple formats based on the information contained in the synchronised metadata channel. This allows adaptive mixing and rendering of content and user interaction. Based on the MPEG/DASH standard this approach allows an arbitrary number of audio channels to be presented as discrete inputs to the Web Audio API (dependent on any channel limit imposed by the browser). Binaural, 5.1 and stereo renders can be generated and selected for output by the user in real time without any change to the source media stream. Channels marked as being interactive can have their properties exposed to the user to adjust based on their preferences. The audio and metadata is originated from a single BWF file compliant with ITU-R BS 2076 (Audio Definition Model) with the audio being encoded using AAC (as per the MPEG/DASH standard) and the metadata represented in JSON format to the browser. This approach provides a flexible framework for the prototyping and presentation of new audio experiences to online audiences and provides a platform for delivery object based audio to online users. | en_US |
dc.embargo.terms | null | en_US |
dc.identifier.citation | Paradis, M., et al. "Object based audio broadcasting is an approach which combines audio with metadata that describes how the audio should be rendered. This metadata can include spatial positioning mixing parameters and descriptors to define the type of audio represented by the object. In this talk we show an approach to enabling the streaming of multichannel audio and synchronised metadata to the browser. Audio is rendered in the browser to multiple formats based on the information contained in the synchronised metadata channel. This allows adaptive mixing and rendering of content and user interaction. Based on the MPEG/DASH standard this approach allows an arbitrary number of audio channels to be presented as discrete inputs to the Web Audio API (dependent on any channel limit imposed by the browser). Binaural, 5.1 and stereo renders can be generated and selected for output by the user in real time without any change to the source media stream. Channels marked as being interactive can have their properties exposed to the user to adjust based on their preferences. The audio and metadata is originated from a single BWF file compliant with ITU-R BS 2076 (Audio Definition Model) with the audio being encoded using AAC (as per the MPEG/DASH standard) and the metadata represented in JSON format to the browser. This approach provides a flexible framework for the prototyping and presentation of new audio experiences to online audiences and provides a platform for delivery object based audio to online users." (ABSTRACT). In Jason Freeman, Alexander Lerch, Matthew Paradis (Eds.), Proceedings of the 2nd Web Audio Conference (WAC-2016), Atlanta, 2016. ISBN: 978-0-692-61973-5 | en_US |
dc.identifier.isbn | 978-0-692-61973-5 | |
dc.identifier.uri | http://hdl.handle.net/1853/54661 | |
dc.publisher | Georgia Institute of Technology | en_US |
dc.relation.ispartofseries | Web Audio Conference ; 2016 | |
dc.rights | Licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0). | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Object based audio broadcasting | en_US |
dc.subject | Audio metadata | en_US |
dc.subject | Web audio | en_US |
dc.title | A Novel Approach to Streaming and Client Side Rendering of Multichannel Audio with Synchronised Metadata | en_US |
dc.type | Text | |
dc.type | Moving Image | |
dc.type.genre | Abstract | |
dc.type.genre | Proceedings | |
dc.type.genre | Presentation | |
dspace.entity.type | Publication | |
local.contributor.corporatename | School of Music | |
local.contributor.corporatename | College of Design | |
local.relation.ispartofseries | Web Audio Conference | |
relation.isOrgUnitOfPublication | 92d2daaa-80f2-4d99-b464-ab7c1125fc55 | |
relation.isOrgUnitOfPublication | c997b6a0-7e87-4a6f-b6fc-932d776ba8d0 | |
relation.isSeriesOfPublication | 9254e016-2352-47b3-9b98-bc01c2fbe242 |
Files
Original bundle
1 - 4 of 4
- Name:
- WAC2016-53.pdf
- Size:
- 88.4 KB
- Format:
- Adobe Portable Document Format
- Description:
- Abstract
No Thumbnail Available
- Name:
- ANovelApproach.mp4
- Size:
- 153.36 MB
- Format:
- MP4 Video file
- Description:
- Download video
No Thumbnail Available
- Name:
- ANovelApproach_videostream.html
- Size:
- 985 B
- Format:
- Hypertext Markup Language
- Description:
- Streaming Video
No Thumbnail Available
- Name:
- Transcription.txt
- Size:
- 13.4 KB
- Format:
- Plain Text
- Description:
- Transcription
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 3.13 KB
- Format:
- Item-specific license agreed upon to submission
- Description: