Title:
Panoptic reconstruction of immersive virtual soundscapes using human-scale panoramic imagery with visual recognition

dc.contributor.author Huang, Mincong (Jerry)
dc.contributor.author Chabot, Samuel
dc.contributor.author Braasch, Jonas
dc.contributor.corporatename International Community for Auditory Display
dc.date.accessioned 2022-03-21T16:18:47Z
dc.date.available 2022-03-21T16:18:47Z
dc.date.issued 2021-06
dc.description Presented at the 26th International Conference on Auditory Display (ICAD 2021) 25-28 June 2021, Virtual conference.
dc.description Presented at the 26th International Conference on Auditory Display (ICAD 2021) 25-28 June 2021, Virtual conference.
dc.description.abstract This work, situated at Rensselaer's Collaborative-Research Augmented Immersive Virtual Environment Laboratory (CRAIVELab), uses panoramic image datasets for spatial audio display. A system is developed for the room-centered immersive virtual reality facility to analyze panoramic images on a segment-by-segment basis, using pre-trained neural network models for semantic segmentation and object detection, thereby generating audio objects with respective spatial locations. These audio objects are then mapped with a series of synthetic and recorded audio datasets and populated within a spatial audio environment as virtual sound sources. The resulting audiovisual outcomes are then displayed using the facility's human-scale panoramic display, as well as the 128-channel loudspeaker array for wave field synthesis (WFS). Performance evaluation indicates effectiveness for real-time enhancements, with potentials for large-scale expansion and rapid deployment in dynamic immersive virtual environments.
dc.identifier.doi https://doi.org/10.21785/icad2021.043
dc.identifier.uri http://hdl.handle.net/1853/66346
dc.publisher Georgia Institute of Technology
dc.publisher Georgia Institute of Technology
dc.publisher.original International Community on Auditory Display
dc.publisher.original International Community for Auditory Display (ICAD)
dc.relation.ispartofseries International Conference on Auditory Display (ICAD)
dc.rights Licensed under Creative Commons Attribution Non-Commercial 4.0 International License.
dc.rights.uri http://creativecommons.org/licenses/by-nc/4.0/
dc.subject Auditory display
dc.title Panoptic reconstruction of immersive virtual soundscapes using human-scale panoramic imagery with visual recognition
dc.type Text
dc.type.genre Proceedings
dspace.entity.type Publication
local.contributor.corporatename Sonification Lab
local.relation.ispartofseries International Conference on Auditory Display (ICAD)
relation.isOrgUnitOfPublication 2727c3e6-abb7-4df0-877f-9f218987b22a
relation.isSeriesOfPublication 6cb90d00-3311-4767-954d-415c9341a358
Files
Original bundle
Now showing 1 - 1 of 1
Thumbnail Image
Name:
ICAD_2021_43.pdf
Size:
14.12 MB
Format:
Adobe Portable Document Format
Description:
Collections