Title:
Panoptic reconstruction of immersive virtual soundscapes using human-scale panoramic imagery with visual recognition
dc.contributor.author | Huang, Mincong (Jerry) | |
dc.contributor.author | Chabot, Samuel | |
dc.contributor.author | Braasch, Jonas | |
dc.contributor.corporatename | International Community for Auditory Display | |
dc.date.accessioned | 2022-03-21T16:18:47Z | |
dc.date.available | 2022-03-21T16:18:47Z | |
dc.date.issued | 2021-06 | |
dc.description | Presented at the 26th International Conference on Auditory Display (ICAD 2021) 25-28 June 2021, Virtual conference. | |
dc.description.abstract | This work, situated at Rensselaer's Collaborative-Research Augmented Immersive Virtual Environment Laboratory (CRAIVELab), uses panoramic image datasets for spatial audio display. A system is developed for the room-centered immersive virtual reality facility to analyze panoramic images on a segment-by-segment basis, using pre-trained neural network models for semantic segmentation and object detection, thereby generating audio objects with respective spatial locations. These audio objects are then mapped to a series of synthetic and recorded audio datasets and populated within a spatial audio environment as virtual sound sources. The resulting audiovisual outcomes are then displayed using the facility's human-scale panoramic display, as well as the 128-channel loudspeaker array for wave field synthesis (WFS). Performance evaluation indicates effectiveness for real-time enhancements, with potential for large-scale expansion and rapid deployment in dynamic immersive virtual environments. | |
dc.identifier.doi | https://doi.org/10.21785/icad2021.043 | |
dc.identifier.uri | http://hdl.handle.net/1853/66346 | |
dc.publisher | Georgia Institute of Technology | |
dc.publisher.original | International Community for Auditory Display (ICAD) | |
dc.relation.ispartofseries | International Conference on Auditory Display (ICAD) | |
dc.rights | Licensed under Creative Commons Attribution Non-Commercial 4.0 International License. | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc/4.0/ | |
dc.subject | Auditory display | |
dc.title | Panoptic reconstruction of immersive virtual soundscapes using human-scale panoramic imagery with visual recognition | |
dc.type | Text | |
dc.type.genre | Proceedings | |
dspace.entity.type | Publication | |
local.contributor.corporatename | Sonification Lab | |
local.relation.ispartofseries | International Conference on Auditory Display (ICAD) | |
relation.isOrgUnitOfPublication | 2727c3e6-abb7-4df0-877f-9f218987b22a | |
relation.isSeriesOfPublication | 6cb90d00-3311-4767-954d-415c9341a358 |