Organizational Unit:
School of Music

Research Organization Registry ID
Previous Names
Parent Organization
Parent Organization
Organizational Unit
Includes Organization(s)

Publication Search Results

Now showing 1 - 10 of 47
  • Item
    reNotate: The Crowdsourcing and Gamification of Symbolic Music Encoding
    (Georgia Institute of Technology, 2016-04) Taylor, Benjamin ; Shanahan, Daniel ; Wolf, Matthew ; Allison, Jesse ; Baker, David John
    Musicologists and music theorists have, for quite some time, hoped to be able to make use of computational methods to examine large corpora of music. As far back as the 1940s, an IBM card-sorter was used to implement patternfinding in traditional British folk songs (Bronson 1949, 1959). Alan Lomax famously implemented statistical methods in his Cantometrics project (Lomax, 1968), which sought to collate a large corpus of folk music from across many cultures. In the 1980s and 90s, a number of encoding projects were instituted in an attempt to be able to make searchable music notation on a large scale. The Essen Folksong Collection (Schaffrath, 1995) collected ethnographic transcriptions, whereas projects at the Center for Computer Assisted Research in the Humanities (CCARH) focused on scores in the Western Art Music tradition (Bach chorales, Mozart sonatas, instrumental themes, etc.). Recently, scholars have focused on improving Optical Music Recognition, in the hopes of facilitating the acquisition of large numbers of musical scores (Fujinaga, et al., 2014), but non-notated music, such as improvisational jazz, is often overlooked. While there have been many advances in music information retrieval in recent years, parameters that would facilitate in-depth musicological analysis are still out of reach (for example, stream segregation to examine specific melodic lines, or the analysis of harmony at a resolution that would allow for an analysis of specific chord voicings). Our project seeks to implement methods similar to those used in CAPTCHA and RECAPTCHA technology to crowdsource the symbolic encoding of musical information through a web-based gaming interface. The introductory levels ask participants to tap along with an audio recording's tempo, giving us an approximate BPM, while the second level asks for participants to tap with onsets. The third level asks them to match a contour of a three-note segment, and the final stage asks for specific note matching within that contour. A social-gaming interface allows for users to compete against one another. It is our hope that this work can be generalized to many types of musical genres, and that a web-based framework might facilitate the encoding of musicological and music-theoretic datasets that might be underrepresented by current MIR work.
  • Item
    Live Looping Electronic Music Performance with MIDI Hardware
    (Georgia Institute of Technology, 2016-04) McKegg, Matt
    I have spent much of the last three years building a Web Audio based desktop application for live electronic music performance called Loop Drop. It was inspired by my own frustration with existing tools for live performance. I wanted a tool that would give me, as a performer, the level of control and expression desired but still feel like playing a musical instrument instead of programming a computer. My application was built using web technologies such as JavaScript and HTML5, leveraging existing experience as a web developer and providing an excellent workflow for quick prototyping and user interface design. In combination with Electron, a single developer can build a desktop audio application very efficiently. Loop Drop uses Web MIDI to interface with hardware such as Novation Launchpad. The software allows creation of sounds using synthesis and sampling, and arranges these into “chunks” which may be placed in any configuration across the midi controller’s button grid. These sounds may be triggered directly, or played quantised to the current tempo at a given rate using “beat repeat”. Everything the performer plays is collected in a buffer that at any time may be turned into a loop. This allows the performer to avoid recording anxiety — a common problem with most live looping systems. They can jam out ideas, then once happy with the sequence, press the loop button to lock it in. In my performance, I will use Loop Drop in conjunction with multiple Novation Launchpad midi controllers, to improvise 15 minutes of electronic music using sounds that I have organised ahead of time. The user interface will be visible to the audience as a projection. I will also be demonstrating the power of hosting Web Audio in Electron by interfacing with an external LED array connected over Serial Peripheral Interface Bus (SPI) to be used as an audio visualiser light show.
  • Item
    rMIXr: how we learned to stop worrying and love the graph
    (Georgia Institute of Technology, 2016-04) Fields, Ben ; Phippen, Sam
    In this talk we present a case study in the use of the web audio APIs. Specifically, our use of them for the creation of a rapidly developed prototype application. The app, called rMIXr (, is a simple digital audio workstation (DAW) for fan remix contests. We created rMIXr in 48 hours at the Midem Hack Day in June 2015. We’ll give a brief demo of the app and show multi-channel sync. We'll also show various effects as well as cutting/time-slicing.
  • Item
    Virtual Sound Gallery
    (Georgia Institute of Technology, 2016-04) Bundin, Andrey
    Virtual Sound Gallery (VSG) is a web stage for modern multichannel music, sound, and audiovisual art. It is an accessible, web-based virtual reality (VR) environment for a visualized binaural simulation of multichannel sound reproduction. In this environment, a user can change their location among virtual loudspeakers and rotate their head to get the best spatial listening experience. In addition, an integrated video engine provides the ability to play visual content on one or several virtual screens in sync with the audio. VSG provides access to different electroacoustic music compositions presented in several virtual exhibitions and classified by concepts, styles, and organizations.
  • Item
    A Novel Approach to Streaming and Client Side Rendering of Multichannel Audio with Synchronised Metadata
    (Georgia Institute of Technology, 2016-04) Paradis, Matthew ; Pike, Chris ; Day, Richard ; Melchior, Frank
    Object based audio broadcasting is an approach which combines audio with metadata that describes how the audio should be rendered. This metadata can include spatial positioning mixing parameters and descriptors to define the type of audio represented by the object. In this talk we show an approach to enabling the streaming of multichannel audio and synchronised metadata to the browser. Audio is rendered in the browser to multiple formats based on the information contained in the synchronised metadata channel. This allows adaptive mixing and rendering of content and user interaction. Based on the MPEG/DASH standard this approach allows an arbitrary number of audio channels to be presented as discrete inputs to the Web Audio API (dependent on any channel limit imposed by the browser). Binaural, 5.1 and stereo renders can be generated and selected for output by the user in real time without any change to the source media stream. Channels marked as being interactive can have their properties exposed to the user to adjust based on their preferences. The audio and metadata is originated from a single BWF file compliant with ITU-R BS 2076 (Audio Definition Model) with the audio being encoded using AAC (as per the MPEG/DASH standard) and the metadata represented in JSON format to the browser. This approach provides a flexible framework for the prototyping and presentation of new audio experiences to online audiences and provides a platform for delivery object based audio to online users.
  • Item
    Soundworks Tutorial
    (Georgia Institute of Technology, 2016-04) Schnell, Norbert ; Lambert, Jean-Philippe ; Goldszmidt, Samuel ; Matuszewski, Benjamin
    The Soundworks framework is dedicated to the development of applications featuring co-located collaborative/collective mobile interactions. The framework is entirely based on web APIs and Node.js. It provides a set of services, abstractions, and user interfaces that help the developer to concentrate on the essential – the design of interaction and audiovisual rendering. In the tutorial, we will present the features and architecture of the framework as well as some examples before guiding the attendees through the implementation of a simple application. The workshop will conclude with a brief collective performance.
  • Item
    Constellation: A Musical Exploration of Phone-Based Audience Interaction Roles
    (Georgia Institute of Technology, 2016-04) Madhavan, Nihar ; Snyder, Jeff
    Constellation designs various relationships between audience and performers by using mobile devices to empower communication during a performance. We direct audience members to a website, which changes throughout the piece and controls the interaction among the performers and audience. We explore several paradigms of interaction, using them as movements of a larger piece. (1) We first explore audience members producing a soundscape using their own actions, through a mobile visual interface that encourages motion, and produces sounds such as bells and controlled noise. (2) We then allow the audience to control onstage performers, through an interface by which an audience can vote on projected notes that performers attempt to follow. This can easily be adjusted to be synthesized sounds without performers. (3) A final paradigm of interaction is of performers directly controlling the audience, using instrumentation that echoes out through the phones of the performers. We move throughout three different sections of the piece, exploring these different interactions and blending them together musically. Constellation was designed and built with the target of performance in an April 2015 concert with the Princeton Laptop Orchestra (PLOrk), with source material from medieval piece “Stella Splendens”. Constellation uses primarily uses, node.js, WebAudio, and Full Tilt.
  • Item
    Cross-Town Traffic 2.0
    (Georgia Institute of Technology, 2016-04) Walker, William ; Belet, Brian
    Cross-Town Traffic 2.0 is an ensemble music performance environment for any number of audience performers, a principal performer, and a conductor. The performers use their own mobile devices running a performance interface based on the Web Audio API. The conductor leads the performers through a fully composed musical structure. Sixteen previously recorded audio files (eight Hammond B3 samples, performed and recorded by Walker; and eight viola samples, performed and recorded by Belet) are arranged into four groups, with the audience performers similarly arranged in four corresponding performance sections. Following cues from the conductor, the ensuing performance immerses humans in the midst of cellphone speakers and the flow of the musical structure. Individual performers can shape their own audio contribution within the confines of the larger composed structure, providing an element of playful participation. The resulting distributed cellphone audio challenges the performance roles of the humans in the room, as opposed to the number and quality of loudspeakers in the space. Using mobile web audio offers very low barriers to audience participation, in contrast to logging into an app store, searching and finding the appropriate native app, installing and launching the app, all prior to the start of the performance.
  • Item
    Live Coding With EarSketch
    (Georgia Institute of Technology, 2016-04) Freeman, Jason
    EarSketch combines a Python / JavaScript API, a digital audio workstation (DAW) visualization, an audio loop library, and an educational curriculum into a web-based music programming environment. While it was designed originally as a classroom educational tool for music technology and computer science, it has recently been expanded to support live coding in concert performance. This live coding performance explores the artistic potential of algorithmic manipulations of audio loops in a multi-track DAW paradigm and explores the potential of DAW-driven visualizations to demystify live coding and algorithms for a concert audience.
  • Item
    WavesJS Tutorial
    (Georgia Institute of Technology, 2016-04) Matuszewski, Benjamin ; Schnell, Norbert ; Goldszmidt, Samuel
    This tutorial session will present WavesJS, a client-side low-level library dedicated to the audiovisual rendering of recorded audio signals and related data in the browser. The library has been designed with a strong focus on flexibility and modularity, and offers a set of extensible building blocks supporting a large range of applications. Along with the submitted paper, the workshop will present the general structure and architecture of the library (particularly the ui and audio components). A more in depth approach of the low level features will be achieved through the implementation of a simple application. WavesJS on GitHub: