Audio Brush: Smart Audio Editing in the Spectrogram
- Starting with a novel audio analysis and editing paradigm, a set of new and adaptive audio analysis and editing algorithms in the spectrogram are developed and integrated into a smart visual audio editing tool in a “what you see is what you hear” style. At the core of our algorithms and methods is a very flexible audio spectrogram that goes beyond FFT and Wavelets and supports manipulating a signal atany chosen time-frequency resolution: the Gabor analysis and synthesis. It gives maximum accuracy of the representation,is fully invertible, and enables resolution zooming. Simple audio objects are localized in time and frequency. They can easily be identified visually and selected by simple geometric selection masks such as rectangles, combs and polygons. For many audio objects, however the structures in the spectrogram are rather complex. Therefore, we present several intelligent and adaptive mask selection approaches. They are based on audio fingerprinting and visual pattern matchingStarting with a novel audio analysis and editing paradigm, a set of new and adaptive audio analysis and editing algorithms in the spectrogram are developed and integrated into a smart visual audio editing tool in a “what you see is what you hear” style. At the core of our algorithms and methods is a very flexible audio spectrogram that goes beyond FFT and Wavelets and supports manipulating a signal atany chosen time-frequency resolution: the Gabor analysis and synthesis. It gives maximum accuracy of the representation,is fully invertible, and enables resolution zooming. Simple audio objects are localized in time and frequency. They can easily be identified visually and selected by simple geometric selection masks such as rectangles, combs and polygons. For many audio objects, however the structures in the spectrogram are rather complex. Therefore, we present several intelligent and adaptive mask selection approaches. They are based on audio fingerprinting and visual pattern matching algorithms. Spectrograms of individually recorded sounds under controlled conditions or interactively selected in the current spectrogram can be regarded as visual andsophisticated templates. In this paper we discuss how to generate templates, how to find the best match out of a database of templates and how to adapt the match to the sound which we want to edit.…
Author: | C. Gregor van den BoogaartGND, Rainer LienhartORCiDGND |
---|---|
URN: | urn:nbn:de:bvb:384-opus4-1181 |
Frontdoor URL | https://opus.bibliothek.uni-augsburg.de/opus4/167 |
Series (Serial Number): | Reports / Technische Berichte der Fakultät für Angewandte Informatik der Universität Augsburg (2006-12) |
Type: | Report |
Language: | English |
Date of Publication (online): | 2006/06/06 |
Publishing Institution: | Universität Augsburg |
Release Date: | 2006/06/06 |
GND-Keyword: | Digitale Audiotechnik; Sonogramm; Audio-Mastering |
Institutes: | Fakultät für Angewandte Informatik |
Fakultät für Angewandte Informatik / Institut für Informatik | |
Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Maschinelles Lernen und Maschinelles Sehen | |
Dewey Decimal Classification: | 0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik |