Visual Audio: An Interactive Tool for Analyzing and Editing of Audio in the Spectrogram
- We present a tool for analyzing and editing audio signals in the visual domain. As visual representation we use spectrograms, which give descriptive information about the sound. This allows analysing and editing audio in a “what you see is what you hear” style. Gabor analysis and synthesis serves as a basis to create images and recreate audio signals from the edited images in hi-fi quality. As the structures in the spectrogram are rather complex, image processing and computer vision methods are applied for smart user assisted editing. Templates based on sounds recorded under defined conditions are therefore used. This allows detecting, separating, eliminating and/or modifying audio objects supervised (interactively) or automatically. We further propose the usage of resolution zooming, to support manipulating the spectrogram of a signal at any chosen time-frequency resolution.
Author: | C. Gregor van den BoogaartGND, Rainer LienhartGND |
---|---|
URN: | urn:nbn:de:bvb:384-opus4-1221 |
Frontdoor URL | https://opus.bibliothek.uni-augsburg.de/opus4/171 |
Series (Serial Number): | Reports / Technische Berichte der Fakultät für Angewandte Informatik der Universität Augsburg (2005-22) |
Type: | Report |
Language: | English |
Year of first Publication: | 2005 |
Publishing Institution: | Universität Augsburg |
Release Date: | 2006/06/06 |
GND-Keyword: | Digitale Audiotechnik; Audio-Mastering |
Institutes: | Fakultät für Angewandte Informatik |
Fakultät für Angewandte Informatik / Institut für Informatik | |
Fakultät für Angewandte Informatik / Institut für Informatik / Lehrstuhl für Maschinelles Lernen und Maschinelles Sehen | |
Dewey Decimal Classification: | 0 Informatik, Informationswissenschaft, allgemeine Werke / 00 Informatik, Wissen, Systeme / 004 Datenverarbeitung; Informatik |