papers AI Learner
The Github is limit! Click to go to the new site.

Neuralogram: A Deep Neural Network Based Representation for Audio Signals

2019-04-10
Prateek Verma, Chris Chafe, Jonathan Berger

Abstract

We propose the Neuralogram – a deep neural network based representation for understanding audio signals which, as the name suggests, transforms an audio signal to a dense, compact representation based upon embeddings learned via a neural architecture. Through a series of probing signals, we show how our representation can encapsulate pitch, timbre and rhythm-based information, and other attributes. This representation suggests a method for revealing meaningful relationships in arbitrarily long audio signals that are not readily represented by existing algorithms. This has the potential for numerous applications in audio understanding, music recommendation, meta-data extraction to name a few.

Abstract (translated by Google)
URL

http://arxiv.org/abs/1904.05073

PDF

http://arxiv.org/pdf/1904.05073


Similar Posts

Comments