Abstract
In the audio source separation topic, most researchers based on deep learning methods ignored higher-frequency signals due to lack of efficient data compression method. We propose a new model named OvertoneNet (OveNet) that adopts two novel concepts, frequency 1x1 convolution layers, and complex-spectrogram channels, to handle the 44.1k audio signals (Hi-Res audio signals) containing full overtones. The result shows that OveNet performs well in both objective and subjective evaluation on interference using limited training data from SiSEC2018.
Original language | English |
---|---|
Title of host publication | Proceedings - 2019 IEEE International Symposium on Multimedia, ISM 2019 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 148-151 |
Number of pages | 4 |
ISBN (Electronic) | 9781728156064 |
DOIs | |
State | Published - 12 2019 |
Externally published | Yes |
Event | 21st IEEE International Symposium on Multimedia, ISM 2019 - San Diego, United States Duration: 09 12 2019 → 11 12 2019 |
Publication series
Name | Proceedings - 2019 IEEE International Symposium on Multimedia, ISM 2019 |
---|
Conference
Conference | 21st IEEE International Symposium on Multimedia, ISM 2019 |
---|---|
Country/Territory | United States |
City | San Diego |
Period | 09/12/19 → 11/12/19 |
Bibliographical note
Publisher Copyright:© 2019 IEEE.
Keywords
- U-Net
- audio source separation
- convolutional neural networks