Itakura-Saito nonnegative matrix factorization with group sparsity : complete results

Acknowledgements

We wish to thank artists AlexQ and Shannon Hurley, for providing source tracks of their compositions ). They are available on the Internet Archive and the ccmixter website, and licensed under the Creative Commons Non-Commercial License
We experiment our algrithm on two tracks : The sources were available, from which we took 20-30 seconds excerpts. We partially muted each source to control the proportion p of overlap, as illustrated below :

.jpg image

We compare our algorithm (GIS-NMF) with : Click on items in the table below to listen to the audio files !


track source GIS-NMF baseline Ideal NMF Random Mask
love 0% bass 8.88 -67.53 8.86 -8.55
guitar 13.60 3.77 13.94 -2.19
love 33% bass 4.33 -4.60 4.56 -8.74
guitar 9.77 -7.40 9.90 -2.02
love 66% bass 1.47 -5.29 3.12 -9.08
guitar 7.72 -8.11 8.68 -1.94
love 100% bass -5.13 -4.16 2.54 -9.02
guitar -0.21 -2.68 8.09 -2.02
sunrise 0% guitar 3.74 3.33 11.33 -4.27
voice 0.31 -0.88 10.62 -5.35
sunrise 33% guitar 12.06 -31.68 11.30 -3.85
voice 10.42 -2.85 9.72 -5.82
sunrise 66% guitar 2.37 2.88 10.90 -3.77
voice -4.03 -3.41 9.12 -5.93
sunrise 100% guitar 2.30 -7.53 5.98 -3.57
voice -5.37 -2.43 3.33 -6.14