Music Structure Boundary Detection and Labelling by a Deconvolution of Path-Enhanced Self-Similarity Matrix

Abstract:

We propose a music structure analysis method that converts a path-enhanced self-similarity matrix (SSM) into a block-enhanced SSM using non-negative matrix factor 2-D deconvolution (NMF2D). With a non-negative constraint, the deconvolution intuitively corresponds to the repeated stripes in the path-enhanced SSM. Then the block-enhanced SSM is constructed without any clustering technique. We fuse block-enhanced SSMs obtained using different parameters, resulting in better and more robust results. Discussion shows that the proposed method can be a potential tool for analysing music structure at different scales.

Published in: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Date of Conference: 15-20 April 2018

Date Added to IEEE Xplore: 13 September 2018

ISBN Information:

Electronic ISSN: 2379-190X

DOI: 10.1109/ICASSP.2018.8461319

Conference Location: Calgary, AB, Canada

Contents

1. Introduction

A self-similarity matrix (SSM), generated by computing frame-to-frame similarity, is an important mid-level representation for music structure analysis. The repetition of segments typically leads to diagonal stripes in the SSM [1]. These stripes can be emphasised in a path-enhanced SSM by applying diagonal smoothing to the SSM [2] or to the recurrence plot [3]. As shown in Figure 1, the stripe patterns clearly illustrate both the repetition of segments (e.g., verse or chorus) and the repetition within a segment (e.g., the bridge in Figure 1). The stripes of the same group can be generated by shifting a specific structure pattern. Based on this observation, we propose to decompose the path-enhanced SSM into structure patterns (as shown in Figure 2(a)) and the shift activations corresponding to each pattern, using non-negative matrix factor 2-D deconvolution (NMF2D) [4]. We then construct a new, block-enhanced SSM by computing the SSM of the normalised activations. To obtain a robust result, we fuse block-enhanced SSMs obtained with different parameters. Finally, we detect boundaries on the fused, block-enhanced SSM using a checkerboard kernel [5], and label the detected segments by comparing each pair of segments.
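The surrounding steps of this pipeline can be illustrated with a minimal NumPy sketch. It assumes cosine similarity for the SSM, a plain moving average along the diagonal for path enhancement, and an untapered checkerboard kernel (a Gaussian taper is common in practice). The function names and parameters are hypothetical, and the NMF2D decomposition and the fusion step are not implemented here; this is not the authors' code.

```python
import numpy as np


def self_similarity(features):
    """Cosine SSM from a (frames, dims) feature array (assumed similarity measure)."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.maximum(norms, 1e-12)
    return unit @ unit.T


def path_enhance(ssm, length):
    """Emphasise diagonal stripes by averaging along the diagonal direction.

    Entry (i, j) becomes the mean of ssm[i + k, j + k] for k = 0..length-1,
    using only the offsets that stay inside the matrix.
    """
    n = ssm.shape[0]
    total = np.zeros_like(ssm, dtype=float)
    counts = np.zeros_like(ssm, dtype=float)
    for k in range(length):
        total[: n - k, : n - k] += ssm[k:, k:]
        counts[: n - k, : n - k] += 1.0
    return total / counts


def checkerboard_kernel(half_width):
    """Untapered checkerboard: +1 on the two diagonal quadrants, -1 elsewhere."""
    sign = np.concatenate([-np.ones(half_width), np.ones(half_width)])
    return np.outer(sign, sign)


def novelty_curve(ssm, half_width):
    """Slide the checkerboard kernel along the main diagonal of the SSM."""
    n = ssm.shape[0]
    kernel = checkerboard_kernel(half_width)
    padded = np.pad(ssm, half_width, mode="constant")
    novelty = np.zeros(n)
    for i in range(n):
        # Window covers original frames i - half_width .. i + half_width - 1.
        window = padded[i : i + 2 * half_width, i : i + 2 * half_width]
        novelty[i] = np.sum(window * kernel)
    return novelty


# Toy input: two homogeneous sections of 20 frames each.
toy_features = np.vstack([np.tile([1.0, 0.0], (20, 1)),
                          np.tile([0.0, 1.0], (20, 1))])
toy_ssm = self_similarity(toy_features)
toy_enhanced = path_enhance(toy_ssm, length=4)
toy_novelty = novelty_curve(toy_ssm, half_width=5)
boundary = int(np.argmax(toy_novelty))
```

On this toy input the novelty curve peaks at the frame where the second section starts. In the proposed method the kernel would instead be applied to the fused, block-enhanced SSM built from the NMF2D activations.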
