Here is an experiment of segmentation and audio-to-audio time alignment of Goldberg's variations. The Goldberd's Variations of J.S. Bach are a series of 32 music pieces initially played on harpsichord, and commonly played now on piano. A lot of recordings have been made, e.g. the famous Glenn Gould's recordings, and in some cases, repetitions are made (see e.g. https://github.com/petal2020/petal_bach_goldberg-variations). Using the recording of Cédric Pescia, without any repetitions, as reference, we try here to detect the repetition using an automatic segmentation method. Then, the detected segments are aligned in time. The test are based on the method of: “Invariant Audio Prints for Music Indexing and Alignment”, see https://anasynth.papers.ircam.fr/2024/CBMI/ .
For some variations (the Aria, and the variations 1, 3, 5, 7, 18, 25 and 26), we got the recordings of the following musicians:
For each variation, using the Pescia's recording as reference, named Ref, the recording of the other musicians are analysed, named Test.
For each test, one audio is created, with two stereo channels:
Remark: this transformation is based on a phase vocoder with time varying
stretching factor. In some cases, with strong factors, the sound quality
of the transformed segment is not good.
The following tables present for each tested variation the previews of
the original recordings, and of the realigned recordings.
When the cursor is over a realignment audio preview, two figures appear below the table. The axes are:
As a general conclusion about the results, presented below, the segmentation is almost good, and the time alignment is not bad.
Few tested variations are not well segmented, especially this of Wandna Landowska because: first the instrument is different than the reference instrument (harpsichord vs. piano), the recording is old (1933) and has noise, and some repetitions are different (repetition ABA' see https://github.com/petal2020/petal_bach_goldberg-variations/blob/main/Goldberg%20Variations_Repeats.tsv).
Nevertheless, the Andreas Staier's recording, also with harpsichord, is well segmented and aligned. Moreover, we can note that the tuning used for the recording is different that the tuning of Pescia's recording. Thanks to the invariance properties of the used audio codes (pitch shifting, and timbre change), the method succeds in this case.