| # | Source Prompt | Target Prompt | Original Audio | Edited Audio | Edit Skip |
|---|---|---|---|---|---|
| 1 | A recording of a sneaky jazz song. | A recording of a tense classical music score. | 90 | ||
| 2 | A recording of a hard rock song. | A recording of a jazz song. | 100 | ||
| 3 | A recording of a happy upbeat classical music piece. | A recording of a happy upbeat arcade game soundtrack. | 100 | ||
| 4 | A recording of a rock song. | A recording of Arabic music. | 90 | ||
| 5 | —— | A recording of a funky hip hop song. | 90 | ||
| 6 | Trumpets playing alongside a piano, bass and drums in an upbeat old-timey cool jazz song. | A banjo playing alongside a piano, bass and drums in an upbeat old-timey cool country song. | 110 | ||
| 7 | A recording of an upbeat gospel song. | A recording of an upbeat techno song. | 100 | ||
| 8 | A recording of a happy upbeat song in a Latin jazz style. | A recording of a happy upbeat song in a retro arcade game soundtrack style. | 110 | ||
| 9 | —— | A recording of a dark techno song. | 110 | ||
| 10 | A recording of a dramatic epic Chinese piece. | A recording of a dramatic heavy metal piece. | 160 | ||
| 11 | —— | A recording of an upbeat cool jazz song. | 110 | ||
| 12 | A recording of an old rock song. | A recording of an techno song. | 110 | ||
| 13 | Chinese strings, flutes, and harps playing an upbeat piece. | Chinese strings, flutes, and harps playing an somber piece. | 120 | ||
| 14 | A high quality recording of wind instruments and strings playing. | A high quality recording of a piano playing. | 130 | ||
| 15 | —— | A recording of an upbeat arcade game soundtrack. | 120 | ||
| 16 | A high quality recording of a cat meowing. | A high quality recording of a dog barking. | 50 | ||
| 17 | A high quality recording of a dog barking a lot. | A high quality recording of a gun shooting a lot. | 100 | ||
| 18 | A kid talking loudly. | A rooster crowing. | 90 |
| # | Source Prompt | Target Prompt | Original Audio | Skip=90 | Skip=100 | Skip=110 | Skip=120 | Skip=130 |
|---|---|---|---|---|---|---|---|---|
| 1 | A recording of a happy upbeat song in a Latin jazz style. | A recording of a happy upbeat song in a retro arcade game soundtrack style. | ||||||
| 2 | — | A recording of a funky jazz song. | ||||||
| 3 | Trumpets playing alongside a piano, bass and drums in an upbeat old-timey cool jazz song. | A banjo playing alongside a piano, bass and drums in an upbeat old-timey cool country song. |
| # | Inversion Prompt | Original Audio | Edited Audio +PC | Edited Audio +2PC | PC Interpretation | Edit Parameters |
|---|---|---|---|---|---|---|
| 1 | A high quality recording of flutes and a trumpet playing. | Melody change | t'∈[200, -1] Specific t=80 used PCs 1+2+3 |
|||
| 2 | A recording of a calm country song. | Remove singer | t'∈[150, -1] Specific t=115 used PCs 1+2+3 |
|||
| 3 | — | Just drums | t'∈[150, -1] Specific t=80 used PCs 1+2+3 |
|||
| 4 | A recording of a scary classical music piece. | Melody change | t'∈[150, 50] Specific t=95 used PCs 1+2+3 |
|||
| 5 | A trumpet and a saxophone playing a cool jazz melody, with an accompaniment of a piano, bass and drums. | Melody change | t'∈[135, 95] PCs 1+2+3 |
|||
| 6 | A high quality recording of wind instruments and strings playing. | Melody change | t'∈[135, 95] PCs 1+2+3 |
|||
| 7 | A strings section playing classical music. | Minor melody changes | t'∈[95, 80] PCs 1+2+3 |
|||
| 8 | A high quality recording of a woman singing while a guitar and drums play in the background. | Instrument change | t'∈[200, -1] Specific t=65 used PCs 1+2+3 |
| # | Inversion Prompt | Edited Audio -γPC | Original Audio | Edited Audio +γPC | PC Interpretation | Edit Parameters |
|---|---|---|---|---|---|---|
| 1 | A high quality recording of a man singing and drums, guitar and bass playing a song, and later a woman is singing. | Lead Guitar/Singers emphasis | t'∈[115, 80] PC #1 |
|||
| 2 | A high quality recording of a man singing and drums, guitar and bass playing a song, and later a woman is singing. | Singers/Drums emphasis | t'∈[115, 80] PC #2 |
|||
| 3 | A recording of ryhtmic clapping, a women singing, and drums and guitar playing. | Vibrato strength | t'∈[150, -1] Specific t=120 used PC #3 |
|||
| 4 | A high quality recording of a man singing with a rock band accompaniment. | Drum-beats style | t'∈[200, -1] Specific t=80 used PC #1 |
|||
| 5 | A recording of an old timey rock song from the sixties. | Guitar/Singer emphasis | t'∈[200, -1] Specific t=65 used PCs 1+2+3 |
|||
| 6 | — | Isolate Woman/Man | t'∈[115, 95] PC #1 |
| # | Source Prompt | Target Prompt | Original Audio | Ours | SDEdit skip=100 skip=130 skip=160 |
MusicGen | DDIM Inversion |
|---|---|---|---|---|---|---|---|
| 1 | A recording of a sneaky jazz song. | A recording of a tense classical music score. |
skip=90 |
|
|||
| 2 | A recording of a rock song. | A recording of Arabic music. |
skip=90 |
|
|||
| 3 | A recording of an upbeat rock song. | A recording of an arcade game soundtrack. |
skip=100 |
|
|||
| 4 | — | A recording of a dark techno song. |
skip=110 |
|
|||
| 5 | — | A recording of a funky hip hop song. |
skip=90 |
|
|||
| 6 | — | A recording of an upbeat arcade game soundtrack. |
skip=120 |
|
|||
| 7 | — | A recording of an upbeat cool jazz song. |
skip=110 |
|
|||
| 8 | A recording of an upbeat gospel song. | A recording of an upbeat techno song. |
skip=100 |
|
|||
| 9 | Trumpets playing alongside a piano, bass and drums in an upbeat old-timey cool jazz song. | A banjo playing alongside a piano, bass and drums in an upbeat old-timey cool country song. |
skip=110 |
|
|||
| 10 | A recording of a dramatic epic Chinese piece. | A recording of a dramatic heavy metal piece. |
skip=160 |
|
|||
| 11 | Chinese strings, flutes, and harps playing an upbeat piece. | Chinese strings, flutes, and harps playing an somber piece. |
skip=120 |
|
|||
| 12 | A high quality recording of wind instruments and strings playing. | A high quality recording of a piano playing. |
skip=130 |
|
|||
| 13 | — | A recording of a happy arcade game soundtrack. |
skip=90 |
|
|||
| 14 | A recording of a hard rock song. | A recording of a jazz song. |
skip=100 |
|
|||
| 15 | A recording of an old rock song. | A recording of an techno song. |
skip=110 |
|
|||
| 16 | A recording of a happy upbeat song in a Latin jazz style. | A recording of a happy upbeat song in a retro arcade game soundtrack style. |
skip=110 |
|
| # | Source Prompt | Target Prompt | Original Audio | Ours | SDEdit skip=50 | SDEdit skip=80 | SDEdit skip=100 | SDEdit skip=130 | DDIM Inversion |
|---|---|---|---|---|---|---|---|---|---|
| 1 | A high quality recording of a cat meowing. | A high quality recording of a dog barking. | skip=50 |
||||||
| 2 | A high quality recording of a dog barking a lot. | A high quality recording of a gun shooting a lot. | skip=100 |
||||||
| 3 | A kid talking loudly. | A rooster crowing. | skip=90 |
| # | Inversion Prompt | Original Audio | Our Semantic Edit | SDEdit Skip=85 | SDEdit Skip=100 | SDEdit Skip=115 | SDEdit Skip=130 | Our Edit Parameters |
|---|---|---|---|---|---|---|---|---|
| 1 | A high quality recording of a man singing and drums, guitar and bass playing a song, and later a woman is singing. | t'∈[115, 80] PC #1 |
||||||
| 2 | A high quality recording of a man singing with a rock band accompaniment. | t'∈[200, -1] Specific t=80 used PC #1 |
||||||
| 3 | — | t'∈[150, -1] Specific t=80 used PCs 1+2+3 |
||||||
| 4 | A high quality recording of flutes and a trumpet playing. | t'∈[200, -1] Specific t=80 used PCs 1+2+3 |
||||||
| 5 | A recording of a calm country song. | t'∈[150, -1] Specific t=115 used PCs 1+2+3 |
||||||
| 6 | A recording of a scary classical music piece. | t'∈[150, 50] Specific t=95 used PCs 1+2+3 |
||||||
| 7 | A trumpet and a saxophone playing a cool jazz melody, with an accompaniment of a piano, bass and drums. | t'∈[135, 95] PCs 1+2+3 |
||||||
| 8 | A high quality recording of wind instruments and strings playing. | t'∈[135, 95] PCs 1+2+3 |
||||||
| 9 | A strings section playing classical music. | t'∈[95, 80] PCs 1+2+3 |
||||||
| 10 | A recording of an old timey rock song from the sixties. | t'∈[200, -1] Specific t=65 used PCs 1+2+3 |
||||||
| 11 | A high quality recording of a woman singing while a guitar and drums play in the background. | t'∈[200, -1] Specific t=65 used PCs 1+2+3 |
| # | Type | Inversion Prompt | Edited Audios -γPC | Original Audio | Edited Audios +γPC | PC Interpretation | Edit Parameters | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Random | A high quality recording of a man singing with a rock band accompaniment. | γ = -12 |
γ = -8 |
γ = -2 |
γ = 2 |
γ = 8 |
γ = 12 |
t'∈[200, -1] Specific t=80 used PC #1 |
||
| Ours | A high quality recording of a man singing with a rock band accompaniment. | γ = -3 |
γ = -2 |
γ = -1 |
γ = 1 |
γ = 2 |
γ = 3 |
Drum-beat style | t'∈[200, -1] Specific t=80 used PC #1 |
||
| 3 | Random | — | γ = -240 |
γ = -120 |
γ = -40 |
γ = 40 |
γ = 120 |
γ = 240 |
t'∈[115, 95] PC #1 |
||
| Ours | — | γ = -60 |
γ = -40 |
γ = -20 |
γ = 20 |
γ = 40 |
γ = 60 |
Isolate Woman/Man | t'∈[115, 95] PC #1 |
||
| 5 | Random | A recording of an old timey rock song from the sixties. | γ = -12 |
γ = -8 |
γ = -2 |
γ = 2 |
γ = 8 |
γ = 12 |
t'∈[200, -1] Specific t=65 used PCs 1+2+3 |
||
| Ours | A recording of an old timey rock song from the sixties. | γ = -2 |
γ = -1 |
γ = -0.5 |
γ = 0.5 |
γ = 1 |
γ = 2 |
Guitar/Singer emphasis | t'∈[200, -1] Specific t=65 used PCs 1+2+3 |
||