Max generation time 30 seconds #13

gitfabianmeyer · 2024-05-28T15:38:58Z

Hi, first of all, great work! Saw this on reddit. Is the 30 seconds limitation due to the model behind it or what is the reason? Is it planned to increase this? Or even possible?

gabotechs · 2024-05-28T16:21:22Z

It's a bit involved, the current models only generate up to 30 seconds, this is a hard limitation of the models used (https://audiocraft.metademolab.com/musicgen.html)

There's a way of overcoming this limitation using a variation of the model that can be conditioned not only by text prompts but also by music samples (https://huggingface.co/facebook/musicgen-melody), my plan is to implement this.

freetoad · 2024-06-07T13:18:25Z

does musicgen-melody allow users to generate vocals?

gabotechs · 2024-06-17T14:35:41Z

I don't think so, musicgen-melody just allows conditioning the music generating based on other music samples, but it's still not trained to generate vocals

Nurb4000 · 2024-08-23T01:02:24Z

Looking forward to this, as 30 seconds is not overly useful beyond "hey, it works" ( and i realize its not your fault, appreciate what you have done.. ). Its too bad they 'hard limit' to such a small time.

It's a bit involved, the current models only generate up to 30 seconds, this is a hard limitation of the models used (https://audiocraft.metademolab.com/musicgen.html)

There's a way of overcoming this limitation using a variation of the model that can be conditioned not only by text prompts but also by music samples (https://huggingface.co/facebook/musicgen-melody), my plan is to implement this.

adamperez · 2024-10-11T16:34:28Z

There's a section on the MusicGen site titled "Long generation", where they reference having a sliding window to make a longer track. Is that something that is surfaced with this tool?

George3d6 · 2024-10-22T04:50:09Z

Also curious about what @adamperez mentioned? And, if not surfaced, how easy would it be to add? I'd be interested in contributing it

rubeniskov · 2024-11-13T12:15:46Z

To support these features, the ONNX export models need to be available for melody models.

huggingface/optimum#2095

designspin · 2025-01-19T23:51:41Z

This is super impressive and lightning quick on my m1 Mac studio... Overcoming the 30 second limit would be awesome, this would be a super time saver for my indie game music. It takes me weeks to come up with something myself, this is churning out better results in minutes!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Max generation time 30 seconds #13

Max generation time 30 seconds #13

gitfabianmeyer commented May 28, 2024

gabotechs commented May 28, 2024

freetoad commented Jun 7, 2024

gabotechs commented Jun 17, 2024

Nurb4000 commented Aug 23, 2024

adamperez commented Oct 11, 2024

George3d6 commented Oct 22, 2024

rubeniskov commented Nov 13, 2024

designspin commented Jan 19, 2025

Max generation time 30 seconds #13

Max generation time 30 seconds #13

Comments

gitfabianmeyer commented May 28, 2024

gabotechs commented May 28, 2024

freetoad commented Jun 7, 2024

gabotechs commented Jun 17, 2024

Nurb4000 commented Aug 23, 2024

adamperez commented Oct 11, 2024

George3d6 commented Oct 22, 2024

rubeniskov commented Nov 13, 2024

designspin commented Jan 19, 2025