Google introduces MusicLM, a model for generating high-fidelity music from text descriptions such as “a calming violin melody backed by a distorted guitar riff.”
![](https://static.wixstatic.com/media/669e65_10f9194db7534320ba1958533aa754e1~mv2.png/v1/fill/w_600,h_404,al_c,q_85,enc_auto/669e65_10f9194db7534320ba1958533aa754e1~mv2.png)
MusicLM casts the process of conditional music generation as a hierarchical sequence-to-sequence modeling task, and it generates music at 24 kHz that remains consistent over several minutes.
Comments