Users will be able to use text, images, and videos as a reference to generate music.