🎨 Art2Mus: Artwork-to-Music Generation via Visual Conditioning and Large-Scale Cross-Modal Alignment 🎶
Ivan Rinaldi, Matteo Mendula, Nicola Fanelli, Florence Levé, Matteo Testi, Giovanna Castellano, and Gennaro Vessio
Music generation has advanced markedly through multimodal deep learning, enabling models to synthesize audio from text and, more recently, from images. However, existing image-conditioned systems suff...