OpenAI is reportedly developing a generative music tool designed to create compositions from text and audio prompts.
According to sources cited by The Information, the tool is meant to expand the creative toolkit available to content producers, blending artificial intelligence with musical artistry.
The tool could enable users to automatically craft musical accompaniments, such as background scores tailored for video projects or guitar parts complementing vocal tracks. This would significantly streamline the process for creators seeking customised audio, though it is not yet known when OpenAI plans to release the product or whether it will operate independently or be integrated into existing services like ChatGPT or the video application Sora.
In an effort to refine the AI’s musical understanding, OpenAI is reportedly collaborating with students from the Juilliard School, who annotate musical scores. The partnership is intended to ensure that the training data reflects both technical precision and artistic nuance, which are vital for producing convincing and expressive musical output.

While OpenAI experimented with generative music technology before the debut of ChatGPT, the company has recently focused on audio AI models targeted at enhancing text-to-speech and speech-to-text systems.
Earlier this year, OpenAI introduced its next-generation audio models, pushing the boundaries of these technologies. This new music tool underlines the company’s ongoing commitment to audio innovation.
The field of AI-generated music is becoming increasingly competitive, with prominent players like Google and emerging startups such as Suno also developing advanced generative music systems. These efforts collectively signal a shift toward more accessible, AI-assisted music creation, potentially transforming how artists and creators engage with sound.
As technology and artistic creativity converge, OpenAI’s latest initiative could open new avenues for producing personalised and sophisticated music, making professional-level composition tools available to a broader audience.