Google Gemini turns photos into songs. The new AI model composes music in 30 seconds.

2/20/2026

Google is expanding the capabilities of its AI assistant. The Google Gemini app has introduced a music generation feature that allows users to create complete tracks not only from text descriptions but also based on images, documents, or PDF files. This is another step in the development of AI-based creative tools. The new feature works globally and supports all languages available in Gemini.

Lyria 3. AI composes and writes lyrics

Music generation is handled by the Lyria 3 model, developed by the Google DeepMind team. The system turns short descriptions into finished recordings about 30 seconds long.

The user can specify:

  • music genre,

  • mood of the piece,

  • tempo,

  • vocal style,

  • character of the sound.

The model generates both instrumental versions and full songs with lyrics and vocals. Importantly, Lyria 3 also analyzes uploaded files. Upload a photo, presentation, or document, and the system will create lyrics and a matching composition based on its content. Finished recordings receive unique cover art generated by the Nano Banana model. Files can be downloaded or shared via a link.

Dream Track comes to Shorts creators

Google is making the same technology available to video creators on YouTube Shorts. The feature, called Dream Track, lets users generate background music for short videos and is gradually rolling out to users outside the United States. The new version of the model brings improved sound quality compared to earlier tests: the tracks feature more polished vocals and better compositional coherence.

All generated recordings carry an inaudible SynthID watermark. Gemini can detect its presence and confirm whether a file was created by Google's system. The company has also implemented safeguards to prevent copying the style of specific artists. Entering a musician's name in the prompt does not produce a piece in their style; the name is treated only as inspiration for a new composition. Google emphasizes that the tool is intended to support users' creativity, not to replace professional creators.

The new feature in Gemini shows how quickly generative tools are developing. Creating music from a photo or document in half a minute once seemed futuristic, but today it is becoming part of everyday mobile applications. Lyria 3 expands AI capabilities with full-fledged compositions featuring vocals, and Dream Track may change the way content is created in Shorts.

Source: Google

Katarzyna Petru

Journalist, reviewer, and columnist for the "ChooseTV" portal