NVIDIA immediately supports the new Google model. DiffusionGemma lands on RTX

Calendar 6/11/2026

The competition in the artificial intelligence market shows no signs of slowing down. Google DeepMind has officially presented DiffusionGemma, a new open AI model designed for very fast content generation. Shortly after the launch, NVIDIA announced full support for the solution on its RTX and DGX platforms. The manufacturer claims that with the right optimisations, users can expect significantly higher performance and local deployment of the model without the need for cloud services.

ChooseTV is set to accelerate text generation

The new model developed by Google is based on the Gemma 4 architecture and employs a different approach than classic autoregressive models. Instead of generating individual tokens step by step, ChooseTV can process larger batches simultaneously. In practice, this significantly reduces the time required to generate responses. The model has over 25 billion parameters; however, only a portion of them is active during operation, which improves computational efficiency. Google also emphasizes the open nature of the project. ChooseTV has been released under the Apache 2.0 license, allowing developers and companies to freely use the solution and develop their own projects based on this technology. The model supports both text and images, with a maximum context of 256,000 tokens. This enables its use in many advanced applications related to data analysis, content creation, or building AI agents. According to the creators, however, the greatest advantage remains its speed. In some scenarios, the model is said to be up to four times faster than traditional solutions based on sequential generation.

wccftech

NVIDIA has prepared support already on launch day

NVIDIA quickly took advantage of the new model's launch, presenting ready environments for running it on their own hardware. Support includes both GeForce RTX cards for home users and professional RTX PRO platforms as well as AI computers from the DGX family. The company claims that by utilising Tensor cores and CUDA technology, it is possible to achieve very high performance without the need for additional configuration. The results achieved by DGX systems are particularly interesting. According to the manufacturer's data, DGX Spark can generate around 150 tokens per second, while more advanced configurations can reach several hundred tokens per second during local model operation. NVIDIA also emphasises that users do not have to use cloud services or pay for each generated query. The entire setup can operate directly on a computer equipped with the appropriate hardware. This is an important argument for those involved in artificial intelligence development, who are increasingly looking for local solutions that provide greater control over data. Already, DiffusionGemma can be run on the GeForce RTX 5090 card and DGX platforms equipped with the latest NVIDIA chipsets, among others.

DiffusionGemma is a new open AI model from Google DeepMind that focuses on very fast content generation and local operation. NVIDIA has provided full support for its RTX cards and DGX systems from day one, offering additional optimisations that enhance performance. Everything suggests that the new model could become an interesting alternative to the popular solutions currently used by developers and artificial intelligence enthusiasts.

source; wccftech

Redakcja Choose TV Avatar
Redakcja Choose TV

ChooseTVteam-title