Google Launches AI Image Generation Model Imagen 3 and Video Model Veo

Google lastly launched its newest synthetic intelligence (AI) picture and video technology fashions on Wednesday. Each of those AI fashions had been unveiled at Google I/O because the tech large’s newest development in generative AI. Now, greater than six months later, the Mountain View-based firm has launched it on Vertex AI for its enterprise shoppers. Notably, whereas Imagen 3 was not accessible as a standalone platform to date, it was being utilized in a number of platforms and instruments reminiscent of Google Docs, Gemini, and an experimental instrument known as GenChess.

Google Imagen 3, Veo AI Fashions

In a blog post, the tech large introduced the introduction of the 2 new AI fashions in Vertex AI. Google’s Vertex AI platform is a managed machine studying (ML) platform on Google Cloud that enables builders and enterprises to construct, deploy, and handle AI fashions. It’s much like Amazon Bedrock and Microsoft Azure and provides built-in instruments and options for AI workflows.

The tech large said that the Veo video technology mannequin is now accessible on Vertex AI in non-public preview and companies can generate movies utilizing textual content or picture prompts. Then again, Imagen 3 will likely be made accessible beginning subsequent week. It takes textual content prompts and enterprises can use it to generate photographs that replicate their model type and logos.

Coming to the capabilities of Veo, Google says it will possibly generate high-quality movies primarily based on both textual content or picture prompts. The generated movies might be in a variety of cinematic and visible kinds. Developed by DeepMind, the AI mannequin is alleged to have excessive immediate adherence and might generate constant footage of objects and other people and even seize actions realistically.

Imagen 3, which will likely be accessible in Vertex AI beginning subsequent week, can generate photorealistic photographs in a variety of kinds. Calling it “our most succesful picture technology mannequin but”, Google said that the picture technology mannequin can perceive pure language prompts and customers wouldn’t have to explain the technical parts to get the specified outcome.

The Imagen 3 AI mannequin can even be accessible with enhancing instruments for inpainting and outpainting. Firms also can infuse their model’s colors, kinds, logos, and different parts within the generated photographs.

For privateness and security, the tech large has added a number of instruments. SynthID, the watermarking know-how developed by DeepMind, will likely be embedded into each picture and body of video that these AI fashions produce to fight situations of deepfakes and misinformation. Google additionally said that the AI fashions is not going to be skilled on buyer information and the instruments will function following Google Cloud’s information governance and privateness controls.

Catch the most recent from the Shopper Electronics Present on Devices 360, at our CES 2025 hub.

Source link