Science - Technology

Google releases video-generating AI, competing with OpenAI Sora

TH (according to VnExpress) May 15, 2024 08:10

Google first introduced a video-generating AI from commands called Veo with the ability to create 1080p videos longer than a minute, competing with OpenAI's Sora.

Ảnh được tạo ra từ câu lệnh: Ba người phụ nữ đứng cạnh nhau cười, với một người nằm ngoài khoảng nét một chút. Mặt trời đang lặn ở phía sau những người này, tạo ra ánh sáng loá của ống kính và làm nổi bật mái tóc, tạo hiệu ứng mờ ở hậu cảnh. Phong cách chụp chân thực, ghi lại khoảng khắc kết nối và hạnh phúc giữa những người bạn.... Ảnh: Google
The photo was created from the command: "Three women standing side by side smiling, with one slightly out of focus. The sun is setting behind them, creating lens flare and highlighting hair, creating a blurred effect in the background. The style is candid, capturing a moment of connection and happiness between friends..."

Veo was launched at the Google I/O event in the early morning of May 15 (Hanoi time). The product was introduced by Demis Hassabis, CEO of Google DeepMind, as being able to create "high quality" 1080p videos with many different visual and cinematic styles.

Veo was announced three months after Sora appeared and caused a stir in the community.

According to a Google representative, the AI ​​is capable of understanding natural language and can "accurately capture the tone of a prompt," thereby creating videos that closely reflect the user's creative vision. The model also understands cinematic terms like "timelapse" video or "aerial landscape photography," and can create consistent and coherent footage, with human subjects, animals, and objects moving realistically throughout the shot.

Demonstration videos of Veo’s capabilities are around eight seconds long, but Google says users can request longer durations of up to 1 minute and 10 seconds, as well as tweak them with additional prompts to change the results. That’s up from the one-minute maximum previously announced by OpenAI Sora.

According to Google, Veo is built on five video generation models including Generative Query Network (GQN), DVD-GAN, Imagen-Video, Phenaki, WALT, VideoPoet and Lumiere, combined with many other techniques to improve output quality and resolution.

They've improved techniques for how models learn to understand content in videos, display high-resolution images, simulate the physics of our world, and more.

“These insights will fuel advances in our AI research and enable us to build even more useful products that help people interact and communicate in new ways,” Google said.

At the event, the US tech giant also introduced an image-generating AI called Imagen 3. The product is advertised as creating pictures with "incredible levels of detail", realistic, lifelike images and less distracting details in the photo than previous models.

Imagen 3 also better understands natural language and predicts the user's intent behind the prompt, and can create photos with different styles.

Like many other video and photo-generating AIs, Veo and Imagen 3 are not yet widely available. Google says the new product is available for a limited number of creators to try out, with interested users needing to join a waiting list. The company also plans to bring some of Veo’s features to YouTube Shorts and other products.

TH (according to VnExpress)
(0) Comments
Latest News
Google releases video-generating AI, competing with OpenAI Sora