影片說明
How do AI models actually turn a text prompt into a cohesive, high-resolution video? In this episode of Release Notes Explained, we take a look at the technical architecture behind AI video generation and how diffusion models work under the hood to create video clips.
0:00 - Intro and How AI video works
1:05 - Diffusion vs. text models
1:19 - The diffusion process (forward and reverse diffusion)
3:56 - Using text prompts to guide output
6:19 - Solving flickering
9:12 - Saving compute with latent space
10:36 - Reconstructing pixels for the final output
Subscribe to Google for Developers → https://goo.gle/developers
Speaker: Nikita Namjoshi
Products Mentioned: Google AI