Google has introduced "Imagen Video" which is a text-conditional video generation system.
With a text input, the system generates high-def videos using a base video generation model and a sequence of interleaved spatial and temporal video super-resolution models.
[
imagen.research.google...]