In China, they made a neural network CogVideo, which creates short videos based on a text description

June 1, 2022

In fact, the output is gifs. CogVideo can generate video at a relatively high frame rate of 32 frames in four seconds. The developers noted that the actual text input for video generation is in Chinese. CogVideo works on a principle similar to the neural networks DALL-E 2 and Imagine, which generate images from a text description.