This is an automated archive made by the Lemmit Bot.

The original was posted on /r/stablediffusion by /u/Similar_Piano_963 on 2024-08-27 14:38:13+00:00.


This model looks pretty good in the demos. Sadly it’s only t2v though. In my experience, ALL current video gen models are quite slot machine-y right now, so it would be great to be able to have it run i2v locally…

Possible for someone to turn this into an image to video model?

I’m no ML researcher, but maybe one could train an IP-Adapter type model to condition the beginning of the video? Maybe that’s not feasible, I don’t know. How cool would LORAs be for this too!?

Download CogVideoX 5b weights:

This 5B model runs on a 3060. And the previous 2B model is now Apache 2.0.

Hugging Space:

Paper:

Sources:

Gradio

Vaibhav (VB) Srivastav on X:

Adina Yakup on X:

Tiezhen WANG:

ChatGLM: