The CogVideoX1.5-5B-I2V of diffusers #611
liuxiaoyu1104
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
System Info / 系統信息
diffusers 0.32.0.dev0
torch 2.4.1+cu121
python 3.10.14
Information / 问题信息
Reproduction / 复现过程
I tried using CogVideoX1.5-5B-I2V and CogVideoX-5B-I2V based on CogVideoXImageToVideoPipeline(diffusers).
For CogVideoX-5B-I2V, width= 720, height = 480, num_frames = 49, num_inference_steps = 50.
For CogVideoX1.5-5B-I2V, width=1360, height=768, num_frames = 77, num_inference_steps = 50.
The generated videos of CogVideoX-5B-I2V are good.
In the generated videos of CogVideoX1.5-5B-I2V, the brightness of the first few frames is inconsistent with the images, and the latter part of the video exhibits blurriness and temporal inconsistency.
The image:
![5281642-hd_1920_1080_30fps](https://private-user-images.githubusercontent.com/58921114/395972154-a490c609-8d00-4c31-8e5b-5055d893b58f.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3Mzk0MjM5MDksIm5iZiI6MTczOTQyMzYwOSwicGF0aCI6Ii81ODkyMTExNC8zOTU5NzIxNTQtYTQ5MGM2MDktOGQwMC00YzMxLThlNWItNTA1NWQ4OTNiNThmLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTMlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEzVDA1MTMyOVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWI4YTk5MjBlOTNiYWJjNDFmMTgwYmIyNjEyZjk2MjY0ZjYyNTFkZjZiYWVjNTg2NWNhMTJmMWUzYWIxY2Q4ZjImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.-IYHxg6E1mAePVPd1iNpvOShzolR7O6rmOiGtGPQh-U)
The result of CogVideoX-5B-I2V:
A.man.walking.in.the.road._480_720_49_1.0_output.mp4
The result of CogVideoX1.5-5B-I2V:
A.man.walking.in.the.road._768_1360_77_output.mp4
Expected behavior / 期待表现
The brightness of videos generated byCogVideoX1.5-5B-I2V is consistent with the images.
Beta Was this translation helpful? Give feedback.
All reactions