by Vt71fcAqt7 on 10/16/24, 2:56 PM with 44 comments
by cube2222 on 10/16/24, 5:58 PM
~25x faster performance than Flux-dev, while offering comparable quality in benchmarks. And visually the examples (surely cherry-picked, but still) look great!
Especially since with GenAI the best way to get good results is to just generate a large amount of them and pick the best (imo). Performance like this will make that much easier/faster/cheaper.
Code is unfortunately "(Coming soon)" for now. Can't wait to play with it!
by ttul on 10/17/24, 5:05 AM
Looking forward to it. This space just keeps getting more interesting.
by lpasselin on 10/16/24, 7:02 PM
by echelon on 10/16/24, 4:25 PM
3D models (sculpts, texture, retopo, etc.) are following a similar trend and trajectory.
Open video models are lagging behind by several years. While CogVideo and Pyramid are promising, video models are petabyte scale and so much more costly to build and train.
I'm hoping video becomes free and cheap, but it's looking like we might be waiting a while.
Major kudos to all of the teams building and training open source models!
by wiradikusuma on 10/17/24, 4:54 AM
That would be useful for e.g. book illustration, comic strips, icon sets. Otherwise, people would think you pick those images all over the internet and not from one source/theme.
by cpldcpu on 10/16/24, 10:59 PM
Basically they compress/decompress the images more, which means they need less computation during generation. But on the flip side this should mean less variability.
Isn't this more of a design trade-off than an optimization?
by cynicalpeace on 10/17/24, 1:30 PM
You have to release your model in some fashion for it to be impressive.
by amelius on 10/17/24, 12:35 PM
by smusamashah on 10/16/24, 3:11 PM