by arduinomancer on 8/16/24, 2:20 AM with 61 comments
Has anyone figured out why this is the case?
by keiferski on 8/16/24, 3:35 AM
This is actually something of a pet peeve of mine - people sharing AI images never use styles other than the generic shiny one, and so places like Reddit.com/r/midjourney are filled with the same exact style of images.
Edit: if you’re looking for other style inspiration ideas, this website is a great resource for Midjourney keywords: https://midlibrary.io/styles
by vipshek on 8/16/24, 3:47 AM
All that being said, it's very possible to prompt these generators to create images in a particular style. I usually include "flat vector art" in image generation prompts to get something less photorealistic that I've found is closer to the style I want when generating images.
If you really want to go down the rabbit hole, click through the styles on this Stable Diffusion model to see the range that's possible with finetuning (the tags like "Watercolor Anime" above the images): https://civitai.com/models/264290/styles-for-pony-diffusion-...
by feverzsj on 8/16/24, 5:18 AM
by sidkshatriya on 8/16/24, 3:38 AM
In AI models, I think the pictures the AI generates is biased to generate is also a form of "airbrush" except the model makes the reflectivity of the images high -- simply to hide the fact that there _arent_ any imperfections that would make the photo more realistic.
In other words, gloss is just a form of airbrushing -- AI does it to hide the fact that there are no more details available.
I would guess that AI models could make the airbrush more like the airbrush human photo editors do by changing some hyper-parameters of their models.
by spaceman_2020 on 8/16/24, 5:28 AM
You can get realistic images with Midjourney and Flux with minimal prompt tuning. Adding “posted on snapchat” or “security camera footage” to the prompt will often produce mostly realistic looking images
by txnf on 8/16/24, 3:25 AM
https://github.com/LAION-AI/laion-datasets/blob/main/laion-a...
obviously, it reflects the mass preference for glosslop
secondarily it is likely due to a desire to ensure that ai images have a distinct look
by ClassyJacket on 8/16/24, 4:22 AM
by blululu on 8/16/24, 5:43 AM
First, AI Images != OpenAI/ChatGPT Images. OpenAI has done a great job making a product that is accessible and thus their product decisions get a lot more exposure than other options. A few people have commented how there are several Stable Diffusion fine tunings that produce very different styles.
Second, AI Images and images of AI images of people are different. I think that the high gloss style is most pronounced in people. Partly this is because it is more notable and out of place.
If you take the previous two points as being true the question becomes why does ChatGPT image model skew toward generated shiny people. I would venture that is a conscious product decision that has something to do with what someone thought looked the most reliably good given their model's capabilities.
Some wild speculation as to why this would be the case:
* Might have to do with fashion photos having unusually bright lights and various cosmetics to give a sheen.
* It might have something to do with training the model on synthetic data (i.e. 3d models) which will have trouble producing the complicated subsurface scattering of human skin.
* Might have something to do with image statistics and glossy finishes creeping in where they don't belong.
* Might have to do with the efficiency of representing white spots.
by DaoVeles on 8/16/24, 5:41 AM
by latentsea on 8/16/24, 4:44 AM
by simonw on 8/16/24, 4:03 AM
This is pretty clearly training for a preference model - so they now have MILLIONS of votes showing which images are more "pleasing" to their users.
by BobbyTables2 on 8/16/24, 3:27 AM
by disconcision on 8/16/24, 5:48 AM
by anileated on 8/16/24, 4:39 AM
by ffhhj on 8/16/24, 3:10 AM
That would be the "oiled bodybuilder" applied to image training. Maybe similar and clearly defined lighting also allows AI's to match features much better, specially volumes.
by osigurdson on 8/16/24, 4:40 AM
by t0bia_s on 8/16/24, 12:11 PM
Mostly highlights down, shadows and clarity up. I often needs to edit it back to have realistic looking lights on scene.
Also "--s" 0 helps with generating more realistic images.
by LeoPanthera on 8/16/24, 4:51 AM
by Scrapemist on 8/16/24, 5:51 AM
by Blackthorn on 8/16/24, 3:37 AM
by bronya19c on 8/16/24, 5:02 PM
by jerpint on 8/16/24, 4:24 AM
by bronya19c on 8/16/24, 5:06 PM
by RicoElectrico on 8/16/24, 3:59 PM
by tivert on 8/16/24, 4:11 AM
I haven't noticed that a lot of AI images are generated with a "realistic cartoon" style, and I assume that's to smooth over some uncannyness.
by bni on 8/16/24, 10:53 AM
by allanren on 8/16/24, 4:09 AM