from Hacker News

DALL-E Mini – Generate images from a text prompt

by tuhins on 6/10/22, 10:01 PM with 22 comments

  • by __rito__ on 6/11/22, 7:10 AM

    Wow, this author is very dishonest as it does not mention any of the people who created this project in the first place. I was one of the people who worked in this project.

    This was spearheaded by Boris Dayma, now at Weights and Biases.

    This is an Open Source project with all code and methods in public.

    See either GitHub (https://github.com/borisdayma/dalle-mini) or the hosted space in Hugging Face Hub (https://huggingface.co/spaces/dalle-mini/dalle-mini) or the project report (https://wandb.ai/dalle-mini/dalle-mini/reports/DALL-E-mini-G...).

    This project was also covered in the NYT article on Dalle2 by Cade Metz.

    The author gives no credits at all. That is apalling.

    (Also, the one hosted in the HF Hub gives you better results)

    I just realized that this person is either using our model (some point in the past) and not giving us due credit, or they trained a new model and the name just happens to match.

    In the latter case, please ignore my rant and use my links as a reference to another project than the claim that this prpject is our project.

  • by smcleod on 6/11/22, 5:09 AM

    This one seems really poor compared to the other minis I've tried. Mostly unrecognisable, blurred shapes
  • by wbraun on 6/11/22, 2:40 AM

    Are there different variants of DALL-E Mini? Running prompts through both this version and the one hosted on huggingface gives noticeably different results. The one on huggingface seems to give more accurate responses.
  • by masswerk on 6/11/22, 5:32 AM

    Interesting results: I tried "a train entering a station" and "a train in the countryside". Both images showed a track with rails and some kind of distortion (somewhat reminiscent of speed, more so the first one), but no train, omitting the subject in favour of circumstances.

    So, a touch of Rain, Speed and Steam?

    So I tried "a train speeding in rain" and got a somewhat car-like out of the window view on a rainy landscape, with a hint of rails somewhat mangled into what looked more like a road for automobiles to me. — However, no Turner… ;-)

  • by scottlawson on 6/11/22, 1:50 AM

    I tried

    a green bowl a green bowl with an apple a green bowl with an apple inside a banana in a bowl

    the only one that seemed correct was "a green bowl", all of the others were very different.

  • by jerpint on 6/11/22, 2:27 AM

    How is this different from dall-e mini on huggingface?
  • by userbinator on 6/11/22, 8:56 AM

    The results are amusing but not particularly accurate; "cat" resulted in a recognisable but distorted cat, "dog" produced a barely recognisable nightmarish blob of fur and eyes, and "pig" output something with nothing more than the general texture of a pig.
  • by ncr100 on 6/11/22, 3:38 PM

    Check out the horror show that is "carrot top comedian".

    For out of four queries resulted in synthetic portraits that are terrifically scary.

  • by athorax on 6/11/22, 1:00 AM

    Mostly just getting unrecognizable blobs