by mpaepper on 3/28/23, 8:47 PM with 25 comments
by ftxbro on 3/28/23, 9:00 PM
by yeldarb on 3/28/23, 9:54 PM
Counting: https://imgur.com/KTuQ1Bv
Parse the chess board: https://imgur.com/2zYFK1P
(Result): https://imgur.com/Ei4MAl7
Few-Shot Object Detection (Pascal VOC): https://imgur.com/gZkDMn8
Few-Shot Object Detection (simplified): https://imgur.com/Hk8QGMd
Not quite there yet. I've been more impressed with the other new zero-shot multimodal models like Grounding DINO and Azure Dense Captioning. Really looking forward to putting multimodal GPT-4 through its paces as well.
by vagabund on 3/28/23, 9:23 PM
[0] https://i.postimg.cc/GtrGs8mw/Screenshot-2023-03-28-at-5-19-...
by mpaepper on 3/28/23, 8:59 PM
by dfrankle on 3/28/23, 11:22 PM
by juxtaposicion on 3/29/23, 1:10 AM
by duxup on 3/28/23, 8:58 PM