by heliophobicdude on 12/21/23, 1:38 PM with 42 comments
by mk_stjames on 12/21/23, 8:35 PM
It uses diff-gaussian-rasterization from the original gaussian splatting implementation (which, is a linked submodule on the git, so if you are trying to git clone that dependency remember to use --recursive to actually download it).
But that is written in mostly pure CUDA.
That part is just used to display the resulting gaussian splatt'd model, and there have been other cross-platform implementations to render splats – there was even that web demo a few weeks ago, that was using WebGL [0] – and if that was used as a display output in place of the original implementation there is no reason people couldn't use this on non-Nvidia hardware, I think.
edit: also device=cuda is hardcoded in the torch portions of the training code (sigh!). This doesn't have to be the case. pytorch could push this onto mps (metal) probably just fine.
[0] https://github.com/antimatter15/splat?tab=readme-ov-file
by catapart on 12/21/23, 4:09 PM
Which would produce a new mesh in the style of your other meshes, based on a single photograph of a real-world object.
...and, at this speed, you could do that as a real-time(ish) import into a running application/game.
Gotta say, I'm looking forward to someone putting these puzzle pieces together! But it really does feel like if we wait another month, there might be some new AI that shrinks that pipeline by another one or two steps! It's an exhausting time to be excited!
by joosters on 12/21/23, 5:18 PM
Like, if you have an image of a car, viewed at an angle, you can gauge the shape of the 3d object from the image itself. You could then assume that the hidden side of the car is similar to the side that you can see, and when you generate a 360 rotation animation of it, it will look pretty good (cars being roughly symmetrical). But if you gave it a flat image of a playing card, just showing the face up side, how would it reconstruct the reverse side? Would it infer it based on the front, or would it 'know' from training data that playing cards have a very different patterned back to them?
by roflmaostc on 12/21/23, 3:38 PM
Maybe some CV/ML people can help me understanding.
by XorNot on 12/21/23, 2:23 PM
by rijx on 12/21/23, 2:14 PM
by eurekin on 12/21/23, 5:05 PM
by lawlessone on 12/21/23, 3:10 PM
I'm expecting Overwhelming Fast Splatter by January.
by teunispeters on 12/21/23, 4:47 PM
by billconan on 12/21/23, 5:57 PM
by alkonaut on 12/21/23, 3:40 PM
Unlike say a crude model of a fire hydrant which you could throw into a game or whatever. If the model is fed some more constraints/assumptions? I think I saw some recent paper that did generate meshes now instead of pixels.
by StreetChief on 12/21/23, 4:52 PM
by amelius on 12/21/23, 5:55 PM
by anigbrowl on 12/21/23, 10:18 PM
by tantalor on 12/21/23, 3:23 PM