by matsuu on 9/18/16, 8:16 AM with 26 comments
by exDM69 on 9/18/16, 10:22 AM
I've done some high throughput streaming from HD/SSD to GPU before, and it's pretty easy to beat the naive solution but getting the most out of it would require kernel space code.
I was doing random access streaming of textures using memory mapped files for input and copying to persistent/coherent mapped pixel buffers on the CPU with memcpy with background threads. This was intended to take advantage of the buffer caches (works great when a page is reused) and intended for random access. If I would have been working on a sequential/full file upload, my solution would be entirely different.
Edit: here's the source: https://github.com/kaigai/ssd2gpu
It has a custom kernel module.
by zokier on 9/18/16, 10:11 AM
by foobar2020 on 9/18/16, 10:14 AM
by witty_username on 9/18/16, 9:44 AM
Neat.
by justinclift on 9/18/16, 1:02 PM
by nl on 9/18/16, 10:35 AM
NVLink is in the Power9 servers Google is using.
by carbocation on 9/18/16, 5:04 PM
by Razengan on 9/18/16, 6:01 PM
by foobarbecue on 9/18/16, 5:25 PM
by musha68k on 9/18/16, 10:00 AM
by MrBuddyCasino on 9/18/16, 12:22 PM