by briggers on 10/10/20, 1:24 PM with 38 comments
by lostdog on 10/10/20, 5:55 PM
The article does skip the most important step for getting great inference speeds: Drop Python and move fully into C++.
by t-vi on 10/10/20, 6:54 PM
At least for the PyTorch bits of it, using the PyTorch JIT works well. When you run PyTorch code through Python, the intermediate results will be created as Python objects (with GIL and all) while when you run it in TorchScript, the intermediates will only be in C++ PyTorch Tensors, all without the GIL. We have a small comment about it in our PyTorch book in the section on what improvements to expect from the PyTorch JIT and it seems rather relevant in practice.
by nraynaud on 10/10/20, 11:15 PM
Long story short, that’s good, you’ve used a neural net to avoid using a human or an animal as a pose estimation datum, how do you correlate that to the rest of the sensor suite?
by NikolaeVarius on 10/10/20, 4:58 PM
I would love an alternative that is reasonably simple to implement. I dislike having to handle raw bits.
by vj44 on 10/11/20, 3:29 AM
This is a fascinating space, and there are tons of speed up opportunities. Depending on the type of the workload you're running, you might be able to ditch the GPU entirely and run everything just on the CPU, greatly reducing cost & deployment complexity. Or, at the very least, improve SLAs and 10x decrease the GPU (or CPU) cost.
I've seen this over and over again. Glad someone's documenting this publicly :-) If any one of you readers have more questions about this I'm happy to discuss in the comments here. Or you can reach out to me at victor at onspecta dot com.
by spockz on 10/10/20, 8:44 PM
by threatripper on 10/10/20, 8:31 PM
Are there some CNN-libraries that have way less overhead for small batch sizes? Tensorflow (GPU accelerated) seems to go down from 10000 fps on large batches to 200 fps for single frames for a small CNN.
by O5vYtytb on 10/10/20, 4:31 PM
What about using pytorch multiprocessing[1]?
[1] https://pytorch.org/docs/stable/notes/multiprocessing.html
by andrewbridger on 10/11/20, 12:57 AM
by mleonhard on 10/13/20, 11:36 AM
by egberts1 on 10/10/20, 5:41 PM
https://github.com/streamlit/demo-self-driving
It uses StreamLit