by cnuss on 11/26/24, 1:58 PM with 0 comments
- Uses EFS for model file persistence
- Pre-downloads models after deployment for faster cold starts
- Cold start: ~20s (model loading), warm requests: 5-20s (CPU inference)
- Fully automated container builds and deployment
- Works with private/gated models via HF_TOKEN
Example deployment: npx scaffoldly create app --template python-huggingface
cd python-huggingface && npx scaffoldly deploy
Scaffoldly is Open Source and I'm excited for all feedback and contributions from the community!https://github.com/scaffoldly/scaffoldly
https://github.com/scaffoldly/scaffoldly-examples/tree/pytho...