by theolivenbaum on 7/25/24, 1:16 PM with 0 comments
Our C# library supports:
- Image captioning (from concise to detailed)
- Optical Character Recognition (OCR)
- Region-based OCR
- Object detection
- Optional phrase grounding
The library is a C# port of Microsoft's Florence-2 model (from https://huggingface.co/microsoft/Florence-2-base), based on the original model and the JS port by Frank Krueger (https://github.com/praeclarum/transformers-js).