by dlock17 on 11/4/23, 11:06 PM with 24 comments
However, if you are interested in OCR from Go without C complicating building and cross-compiling, there aren't any other options.
Wazero is a Go WASM runtime that doesn't have any CGo dependencies. Tesseract has been compiled to WASM with Emscripten and is run within Wazero.
Gogosseract provides a simple API on top of this. This project has been an interesting delve into the world of WASM.
by yklcs on 11/5/23, 8:03 AM
I think this method really shines in Go, since not having CGo simplifies a lot of things and a decently performant JIT-compiled runtime exists in the form of wazero.
by iampims on 11/5/23, 1:57 AM
by richieartoul on 11/5/23, 2:00 AM
by mappu on 11/5/23, 6:23 AM
by donatj on 11/5/23, 4:56 AM
by tommiegannert on 11/5/23, 8:01 AM
Since OCR is a somewhat slow process, how does the WASM approach compare to running libtesseract in a subprocess and using some IPC layer to talk to Go? It would require a separate C++ compiler, but not CGo.
> one of the largest Open Source OCR
Tangential, but are there others as large as Tesseract? It seems to pop up anywhere I look.
by abdullahkhalids on 11/5/23, 9:11 AM
How much difference is there between Tesseract and the best proprietary solutions?
by honkotime on 11/5/23, 2:09 PM
by technics256 on 11/5/23, 8:26 AM
by breadchris on 11/5/23, 2:05 AM