by Epitom3 on 11/22/22, 11:10 AM with 46 comments
by recuter on 11/22/22, 12:11 PM
https://github.com/tesseract-ocr/tessdata
https://en.wikipedia.org/wiki/Tesseract_(software)
The demo of course works perfectly on a Mac as this is already built into Ventura.
If you haven't experienced it yet ye olde ctrl-f now seamlessly sneaks a peak into images on the page for example, surprisingly useful.
In November 2020, Brewster Kahle from the Internet Archive praised Tesseract saying:
Tesseract has made a major step forward in the last few years. When we last evaluated the accuracy it was not as good as the proprietary OCR, but that has changed– we have done evaluations and it is just as good, and can get better for our application because of its new architecture.
Anybody have an up to date breakdown of available OCR solutions?by twobitshifter on 11/22/22, 3:14 PM
https://learn.microsoft.com/en-us/windows/powertoys/text-ext...
by mavu on 11/22/22, 2:18 PM
Seems dishonest to me, but maybe I'm just too strict.
by rjzzleep on 11/22/22, 1:01 PM
#!/usr/bin/env bash
langs=(eng ara fas chi_sim chi_tra deu ell fin heb hun jpn kor nld rus tur)
lang=$(printf '%s\n' "${langs[@]}" | dmenu "$@")
maim -us | tesseract --dpi 145 -l eng+${lang} - - | xsel -bi
by lervag on 11/22/22, 9:58 PM
#!/usr/bin/env bash
rm -f /tmp/screen.png
flameshot gui -p /tmp/screen.png
tesseract \
-c page_separator="" \
-l "eng" \
--dpi 145 \
/tmp/screen.png /tmp/screen
if [ "$(wc -l < /tmp/screen.txt)" -eq 0 ]; then
notify-send "ocrmyscreen" "No text was detected!"
exit 1
fi
xclip /tmp/screen.txt
notify-send "ocrmyscreen" "$(cat /tmp/screen.txt)"
[0]: https://flameshot.org/by ensocode on 11/22/22, 1:07 PM
by xchip on 11/22/22, 12:33 PM
by seltzered_ on 11/22/22, 2:30 PM
Linux: dpScreenOCR - x11 only last I checked in and now Frog
MacOS: screenotate, prizmo
Windows: screenotate
I don't get all the nitpick comments. OCR tools like this are extremely useful when dealing with excerpting text from certain websites (slack) or taking class notes from video.
by habibur on 11/22/22, 12:13 PM
by schappim on 11/22/22, 7:30 PM
by noisediver on 11/22/22, 12:35 PM
by throwawaaarrgh on 11/22/22, 12:22 PM
by jalacang on 11/22/22, 1:29 PM