from Hacker News

Microsoft OCR Library for Windows Runtime

by maouida on 9/19/14, 7:10 AM with 45 comments

  • by steeve on 9/19/14, 9:12 AM

    We had great results using tesseract-ocr[1] with SWT (state of the art text detection algorithm, via libccv[2]) on Linux.

    You can use our python bindings for both[3,4], although they might be slightly outdated:

    [1] https://code.google.com/p/tesseract-ocr/

    [2] http://libccv.org/doc/doc-swt/

    [3] https://github.com/veezio/pytesseract

    [4] https://github.com/veezio/pyccv

  • by swalsh on 9/19/14, 2:30 PM

    This is very cool! I've been working on a receipt scanning tool in C# for keeping track of kitchen inventory (tired of calling my wife asking if we have sesame oil or some odd ball thing)

    I found a few libraries, but they only worked with relatively perfect scans (my goal is to be able to just use a phone). When I get home definitely going to give this a go.

  • by jamessantiago on 9/19/14, 8:23 AM

    Off topic, but this made me think that it would be neat if libraries on places like github and nuget could someout include "cited by" data. Something that referenced open source (maybe closed source too) projects that had a dependency to the library similar to google scholar or CiteSeerX.
  • by rikkus on 9/19/14, 8:57 AM

    It doesn't appear that you can use this in a 'normal' .NET app. Any ideas why?
  • by mdaniel on 9/20/14, 4:33 AM

    On http://msdn.microsoft.com/en-us/library/windows/apps/windows... they mention the supported languages and their statuses, but Korean is only "Good".

    I freely admit that I do not speak Korean, but if one compares "Chinese Simplified" characters (listed as "Very good") with those in the Korean alphabet, I am surprised those two entries aren't transposed.

    Is there something that makes recognizing Korean harder than Chinese Simplified, or was that just a product management decision?

  • by cipher0 on 9/19/14, 9:36 AM

    "demonstrated in code snippets below". The code snippets are actually images and even worse, they're JPEGs which is the reason why the text looks horrible.
  • by reallycurious on 9/19/14, 8:36 AM

    is this better than the terrassect OCR?
  • by jccodez on 9/19/14, 4:25 PM

    tesseract is really looking great with google adding searchable pdf as output in the latest release candidate.
  • by Norm-- on 9/19/14, 6:34 PM

    So from reading the list of reasons for inaccurate results, it sounds like this library is totally useless for images taken with mobile phones, yet it is only allowed to run on mobile ;)

    Now I would be more interested in an image correction library

    ".... Blurry images Handwritten or cursive text Artistic font styles Small text size (less than 15 pixels for Western languages, or less than 20 pixels for East Asian languages) Complex backgrounds Shadows or glare over text Perspective distortion Oversized or dropped capital letters at the beginnings of words Subscript, superscript, or strikethrough text"