from Hacker News

Project Naptha

by workah0lic on 7/11/22, 3:06 AM with 85 comments

  • by me_again on 7/11/22, 5:34 AM

    What I would really like is a little bit like this but not quite the same: full text search over everything I have ever seen on the computer. It would read and index the emails, web pages, word docs, etc as I open them, then later when I think "I know I saw a doc about cache oblivious algorithms", I can search for it without being distracted by 100K documents I haven't seen. Or I can find that email I read, without finding the same phrase in a bunch of junk mail I never opened.

    Does anything remotely similar exist?

  • by conorcleary on 7/11/22, 3:43 AM

    Submitters need to get back to more descriptive titles on HN. If this post is the first and only exposure to this project for a user, "Project Naptha" alone doesn't give me confidence to roll the dice and click on an unknown link unless there are existing comments I can investigate. Thus, chicken and egg, scroll by.
  • by keyle on 7/11/22, 3:54 AM

    Interesting but I thought my mac already does this? Maybe just M1 and Monterey.

    I was trying to debug some image the other day, upon inspection I got confused by the shadow dom doing weird stuff, but only in safari; to then realise that it was macOS converting the text in the image to text in shadow dom! ... Good/bad experience report I suppose.

    ref. https://support.apple.com/en-au/guide/preview/prvw625a5b2c/m...

  • by jordemort on 7/11/22, 5:38 AM

    This needs a year annotation - I almost emailed to express interest in a Firefox version until I noticed the references to Chrome 36 and Google+
  • by shubhamjain on 7/11/22, 5:32 AM

    This makes me feel old. I saw it for the first time in 2014 on HN, just when it was announced. I felt kind of envious of Kevin Kwok, author of this project. I had just graduated and he was still an undergrad. He had already shipped so many complex projects, including a full-fledged Flash Animator[1] for the web.

    It's pretty surprising to see that his site hasn't been updated since 2015 and not many projects have been shipped since then.

    [1]: https://antimatter15.com/project/ajax-animator/

  • by zxexz on 7/11/22, 6:45 AM

    I thought this seemed familiar. Discussed on HN 3 years ago [0]

    I remember using this for a while on a separate Chrome profile. It was quite useful, albeit quite CPU intensive.

    [0] https://news.ycombinator.com/item?id=20919147

  • by draugadrotten on 7/11/22, 6:56 AM

    This extension is a privacy nightmare. "By default, when you begin selecting text, it sends a secure HTTPS request containing the URL of the specific image... The server responds with a list of existing translations and OCR languages that have been done."

    That is some pretty sensitive data to keep around. There seems to be some rudimentary thinking around privacy: "no user tokens, no website information, no cookies or analytics" Yet keeping an index of all the image requests from any IP would not pass muster by any GDPR lawyer I have met.

    http://my-support-group/advice-for-disease.jpg http://my-political-group/campaign-ideas.jpg http://my-therapy-group/suicide-prevention.jpg

    https://ec.europa.eu/info/law/law-topic/data-protection/refo...

  • by emmelaich on 7/11/22, 7:08 AM

    FWIW, there is a Chrome app for cloud vision, OCR.

    https://chrome.google.com/webstore/detail/cloud-vision/nblmo...

    By a Google employee I understand but not official Google product of course.

  • by metadat on 7/11/22, 5:35 AM

    Does Naptha still work? I recently reviewed all installed chrome extensions and it seemed broken, so I removed it.
  • by Nition on 7/11/22, 8:43 AM

    A related trick for text that you can't usually select: Hold Alt. For instance try using the mouse to select text in http://www.google.com with and without Alt.
  • by tim-- on 7/11/22, 4:50 AM

    I had this extension on my Chrome browser a few years ago, and was dumbfounded when I thought that Chrome had added the ability to not only OCR PNG files - but also replace the text in them!

    Completely forgot that I installed this extension years earlier.

    Crazy extension!

  • by mcintyre1994 on 7/11/22, 8:12 AM

    I love this idea, Apple recently added it for saved photos on iOS and I think in preview too? It doesn't seem to be working for me though. I highlighted some text, hit ctrl + c and got this in my clipboard:

    <[ TEXT RECOGNITION IN PROGRESS / MORE INFO: http://projectnaptha.com/process/ (IDX:a:0-a:1-a:2-b:7&a:0&a:0&168&817:XDI) / ELAPSED 26.11SEC / DATE Mon, 11 Jul 2022 08:08:50 GMT / TEXT RECOGNITION IN PROGRESS ]>

    The right-click translate doesn't seem to work either, it just selects a whole paragraph.

  • by _tom_ on 7/11/22, 6:31 AM

    Nice! This works slightly better than apple's version, which makes it very hard to select the main image, once it detects text in the image. Naphtha seems to handle this correctly!
  • by ralfd on 7/11/22, 10:43 AM

    This is a cool project, but this is a bit embarrassing:

    > I started building a text recognizer algorithm specifically designed for Impact font, and it was actually working pretty well, but I kind of misplaced the code somewhere. So, until I find it or replace it, you'll have to use Tesseract configured with the "Internet Meme" language.

  • by jzer0cool on 7/12/22, 12:11 PM

    With a chrome extension, are there any trust, security, privacy issues enabling an extension with an unfamiliar organization as well as leak what sensitive images are being transferred.

    I wonder also why such "convert" options are not made available or are they in the works?

  • by jtth on 7/11/22, 5:04 PM

    I find things like TextSniper to be more useful than this functionality, which has been in iOS and MacOS for a little bit. I use it more on hypothetically selectable elements that for whatever DOM-related reason aren't selectable than I use it on text in images.
  • by LegitShady on 7/11/22, 3:54 AM

    the front page still contains references to google+
  • by rexreed on 7/11/22, 11:29 AM

    This project is almost a decade old! I wonder why it resurfaced? Coincidentally or perhaps not the book in the post "How to do nothing with nobody all by yourself" also is a top HN post right now. Maybe related?
  • by poulpy123 on 7/11/22, 12:47 PM

    what I would like to find is a free a cheap way to OCR my handwritten notes. I know that handwritten is much more difficult but it would be possible to use supervised learning mon my specific handwriting.
  • by holler on 7/11/22, 4:04 AM

    Really cool! I went to try the translate feature but each time I left-click to open the nav menu (macOs) it deselects the text? I know it says right-click but for me it's left-click to open the menu.
  • by a-dub on 7/11/22, 12:29 PM

    last time i saw this the ocr component of it was tesseract (originally hp's c++ ocr engine, later acquired and open sourced by google, even later rewritten to use neural networks) compiled into webassembly.
  • by avmich on 7/11/22, 6:57 PM

    > Unfortunately, your browser is not yet supported

    Latest version of Firefox? Seriously?

  • by nit987 on 7/11/22, 6:24 AM

    Hey, loved it. How much time it took to develop this?
  • by adastra22 on 7/11/22, 6:42 AM

    > Unfortunately, your browser is not yet supported

    Safari on macOS.

  • by moneywoes on 7/11/22, 5:23 AM

    What's the catch?