by wwoessi on 1/4/25, 6:25 PM with 33 comments
POC works using 'npx efficient-recorder'.
Is this useful to anyone?
by jazzyjackson on 1/4/25, 10:02 PM
The thing that enabled rewind.ai and MS Recall is storing the series of screenshots more like a HEIF, allowing for massive compression ratio and on-device storage + OCR provided by the OS (Live Text since Monterey 2021 [0], Microsoft introduced it last year for Snapdragon based AI-PCs [1])
I guess this is a good starting point if the goal is to fill S3 buckets with screencaps of multiple users, but then we're just back to corporate spyware, not tools for helping individuals use their machine more effectively.
That said, if I was using my own minio backend, it would be neat to archive my screen captures but I would change it so it captures after, say, every keystroke, and every time my mouse stops moving, and after every click. That way I have high density capture of taking actions, and low density otherwise. In any case collecting the data is not the issue, making an interface where that data becomes useful to help me remember something is.
[0] https://support.apple.com/guide/preview/interact-with-text-i...
[1] https://learn.microsoft.com/en-us/windows/ai/apis/text-recog...
by btown on 1/4/25, 9:30 PM
It's not quite screen captures, but the way in which any given email is responded to by competent users in your own organization is highly relevant in this context, especially if you place original+reply email pairs into a RAG framework and add function calls for structured domain knowledge.
Unified APIs like https://www.nylas.com/ which an admin can unilaterally connect across an entire org can make this quite viable - assuming you've done the work to build a culture where radical transparency is seen as an opportunity rather than a threat.
There's a lot of nuance required to avoid hallucinations, but organizations that are merely training chatbots on explicit Q&A documents are just scratching the surface of the depth of their semi-structured data.
by jdorfman on 1/5/25, 7:01 AM
by ssddanbrown on 1/4/25, 10:45 PM
by diggan on 1/4/25, 9:33 PM
> Attempt to create an Open Source Privacy Focused Rewind.ai Alternative for data capture
I'd assume this was something local or at least for your local network. But this exclusively sends the data over to S3. And based on the lack of encryption keys or even passwords, I'm assuming this is even unecrypted?
by dsiroker on 1/5/25, 6:37 AM
by wild_egg on 1/4/25, 8:10 PM
by patrickhogan1 on 1/5/25, 12:10 AM
I really like the rewind.ai retrieval mechanism. I believe their recording mechanism is highly broken. It often fails to sync to the os calendar and will ask you to record meetings you deleted months ago.
I don’t understand the webcam recording need. I’m not sure what signal you get from that since if you are in a web meeting you already have that on screen. Or if you are coding you might get a few WTF frown faces if working on a hard bug. But you made it optional, so that’s good.
by ThinkBeat on 1/4/25, 8:32 PM
Must one set up a S3 compatible stack on a home server somewhere?
by papichulo4 on 1/5/25, 12:25 AM
While we’re here, has anyone been able to export audio from Rewind.ai’s local storage?
by loxias on 1/4/25, 11:09 PM
It's also useful to me in that it's a solid example of what can be done with LLMs these days, wow!
Also, tangentially, a long long time ago I had a similar system set up, except for packets, not screencaps or audio. A 24h ringbuffer on my router to log _everything_ was a cool-to-have that made debugging network issues easier.
by athulmenon on 1/4/25, 8:09 PM
The issue with this one is that it misses the most important feature, the searchability. But you could probably focus on the low overhead aspect of your version.
[1] * Screenpipe https://github.com/mediar-ai/screenpipe * Memento https://github.com/apirrone/Memento * Rem https://github.com/jasonjmcghee/rem
by squeegee_scream on 1/4/25, 9:28 PM
by ieidkeheb on 1/5/25, 11:40 AM