[Hacker News repost] I made an open source Windows app to rewind and search everything on screen
-
Title: I made an open source Windows app to rewind and search everything on screen
Text:
Url: https://tonoko.notion.site/I-made-an-open-source-app-to-rewind-search-everything-happened-on-your-screen-on-Windows-184d1a9d5edb494dba0c2f46d311ec5c
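The page introduces an open-source app, developed by the author, that can rewind and search everything that has happened on screen on Windows. It was mainly inspired by the Mac app Rewind and the Black Mirror episode "The Entire History of You". The app offers several additional benefits: an immutable, local personal memory; memory cues that cannot be lost; recovering work after a software failure; tracing where downloaded data came from; and even digging up information that was overlooked in the past (for the author personally, it has become a callback tool for joking with friends).
The implementation is simple: it uses ffmpeg to record the screen into small 15-minute segment files, then indexes them with the Windows native OCR API and image embeddings. Users can also choose to ignore specified programs or screen regions. Everything can be rewound or searched (by window title, text keyword, or image description) in a polished local web interface. Using this data, the author also added some overview features, such as daily and periodic screen time, day and night summaries, and monthly slideshow generation.
The author takes privacy seriously, so everything happens only on the user's computer and no data is stored in the cloud. Everything is glued together with Python, which keeps it transparent and easy to modify yourself (possibly at the cost of slower performance). There is also no subscription fee: your data is yours, and the app is free to use.
The author is not a professional developer and built the tool out of personal interest and need, so it may not be perfect and you may find some small issues. After using it for a while, however, the author considers it mature and stable enough. If you are interested in the project, you are welcome to discuss it, ask questions, or help improve it.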
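The recording step in the summary (ffmpeg writing 15-minute segment files) can be sketched roughly as follows. This is a minimal illustration, not the author's actual command, which the post does not show. The function name `build_record_command`, the frame rate, the codec, and the file-name pattern are all assumptions. It relies on ffmpeg's `gdigrab` input device for Windows desktop capture and on the `segment` muxer for splitting output by time.

```python
from pathlib import Path


def build_record_command(output_dir, segment_minutes=15, framerate=2):
    """Build an ffmpeg argument list that records the Windows desktop
    into fixed-length segment files.

    The defaults (2 fps, libx264, timestamped .mp4 names) are illustrative
    assumptions, not values taken from the project.
    """
    segment_seconds = segment_minutes * 60
    # With -strftime 1, the segment muxer expands %Y, %m, ... in the file
    # name using the wall-clock time at which each segment starts.
    pattern = str(Path(output_dir) / "%Y-%m-%d_%H-%M-%S.mp4")
    return [
        "ffmpeg",
        "-f", "gdigrab",                # Windows GDI screen-capture input device
        "-framerate", str(framerate),   # low frame rate keeps files small
        "-i", "desktop",                # capture the whole desktop
        "-c:v", "libx264",
        "-pix_fmt", "yuv420p",
        "-f", "segment",                # split the output into multiple files
        "-segment_time", str(segment_seconds),
        "-reset_timestamps", "1",       # each segment starts at timestamp 0
        "-strftime", "1",
        pattern,
    ]
```

Passing the returned list to `subprocess.run` would start recording, assuming ffmpeg is installed and on `PATH`. Each finished segment could then be handed to a separate OCR and embedding step for indexing.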
Post by: haruharuha
Comments:
compsciphd: We built this almost 2 decades ago now (including the ability to scrub to a point in the past and resume execution from there): http://www.cs.columbia.edu/~orenl/papers/sosp07-dejaview.pdf Abstract: As users interact with the world and their peers through their computers, it is becoming important to archive and later search the information that they have viewed. We present DejaView, a personal virtual computer recorder that provides a complete record of a desktop computing experience that a user can playback, browse, search, and revive seamlessly. DejaView records visual output, checkpoints corresponding application and file system state, and captures displayed text with contextual information to index the record. A user can then browse and search the record for any visual information that has been displayed on the desktop, and revive and interact with the desktop computing state corresponding to any point in the record. DejaView combines display, operating system, and file system virtualization to provide its functionality transparently without any modifications to applications, window systems, or operating system kernels. We have implemented DejaView and evaluated its performance on real-world desktop applications. Our results demonstrate that DejaView can provide continuous low-overhead recording without any user noticeable performance degradation, and allows browsing, search and playback of records fast enough for interactive use.
aspenmayer: Similar project for macOS which is also open source: https://rem.ing and https://github.com/jasonjmcghee/rem Previously submitted on HN by the dev with some decent discussion 3 months ago here: https://news.ycombinator.com/item?id=38787892
modeless: Has anyone built something like this using accessibility APIs instead of (or in addition to) OCR? It seems like a waste to OCR everything when you could just get the text directly from the accessibility APIs. Also seems like potentially a good way to connect LLMs to UIs, and something like this would be the way to collect the training data.
msephton: I remember a few projects like this. The first I saw was called Savant Recall, in 2014. It failed to be selected for YC, so it was set free as open source. Napster co-founder Ritter picked it up, renamed it Atlas Recall (2016), gave it a new UI, and secured $20M in funding. A year later it was suddenly shut down. LinkedIn says "acquired by Xinova". Another one I'd heard of was called Apse (2019).
karencarits: For Windows, there is also TimeSnapper [1], not open source, but the developer is sometimes here on Hacker News. [1] https://timesnapper.com/