[Hacker News repost] Hacker News Data Map [180MB]
-
Title: Hacker News Data Map [180MB]
Text:
Url: https://lmcinnes.github.io/datamapplot_examples/hackernews/
Post by: mooreds
Comments:
lucb1e: Maybe add [180MB] to the title, similar to how videos or pdfs are tagged? It starts loading that immediately when you open the page, which would be 18% of my data bundle if I had been on mobile

(This is actually transferred bytes btw, based on seeing ~12MiB/s for ~15 seconds in the system monitor)

Edit: some people are saying they can't view it, especially on mobile browsers. Here's some screenshots:

- Landing overview: https://snipboard.io/YTQRZc.jpg
- Zooming into the center, hovering over an item that is too small to see but the title shows in a tooltip: https://snipboard.io/xOvA47.jpg
- Zoomed in further still, now an individual item can be targeted easily and there are lines delimiting topics (looking like height lines on a map): https://snipboard.io/P6UVAv.jpg
- Hovering over the year selector on the bottom left, same zoom position for comparison: https://snipboard.io/VDW2JI.jpg

Clicking the year seems not to do anything, you can't lock into that view. Clicking a title opens the page, not the discussion thread.

---

Looking into the corresponding GitHub repository (I wonder if they have a bandwidth limit for repositories or if it will foot any bill), <https://github.com/lmcinnes/datamapplot_examples>, there's also a visualization for Wikipedia which is a bit less heavy: https://lmcinnes.github.io/datamapplot_examples/Wikipedia_data_map_example.html (screenshot <https://snipboard.io/M9GRQt.jpg>)
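As a rough check of the figures in lucb1e's comment, the arithmetic can be sketched in Python. The variable names are illustrative, and the bundle size is only inferred from the stated 18%; the comment does not give it directly.

```python
# Rough check of the transfer-size estimate quoted in the comment above.
# Inputs are the commenter's approximate readings, not independent measurements.
rate_mib_per_s = 12   # ~12 MiB/s seen in the system monitor
duration_s = 15       # sustained for ~15 seconds

transferred_mib = rate_mib_per_s * duration_s
transferred_mb = transferred_mib * 1024 * 1024 / 1_000_000

# Assumption: "18% of my data bundle" refers to the 180 MB in the title,
# which would imply a bundle of roughly 1 GB.
implied_bundle_mb = 180 / 0.18

print(f"{transferred_mib} MiB is about {transferred_mb:.0f} MB")
print(f"implied data bundle: about {implied_bundle_mb:.0f} MB")
```

The MiB and MB figures differ by about 5%, which is well inside the uncertainty of an eyeballed "~12MiB/s for ~15 seconds".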
codingdave: It is a cool visualization, so I don't want to diminish the effort to make it in any way. And as an experiment in visualization, it is interesting. (If a bit large and laggy.) But if the authors expect people to use it to navigate content, it has a few problems:

1) The topics don't seem to be hierarchical, so as I drill down on one area, I get all kinds of things that don't seem related. I have no idea what I'm missing unless I zoom into the whole thing.

2) I don't know where my browser is going when I click a link. That is a security problem.

3) I cannot tell how this data is sourced. Are these all the links posted to HN? Just the ones that got upvotes? Something else? Because while we have some great links here, we also get a lot of stinkers.

4) Much of the value of HN is the discussions. I didn't see a way to navigate to discussions related to any of the links.
nighthawk454: Repo: https://github.com/lmcinnes/datamapplot_examples

Also, lmcinnes is the author of UMAP and HDBSCAN!
anonu: I like how the Web Development and User Experience grouping is way outside the central bubble.

Nonetheless, great visualization of a lot of data. I need to learn more about this:

UMAP: https://umap-learn.readthedocs.io/en/latest/

Nomic-Embed: https://www.nomic.ai/blog/posts/nomic-embed-text-v1

The visual groupings aren't perfect. For example, there are quite a few COVID-19 tagged articles before 2020.
kissgyorgy: It would be way more usable if a time range could be selected and it would list the actual threads to the results.
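The time-range filter and thread listing that kissgyorgy and codingdave ask for could look roughly like the sketch below. The `Story` record and its fields are hypothetical, since the thread does not say how the data map stores its items. The only fixed point is the Hacker News discussion URL pattern, `https://news.ycombinator.com/item?id=<id>`.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class Story:
    """Hypothetical record; the data map's real schema is not given in the thread."""
    item_id: int          # Hacker News item id
    title: str
    url: str              # external link that a title click currently opens
    posted_at: datetime   # submission time (UTC)


def discussion_url(item_id: int) -> str:
    """Build the Hacker News discussion-thread URL for an item id."""
    return f"https://news.ycombinator.com/item?id={item_id}"


def threads_in_range(stories, start, end):
    """Return (title, discussion URL) pairs for stories with start <= posted_at < end."""
    selected = [s for s in stories if start <= s.posted_at < end]
    selected.sort(key=lambda s: s.posted_at)  # oldest first
    return [(s.title, discussion_url(s.item_id)) for s in selected]


if __name__ == "__main__":
    # Made-up sample data, for illustration only.
    sample = [
        Story(10, "Example A", "https://example.com/a",
              datetime(2019, 5, 1, tzinfo=timezone.utc)),
        Story(20, "Example B", "https://example.com/b",
              datetime(2021, 3, 9, tzinfo=timezone.utc)),
    ]
    start = datetime(2021, 1, 1, tzinfo=timezone.utc)
    end = datetime(2022, 1, 1, tzinfo=timezone.utc)
    for title, link in threads_in_range(sample, start, end):
        print(title, link)
```

Using a half-open range (`start <= posted_at < end`) means adjacent year selections never count the same story twice.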