Something I often wanted to see on Hacker News was an automatic link to a cached copy of a submission, so that when the linked site goes down, there’s still something to read. I thought about taking it one step further and making the text of the article automatically available.
When a story is submitted, the site would cache the article text (using Diffbot) and show it as a collapsed block in the story.
The cached text would also be used as the item text in the RSS feed and be included in the Sphinx index, so that articles could be found through the search engine.
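To make the flow concrete, here is a minimal sketch in Python of what could happen at submission time. Everything in it is an assumption for illustration: the `Story` class, the function names, and especially `extract_text`, which is a hypothetical stand-in for whatever call fetches the article text from Diffbot. It is not how the site is actually implemented.

```python
from dataclasses import dataclass
from html import escape
from typing import Callable, Optional


@dataclass
class Story:
    title: str
    url: str
    cached_text: Optional[str] = None  # extracted article text, if any


def cache_story_text(story: Story, extract_text: Callable[[str], str]) -> Story:
    """Fill in story.cached_text using the supplied extractor.

    extract_text is a hypothetical placeholder for the Diffbot request.
    If it fails, the story is simply left without cached text.
    """
    try:
        text = extract_text(story.url).strip()
    except Exception:
        text = ""
    story.cached_text = text or None
    return story


def collapsed_block(story: Story) -> str:
    """HTML for the story page: cached text hidden until expanded."""
    if story.cached_text is None:
        return ""
    body = escape(story.cached_text)
    return f"<details><summary>Cached text</summary><p>{body}</p></details>"


def rss_description(story: Story) -> str:
    """Escaped text for the RSS item, falling back to the bare URL."""
    return escape(story.cached_text or story.url)


def search_document(story: Story) -> dict:
    """Fields to hand to the search indexer (Sphinx, in this proposal)."""
    return {"title": story.title, "url": story.url, "body": story.cached_text or ""}
```

The point of passing the extractor in as a function is that a slow or failing extraction service never blocks a submission; the story just shows up without the collapsed block.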
Of course, there are copyright concerns with caching other people's pages, not to mention the lost traffic and possible ad revenue that might result from people reading the story here or through RSS rather than clicking through. Diffbot doesn’t seem to honor robots.txt or the meta tags that would otherwise restrict archiving by Google or the Wayback Machine.
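One possible mitigation would be for the site to check a page's robots.txt itself before caching anything. The sketch below uses Python's standard `urllib.robotparser` and assumes the robots.txt body has already been downloaded; the `may_cache` name and the `story-cacher` user agent are made up for illustration, and it does not cover meta tags.

```python
from urllib.robotparser import RobotFileParser


def may_cache(robots_txt: str, url: str, user_agent: str = "story-cacher") -> bool:
    """Return True if the given robots.txt rules allow user_agent to fetch url.

    robots_txt is the raw text of the site's robots.txt file; the
    user agent name is a hypothetical one chosen for this example.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


rules = "User-agent: *\nDisallow: /private/\n"
print(may_cache(rules, "http://example.com/private/post"))
print(may_cache(rules, "http://example.com/blog/post"))
```

With these rules the first URL is disallowed and the second is allowed, so only the second story would get cached text.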
What do you guys think?