Half a croissant, on a plate, with a sign in front of it saying '50c'
h a l f b a k e r y
You gonna finish that?

idea: add, search, annotate, link, view, overview, recent, by name, random

meta: news, help, about, links, report a problem

account: browse anonymously, or get an account and write.

user:
pass:
register,


               

Garbage Collection for the Web

If it's not linked, it vanishes.
  (-1)
(-1)
  [vote for,
against]

How many files there must be on the web that just aren't looked at any more? Than in fact can't be looked at because they're not linked? That are in a sense 'lost', but that somebody still bothers to host.

I suggest a service that uses web crawlers similar to a search engine's, and uses it to build a map of what's accessible. It than matches this against the complete list of files served from your machine, and lets you know when they are unreachable.

So in the process of rearranging your site you delete the link to an image, but don't delete the image itself. A few days later (just to be certain), you get an email:

" The file ....... .png is unreachable from your site or anywhere else on the web. Would you like to delete it? [yes], [no] "

Could save lots of server space.

sadie, Dec 23 2002

(?) MS Frontpage http://www.microsof...e/using/default.htm
[Amos Kito, Oct 04 2004]

[link]






       A noble cause but, I have files on my site that intentionally have no links pointing to them.   

       Additionally, the links may be built dynamically at the time the page is requested. It would be quite a chore to automatically analyze script code, databases, etc. to make sure a link doesn't exist.
half, Dec 23 2002
  

       Isn't this the way MS Frontpage [link] works? Move or remove a resource, and the software fixes everything for you. But then you'd have to use MS Frontpage --- blecch.
Also, what [half] said. I hide all kinds of junk around my server on purpose. But there's a lotta stuff I'm just too lazy to clean up. Looks worse than my garage.
There's probably a script that can tidy the mess. However, a service that does it for me might come in handy. +
Amos Kito, Dec 23 2002
  

       I use Macromedia Dreamweaver, a Frontpage competitor (though perhaps not for long), and it has a feature like this as well.
krelnik, Dec 23 2002
  

       There are servers/hosts that sell space for files that are never intended to be linked. Cleaning can be done by a stadard FTP program.
ImBack, Dec 23 2002
  

       Firstly, I have to agree with others mentioning that sometimes you dont want files looked at. I use my website for personal backup space.   

       Secondly, the files would still be linked to from the google and Wayback Machine caches!
ironfroggy, Dec 26 2002
  

       // the files would still be linked to from the google and Wayback Machine caches //   

       Don't count on it. These machines all operate by crawling - following links from page to page, the same mechanism i'm talking about. If a page is unreachable by links, then the chances are google can't find it either.
sadie, Dec 27 2002
  
      
[annotate]
  


 

back: main index

business  computer  culture  fashion  food  halfbakery  home  other  product  public  science  sport  vehicle