h a l f b a k e r y

computer:compression

Labeling Gitifier

A lossless way to compress files on hard drives...

(+4)

[Problem]

We tend to have multiple copies of files, because it took less time to buy a new hard drive than to go through the files, find all the very similar ones, and choose the latest version of each.

Moreover, we avoided deleting similar files in different folders, because those directories provided context for understanding what the files did in different situations.

We forgot where these redundancies were, and now, every time we make backups of disks, we just copy over everything, creating exponential growth of space requirements.

[Solution]

If a file is found under several different hierarchies, add it to a git repository as the same file. Arrange the copies by time, and commit.

For each commit, make the commit message the location where the file was found.

Commit by modification times.
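A minimal sketch of the grouping and ordering step, under stated assumptions: copies are matched by file name alone (the idea does not say how "the same file" is recognised), and `plan_commits` is a hypothetical helper name. It only builds the commit plan; actually running `git add` and `git commit -m <location>` for each step is left out.

```python
from collections import defaultdict
from pathlib import PurePosixPath


def plan_commits(found_files):
    """Build an oldest-first commit plan for files found in several places.

    found_files: iterable of (path, mtime) pairs from a disk scan.
    Assumption: two paths are copies of one file if the file names match.
    Returns {file name: [(mtime, location), ...]} sorted by mtime, where
    each location would become the message of one commit.
    """
    groups = defaultdict(list)
    for path, mtime in found_files:
        p = PurePosixPath(path)
        groups[p.name].append((mtime, str(p.parent)))
    # Only files that live under more than one hierarchy are of interest.
    return {
        name: sorted(copies)
        for name, copies in groups.items()
        if len(copies) > 1
    }


if __name__ == "__main__":
    scan = [
        ("/home/me/work/report.txt", 300),
        ("/backup/2014/report.txt", 100),
        ("/home/me/notes.txt", 200),
    ]
    for name, copies in plan_commits(scan).items():
        for mtime, location in copies:
            print(f"{name}: commit version from t={mtime}, message {location!r}")
```

Matching by name is the crudest possible choice; a real tool would more likely compare content similarity as well.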

Then, create a browser for files stored this way that can browse multiple virtual hierarchies as defined by labels.

Moreover, based on file location statistics, automatically suggest the best location for each file to be.
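One way the suggestion step might look, assuming "best location" simply means the directory where copies of the file turned up most often. That rule and the name `suggest_location` are illustrative assumptions, not part of the idea as posted.

```python
from collections import Counter


def suggest_location(locations):
    """Suggest a home for a file from the directories its copies were found in.

    locations: list of directory paths, one entry per copy found.
    Assumption: the most frequent directory is the "best" one. On a tie,
    the directory that appears first in the list wins.
    Returns None when there is nothing to go on.
    """
    if not locations:
        return None
    counts = Counter(locations)
    return counts.most_common(1)[0][0]


if __name__ == "__main__":
    seen = ["/projects/thesis", "/backup/2014", "/projects/thesis"]
    print(suggest_location(seen))
```

Other statistics would fit just as well, for example weighting recent locations more heavily than old ones.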

[Expectation]

Significant reduction of space + the ability to easily see a file in multiple contexts + discovery of a better directory structure for organizing your files.

—	Mindey, Apr 19 2015

Related idea File_20system_20sup...king_20hard_20links
Your idea renders mine obsolete. [scad mientist, Apr 21 2015]

I feel your pain.

Is the invention basically a shell script that scans the file system and makes intermittent calls into GIT?

Is some UNIX ninja going to lose their marbles trying to implement it in one line using "find"?

—	pertinax, Apr 21 2015

"Tired of young people ? Can't seem to make them see your point of view ? Introducing the La Beling Gitifier <shows device resembling a Buck Rogers ray gun>. . . One quick shot to the sternum and they'll be ranting right along with you . . ."

—	FlyingToaster, Apr 21 2015

[+] I had an idea that accomplishes some of the goals on a more limited scale but would be easier to implement <link>. It appears that the Gitifier would need to be incorporated into the file system and completely change how it worked, but would accomplish everything I wanted with my idea and much much more.

While we're at it, let's let GIT keep a history of each file. Obviously that would fill up the hard drive too fast if all history was stored, but history could be pruned as needed to make space, leaving more recent changes tracked in case the user needs to go back to an old version of any file.

—	scad mientist, Apr 21 2015

[pertinax], yes, what I described could probably be done in one line, as you say. Any UNIX ninjas?

[scad mientist], I see. Indeed!

// While we're at it, let's let GIT keep a history of each file. Obviously that would fill up the hard drive too fast if all history was stored //

If history is stored as changes to files, rather than as copies of files, then it would not fill the drive up fast.

—	Mindey, Apr 22 2015

//but history could be pruned as needed to make space

I vote for losing 1982.

If you could do it on a repeated basis we could have a Millennium party every year and avoid whatever is supposed to happen with that Mayan calendar.

—	not_morrison_rm, Apr 22 2015

Doesn't git (and VCSes in general) only work on files that are text-based? I thought that was one of the reasons why many file formats (MS Office, CadSoft Eagle, …) have moved from being binary to being XML-based recently. But many formats are still binary (images, videos, …), so I don't see how this would work as well for those files.

—	notexactly, Apr 26 2016
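On the question above: git stores file contents as raw bytes and names each one by a hash of those bytes, so binary files are accepted, and two byte-identical files share one stored object. What binary formats lose is mainly readable diffs and merges. The sketch below reproduces the object ID git assigns to a file's contents, assuming the default SHA-1 object format; `git_blob_id` is a hypothetical helper name.

```python
import hashlib


def git_blob_id(data: bytes) -> str:
    """Return the ID git gives a blob with these contents (SHA-1 format).

    Git hashes the header "blob <size>\\0" followed by the raw bytes,
    with no regard for whether the bytes are text or binary.
    """
    header = b"blob %d\0" % len(data)
    return hashlib.sha1(header + data).hexdigest()


if __name__ == "__main__":
    image_bytes = bytes([0x89, 0x50, 0x4E, 0x47, 0x00, 0xFF])  # arbitrary binary data
    copy_elsewhere = bytes(image_bytes)
    # Identical contents give identical IDs, so git keeps a single object.
    print(git_blob_id(image_bytes) == git_blob_id(copy_elsewhere))
```

Files that are merely similar rather than identical get different IDs, so any saving there depends on how well git's later packing copes with the format.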

Whilst not understanding the idea, I would like to comment on it herewith.

To the extent that I _do_ understand, GIT is some system for managing files to avoid redundancy, yes? OK, so how about a simpler option.

Have an application that runs in the background. Whenever it finds two identical files on the disk, and provided neither copy is being edited at that moment, it simply deletes one copy and replaces it with an alias. (Do aliases exist on non-Mac systems? I presume so.)

The alias will sit there in the directory where the file originally was, and will therefore be fully findable and will retain its context. Problem solved, no?

An extension of this system could save yet more space, by replacing the repetitive parts of large files with aliases, and then reinstating them on the fly. Maybe.

—	MaxwellBuchanan, Apr 26 2016
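The background deduplicator described above could be sketched roughly as follows. This assumes symbolic links stand in for aliases, that "identical" means byte-for-byte equal (detected with a SHA-256 digest), and it makes no attempt to check whether a file is being edited. The function names are hypothetical.

```python
import hashlib
import os
from collections import defaultdict


def find_duplicates(root):
    """Return groups of paths under root whose contents are identical.

    Each group is a sorted list of two or more paths. Existing symlinks
    are skipped so that earlier replacements are not counted again.
    """
    by_digest = defaultdict(list)
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            if os.path.islink(path):
                continue
            with open(path, "rb") as fh:
                digest = hashlib.sha256(fh.read()).hexdigest()
            by_digest[digest].append(path)
    return sorted(sorted(paths) for paths in by_digest.values() if len(paths) > 1)


def replace_with_symlinks(paths):
    """Keep the first path in the group and turn the rest into symlinks to it."""
    keep, *rest = paths
    for path in rest:
        os.remove(path)
        os.symlink(os.path.abspath(keep), path)


if __name__ == "__main__":
    for group in find_duplicates("."):
        print("identical:", group)  # inspect before calling replace_with_symlinks
```

Reading whole files into memory keeps the sketch short; large files would call for hashing in chunks. Creating symlinks may also need extra privileges on Windows.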

I have some experience that seems to contraindicate the use of aliases or other OSes' equivalents. I have a collection of reference documents (scientific papers, datasheets, etc.) in my Google Drive. I had them organized into topic folders, but recently I found that the folders I had weren't optimal, so I decided to rearrange the files. At the same time, I decided to consolidate all of the files into one folder and put only aliases to them in the topic folders, to be able to put a file in multiple categories without duplication. This worked great until I tried to access it from my Windows computer or the Google Drive web interface, which don't understand Mac aliases. I could use Windows shortcuts (equivalent to aliases) but Mac OS X doesn't understand those. I could use symlinks, but I found some reason that I don't remember for those to not work either. I also thought of using hardlinks, but I realized that Google Drive would see those as separate files, resulting in duplication in the cloud and then probably in the local folders when it synced again.

So. I'm currently thinking I have to build some kind of document management system to keep track of my reference documents (and it has to be able to sync between my computers and ideally also be accessible from mobile and web). I would like to be able to tag documents with multiple tags each, rather than having each one in just one folder (which is why I started the alias thing in the first place). I considered Evernote, which would work perfectly for that, but I use Evernote Basic (the free version), and the 60 MB/month upload cap is an order of magnitude too small.

—	notexactly, Apr 27 2016
