Hello fellow lobsters,
I need to organize my images and photo’s and I’d like to ask you for tips and tricks. I’ve stored most of my images on a private Nextcloud instance, and a few years ago I gave my girlfriend her own account where she has uploaded hers.
As time progressed this became a bit messy. Not only are there literal duplicates - there are also similar files, i.e. resized and recompressed images. For example from our Signal caches, and images that were piped through a nifty Android app called Send Reduced Pro before publicising them or sharing with friends and family, who don’t need the huge originals.
My plan is to go to the data center and copy both our Nextcloud data directories to a temporary disk, so that I can process them at home more easily, on my Linux desktop computer. I know of a program called fdupes, that will allow me to find exact duplicates.
But how can I find similar pictures, that look the same even though their checksums differ? I would rather not resort to using online tools, I know for example Google supports a ‘similar image’ search, but are there standalone programs that can do this? I’d prefer to keep only the high-res versions.
The end result will be limited to saving pictures in directories - I’m aware that there are (pseudo-)filesystems that support tagging for example, and while those do seem useful, because I will move them back to my Nextcloud instance this isn’t an option for as far as I’m aware. Although it might still be an interesting thing to share, for others who might benefit from such things.
Looking forward to reading your input!