superkoder / picdedupe
Simple Python script to dedupe many pictures (macOS only!)
License: GNU General Public License v3.0
If a better file format of the same image is detected (e.g. an HEIC of an existing JPEG), it should take the following steps:
(filename.heic.jpg).
Add a {filename}.txt with info on what it did and where, for the User's verification.
As always, it is up to the User to verify this and to only do this when backups have been made.
It can currently detect file dupes. But nothing is done yet, apart from printing that info.
It should probably:
Move the dupe into a ./_DUPES subfolder.
Add a {filename}.txt with info on why it is considered a dupe.
It will be the User's task to check this folder and, eventually, remove it.
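The handling proposed above could be sketched roughly as follows. This is only an illustration, not the script's actual code: the helper name quarantine_dupe, its signature, and the layout of the note file are all assumptions.

```python
import shutil
from pathlib import Path


def quarantine_dupe(dupe: Path, original: Path, reason: str) -> Path:
    """Move `dupe` into a _DUPES subfolder next to it and leave a note.

    Hypothetical helper: name, signature and note format are assumptions.
    """
    dupes_dir = dupe.parent / "_DUPES"
    dupes_dir.mkdir(exist_ok=True)

    target = dupes_dir / dupe.name
    if target.exists():
        # Never overwrite anything; leave the decision to the User.
        raise FileExistsError(f"{target} already exists")

    shutil.move(str(dupe), str(target))

    # "{filename}.txt" is interpreted here as "<file name>.txt" (assumption).
    note = dupes_dir / f"{dupe.name}.txt"
    note.write_text(
        f"Moved from: {dupe}\n"
        f"Considered a dupe of: {original}\n"
        f"Reason: {reason}\n",
        encoding="utf-8",
    )
    return target
```

Refusing to overwrite an existing file in _DUPES keeps the operation reversible, which matters because the User is expected to review the folder by hand.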
If a short movie of a picture (e.g. Apple's Live Photo) is detected, it should take the right steps:
filename.jpg (or similar)
filename.jpg.mov (or similar)
Add a {filename}.txt with info on what it did and where, for the User's verification.
As always, it is up to the User to verify this and to only do this when backups have been made.
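One simple way to spot candidate picture/movie pairs is to look for files in the same folder that share a base name. The sketch below assumes exactly that; it is not how the script currently detects them, and the function name and the extension sets are hypothetical.

```python
from pathlib import Path
from typing import Dict, List, Tuple

# Assumed extension sets; adjust to whatever the script actually scans.
IMAGE_EXTS = {".jpg", ".jpeg", ".heic"}
MOVIE_EXTS = {".mov"}


def find_live_photo_pairs(folder: Path) -> List[Tuple[Path, Path]]:
    """Return (image, movie) pairs that share a base name in `folder`."""
    by_stem: Dict[str, List[Path]] = {}
    for path in sorted(folder.iterdir()):
        if path.is_file():
            by_stem.setdefault(path.stem.lower(), []).append(path)

    pairs = []
    for files in by_stem.values():
        images = [p for p in files if p.suffix.lower() in IMAGE_EXTS]
        movies = [p for p in files if p.suffix.lower() in MOVIE_EXTS]
        # Only pair up unambiguous cases: one image, one movie.
        if len(images) == 1 and len(movies) == 1:
            pairs.append((images[0], movies[0]))
    return pairs
```

A name match alone is weak evidence, so a real implementation would probably also want to compare capture times before renaming anything.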
If a higher quality version of the same image is found, it should take the right steps:
(filename_lowres.jpg or filename_small.jpg).
Add a {filename}.txt with info on what it did and where, for the User's verification.
As always, it is up to the User to verify this and to only do this when backups have been made.
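As a rough illustration, "higher quality" could be approximated by pixel count, with the smaller file getting a _lowres suffix. Both helpers below are hypothetical, and treating resolution as the only quality signal is an assumption.

```python
from pathlib import Path
from typing import Tuple


def pixel_count(size: Tuple[int, int]) -> int:
    """Number of pixels for a (width, height) pair."""
    width, height = size
    return width * height


def lowres_name(path: Path) -> Path:
    """filename.jpg -> filename_lowres.jpg (same folder)."""
    return path.with_name(f"{path.stem}_lowres{path.suffix}")


def is_higher_res(size_a: Tuple[int, int], size_b: Tuple[int, int]) -> bool:
    """True if image A has strictly more pixels than image B."""
    return pixel_count(size_a) > pixel_count(size_b)
```

For example, lowres_name(Path("pics/filename.jpg")) gives pics/filename_lowres.jpg.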
In today's version, we find a lot of false positives for similar pictures. That is because we compare metadata, and for some pictures, there is hardly any metadata to work with. It could be that pictures from the same day, with the same resolution, all map to each other.
We should be resilient against this, by not trying to do anything when the metadata is simply too weak to be conclusive. We should add a {filename}.txt file to explain the problem, though. Just so that the User is aware and can take it into account.
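A minimal sketch of such a guard: only compare two pictures when enough meaningful metadata fields are present on both sides, and otherwise report "inconclusive". The field names, the threshold of three shared fields, and the function name are all assumptions made for illustration.

```python
from typing import Optional

# Hypothetical field names; the real script reads Spotlight metadata.
STRONG_KEYS = ("capture_date", "camera_model", "exposure_time",
               "focal_length", "iso")


def looks_similar(meta_a: dict, meta_b: dict,
                  min_shared: int = 3) -> Optional[bool]:
    """Compare two metadata dicts.

    Returns True or False when the comparison is conclusive, and None
    when too few strong fields are present in both dicts.
    """
    shared = [k for k in STRONG_KEYS
              if meta_a.get(k) is not None and meta_b.get(k) is not None]
    if len(shared) < min_shared:
        return None  # too weak to be conclusive: do nothing, just report
    return all(meta_a[k] == meta_b[k] for k in shared)
```

Returning None instead of False lets the caller distinguish "different pictures" from "cannot tell", which is the case that deserves an explanatory note for the User.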
We are currently only looking for the most popular picture & video formats. We should expand this to include RAW files.
If we are certain about a RAW, we should take the right steps:
filename.jpg.CR2 and move it to the same location as the original. Alternatively, we might want to put all raws into a ./_RAW subfolder?
Add a {filename}.txt with info on what it did and where, for the User's verification.
As always, it is up to the User to verify this and to only do this when backups have been made.
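The naming scheme mentioned above (filename.jpg.CR2 next to the original) could be computed like this. The list of RAW extensions is an assumed, non-exhaustive selection, and both helper names are hypothetical.

```python
from pathlib import Path

# A few common RAW extensions (assumed selection, not exhaustive).
RAW_EXTS = {".cr2", ".nef", ".arw", ".dng", ".raf", ".orf", ".rw2"}


def is_raw(path: Path) -> bool:
    """True if the file extension looks like a camera RAW format."""
    return path.suffix.lower() in RAW_EXTS


def raw_target(raw: Path, original: Path) -> Path:
    """Where a RAW would go: next to the original, as e.g. filename.jpg.CR2."""
    return original.with_name(original.name + raw.suffix)
```

This only computes the destination path; the actual move, and the choice between this layout and a ./_RAW subfolder, is left open as in the text.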
When we have high-quality metadata that clearly indicates that an image was captured on a different date than what the filesystem says, we should be able to correct the filesystem's ctime/mtime.
We should still add a {filename}.txt so the User is aware of our change.
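A minimal sketch of the mtime part, using os.utime from the standard library. Note that os.utime only sets the access and modification times, so correcting ctime is not covered here. The helper name is hypothetical, and where the capture date comes from is left out.

```python
import os
from datetime import datetime
from pathlib import Path


def set_mtime(path: Path, captured: datetime) -> float:
    """Set the file's modification time to `captured`.

    Keeps the access time unchanged and returns the previous mtime so
    the caller can record it in the note for the User.
    """
    stat = path.stat()
    old_mtime = stat.st_mtime
    os.utime(path, (stat.st_atime, captured.timestamp()))
    return old_mtime
```

Returning the old value makes it easy to write both timestamps into the {filename}.txt, so the change can be undone by hand if the metadata turns out to be wrong.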
This is made for macOS specifically (it uses the Spotlight commands). So it needs to check that it is, in fact, running on macOS, to prevent terrible things from happening.
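Such a check could look like the sketch below, which relies on platform.system() returning "Darwin" on macOS. The function names and the optional system_name parameter (there only to make the check testable) are assumptions.

```python
import platform
import sys
from typing import Optional


def is_macos(system_name: Optional[str] = None) -> bool:
    """True when running on macOS; `system_name` overrides detection."""
    if system_name is None:
        system_name = platform.system()
    return system_name == "Darwin"


def ensure_macos(system_name: Optional[str] = None) -> None:
    """Exit with an error message unless we are on macOS."""
    if not is_macos(system_name):
        sys.exit("This script only runs on macOS (it relies on Spotlight).")
```

Calling ensure_macos() as the very first step of the script would stop it before any file is touched on another platform.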