Comments (3)
I like that, I had to recently make a Python script to do that. Alas, I hadn't idea how to implement "close, but not exact copies" hashing system.
from czkawka.
There are hashing techniques that can not only tell if two pieces of data are identical, but also provide a measure of how much they differ from eachother.
Example: SimHash (similarity hashing), MinHash, Jaccard similarity (mathematical measure used to quantify the similarity between two sets or lists of elements) etc.
from czkawka.
Yeah, I know, thanks, but my problem was implementing it for specifically for directories - I think I'd have to perpetual hash contents of both directories and then compare it somehow - keeping in mind there are elements that doesn't fit set.
from czkawka.
Related Issues (20)
- Hardlink files found by name
- crash while sorting by selection HOT 1
- AVIF support
- czkawka_gui crashing on "Remove outdated results from cache." HOT 2
- M1 MAC - could not compile - build failed (Unrecognized option: 'diagnostic-width') HOT 2
- The program crashes when processing photos with incomplete filenames (e.g., ".jpeg", ".png", and likely others, untested). HOT 1
- libraw support breaks compilation for me
- krokiet compiles very long on Windows HOT 2
- [Feature request] mark a folder as anti-reference
- Flatpak version is out of date HOT 1
- Hash Size = 40 on Krokiet v7.0.0 (Windows)
- Ignore duplicates unless they're in the same sub-folder
- Finding duplicate Directory Structures HOT 1
- Reference Path Image Preview on Krokiet v7.0.0 (Windows)
- Slow Directory Browsing on Krokiet v7.0.0 (Windows)
- Bulk rename and sort by date
- How to ignore a group in subsequent searches?
- Additional Info Columns for Similar/Duplicate Videos | Resolution, Codec, Bitrate HOT 1
- Delete Only Files and not Groups?
- FInd Not duplicated files HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from czkawka.