Comments (3)
@alex-from-intuita
As far as I remember, this is related to the limitations of one of the bash commands used in the tool. As for me, I ended up using a higher similarity threshold to deal with that. Also, plz visit our lab website at https://sail.cs.queensu.ca/ to get more information about the state of our research. I am particularly studying the quality of smart contracts.
Best,
Amir.
from deckard.
Error: the structure supports at most 2097151 points (3238525 were specified).
The error is an inherent limitation in the LSH library used in Deckard; it can't handle more than 2million vectors at a time.
Using 0.79 similarity is not recommended as it often leads to many false positives.
Use a higher similarity, say 0.90.
Or as another alternative, split your input dataset into smaller ones before feeding it into Deckard. Then, after getting clone results for every smaller dataset, de-duplicate the vectors that have been identified as clones into one and then merge all the vectors left into one dataset to run Deckard again (need to write your own scripts for these, and there may be false negatives due to the split/merge process).
from deckard.
@sail-amir i'd love to know if you could get this to work and am curious to learn more about your research! let's stay in touch! (linkedin)
from deckard.
Related Issues (20)
- What parameters are fine? Need help HOT 1
- Command line options for filter IDs not implemented HOT 3
- Vec generator failure HOT 3
- how to use a slice ?
- Crash on "return A?B:C" HOT 1
- vector generation HOT 3
- Build fails HOT 7
- Clone detection failure?need help HOT 3
- build fails HOT 8
- typefile and nodefiles
- Error: problem in vec generator step. Stop and check logs in times/
- post_cluster file is 0 bytes HOT 1
- Building errors
- Any chance to update this to PHP 7 or 8? HOT 1
- Why does Deckard act differently from one run to another?
- Problem in running Deckard for C project
- Building error HOT 2
- Upgrade to Python 3 HOT 3
- build fails HOT 6
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from deckard.