Comments (5)
yes, will push the necessary error handling shortly
from finch-clustering.
Hi, yes, looks like in your case the FINCH fist partition has only 1 cluster? In general it wont provide you more clusters than you have in the very first partition. while I will add the necessary error handling, for now you can check the impact of using different distance metrics, it supports one of ['cityblock', 'cosine', 'euclidean', 'l1', 'l2', 'manhattan']. may be one of these will not cause to merge your data into one cluster in the very first step. As for having a better way to deliver clusters that are more than the FINCH first merge step, it is also possible but requires more though and time to think of an algorithm that can break some clusters to achieve this.
from finch-clustering.
I did try different metrics, it was the same error for both manhattan and cosine, but euclidean did work. I could do a loop to try the other metrics until no IndexError
is raised, but there will be cases, where no metric works, I suppose.
If it helps, I can provide the data.
from finch-clustering.
sure , please provide the data, I can also have a look. however if first partition is already 1 cluster then it wont currently split it into two. In general if you first merge results into a single cluster, that means the data has very similar close points/samples which really should be in one cluster.
from finch-clustering.
Data is here, loadable with numpy.loadtxt()
. Shape is (42, 256)
and only euclidean
has more than one cluster.
If data is not sepparable, it should not terminate with an exception, but maybe return a single cluster with a warning?
from finch-clustering.
Related Issues (20)
- features for Hollywood and MPII Cooking 2 HOT 1
- The code for TW-FINCH is not available
- TWFinch code missing FS "Eval" option; unable to reproduce accuracy HOT 1
- errors when run the run_on_dataset.m HOT 8
- There is a bug when using pynndescent.NNDescent HOT 2
- ValueError: all the input arrays must have same number of dimensions, but the array at index 0 has 4 dimensions(s) and the array at index 1 has 2 deminesion (s). HOT 1
- TW-FINCH feature extraction method HOT 1
- Is there any randomness in the clustering results? HOT 1
- element of adjacent matrix may greater than 1 HOT 4
- Different Clustering results when using python and matlab implementation HOT 2
- Finch Algo 2
- Segmentation fault for large dataset of 5M datapoints of 1024 dimensions HOT 1
- Unable to replicate numbers HOT 1
- TWFinch code missing YTI without 75% background option; unable to reproduce accuracy HOT 5
- Replace `sklearn` with `scikit-learn` in `setup.py` HOT 1
- If the way F1-scores is calculated is different in matlab and python HOT 3
- How to get the midpoint hit criterion for the MPII? HOT 1
- `sklearn` is still a dependency in `setup.py` HOT 1
- Error when runninng TW_FINCH and specifying the number of clusters.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from finch-clustering.