Comments (4)
This is because your node IDs should be factored when they get read into the graph.
Eg. the IDs of the nodes should start at 0 and the maximum node ID should be equal to the number of nodes.
CSRGraphs currently only supports up to 4B nodes (uint32_max) -- anyways that's about as much as you can fit on a single machine. Since you're passing in invalid node IDs here it explodes the logic.
I could fix it when I'm back next week, but in the mean time try factoring your node IDs.
from nodevectors.
Thanks for the response. I have factored the ids but get the following error -
~/anaconda3/envs/python3/lib/python3.6/site-packages/csrgraph/graph.py in init(self, data, nodenames, copy, threads)
107 raise ValueError(f"""
108 Out of bounds node: {max_idx}, nnodes: {self.nnodes}
--> 109 """)
110 # Manage threading through Numba hack
111 if type(threads) is not int:
ValueError:
Out of bounds node: 1897, nnodes: 956
Number of nodes in my edgelist are 1898, I am not sure how nnodes = 956. Attached sample edgelist for debugging.
sample_edgelist_0712.txt
from nodevectors.
@VHRanger any update here?
from nodevectors.
I'm tracking the changes for this in the CSRGraph repo: VHRanger/CSRGraph#3
I made some changes there which should make it work. Please test that it fixes your problem. If it does, I'll push the fix to the pip version of the CSRGraphs package (and update the dependency on nodevectors afterwards).
I'll close the issue here for now since this is a CSRGraphs issue, not really a nodevectors issue.
from nodevectors.
Related Issues (20)
- Embedding a VERY LARGE graph, upcoming? HOT 2
- When saving large graph, creating a temporary folder will cause the system disk resources to be exhausted. HOT 1
- Issue with gensim 4.0.0+ HOT 3
- is it possible to split n2v to generate walks only? HOT 4
- node2vec uses CBOW instead of skip-gram HOT 4
- Setting value of seed to make Node2vec embedding repeatable. HOT 1
- Print training progression (node2vec)? HOT 1
- Continue fitting process HOT 2
- Has node2vec implementation been updated to use skip-gram as default? HOT 3
- About painting HOT 1
- defining random state or seed option parameters HOT 3
- Why is generating walks so slow with non-default parameters? HOT 3
- word2vec parameters changed HOT 3
- Problem with underlying Word2vec HOT 1
- G.mat got an asymmetric sparse matrix
- ProNE option: "inconsistent shapes" error
- Node2Vec:About the return_weight and neighbor_weight
- ProNE multithread HOT 1
- NetworkX 3.0 remove adj_matrix in version HOT 1
- Old parameter shows up in Word2Vec call
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from nodevectors.