Comments (4)
Thanks for your interest in our project.
It is possible to support Euclidean distance. Please take a look at the "score" package:
A score function implementation has two sets of APIs: one for dense vector sets and one for sparse vector sets. The rest of the API methods are all event hooks.
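As a rough illustration of what the dense variant would have to compute, here is a minimal sketch of Euclidean distance over two dense vectors. The class name `EuclideanSketch` and the method `distance` are hypothetical and are not part of simbase; the actual dense score signature in the "score" package may differ, and this sketch ignores the event hooks entirely.

```java
// Hypothetical helper, not simbase's API: Euclidean (L2) distance on dense vectors.
final class EuclideanSketch {
    private EuclideanSketch() {}

    /** Returns the Euclidean distance between two dense vectors of equal length. */
    static float distance(float[] source, float[] target) {
        if (source.length != target.length) {
            throw new IllegalArgumentException("dimension mismatch");
        }
        double sum = 0.0; // accumulate in double to limit rounding error
        for (int i = 0; i < source.length; i++) {
            double d = source[i] - target[i];
            sum += d * d;
        }
        return (float) Math.sqrt(sum);
    }

    public static void main(String[] args) {
        float[] a = {0f, 3f};
        float[] b = {4f, 0f};
        System.out.println(distance(a, b)); // 3-4-5 triangle: prints 5.0
    }
}
```

Note that a distance gets smaller as vectors get more similar, so code that ranks by score would need to sort in the opposite direction from a similarity such as cosine.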
If you could implement this feature yourself, that would be highly appreciated. Otherwise we can take it on, but it would not be done until late next week.
Thanks again.
from simbase.
I have made a quick implementation, without verification or tests. Please check changeset 099ecf1 and help us review it. If there are no problems, I will close the issue tomorrow.
@bwlim, please give us feedback on this issue. Thanks!
from simbase.
Supporting Manhattan distance as well seems very good, thanks!
However, I couldn't fully understand the integer vector score function, because I haven't fully read and understood the simbase code:
    @Override
    public float score(String srcVKey, int srcId, int[] source, int srclen, String tgtVKey, int tgtId, int[] target, int tgtlen) {
I'm still in the planning phase of a new service, so I cannot test the simbase code right now...
I don't have a working system or test data yet. (This is a hobby project with my wife :D)
I will test simbase later. Sorry about that.
from simbase.
Hi @bwlim,
The integer vector API is for sparse vectors. Sparsity is very common in high-dimensional data, and in that scenario a dense storage format is very inefficient, so we introduced a sparse storage format.
For example, for a 1024-dimensional base, the two formats look like this:
- dense storage format: cmp1, cmp2, ..., cmp1024
- sparse storage format: idx1, cmp1, idx2, cmp2, ... (where cmp_i is a non-zero component and idx_i is the index of that component)
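To make the two layouts concrete, here is a minimal sketch that packs a dense `int[]` into the interleaved index/component layout and computes Euclidean distance directly on the packed form. The class `SparseFormatSketch` and its methods are hypothetical, not simbase code. The sketch assumes that indices are 0-based and stored in ascending order, and that the `srclen`/`tgtlen` arguments count the used slots of each array; none of these is confirmed by the thread.

```java
import java.util.Arrays;

// Hypothetical illustration of the sparse layout [idx1, cmp1, idx2, cmp2, ...].
final class SparseFormatSketch {
    private SparseFormatSketch() {}

    /** Packs a dense vector into index/component pairs, keeping non-zero components only. */
    static int[] toSparse(int[] dense) {
        int nonZero = 0;
        for (int c : dense) {
            if (c != 0) nonZero++;
        }
        int[] sparse = new int[2 * nonZero];
        int pos = 0;
        for (int i = 0; i < dense.length; i++) {
            if (dense[i] != 0) {
                sparse[pos++] = i;        // index (0-based: an assumption)
                sparse[pos++] = dense[i]; // component value
            }
        }
        return sparse;
    }

    /** Euclidean distance between two sparse vectors; each len counts used array slots (assumed). */
    static float distance(int[] source, int srclen, int[] target, int tgtlen) {
        double sum = 0.0;
        int i = 0, j = 0;
        // Merge the two ascending index lists; a missing index means a zero component.
        while (i < srclen && j < tgtlen) {
            double d;
            if (source[i] == target[j]) {
                d = source[i + 1] - target[j + 1];
                i += 2;
                j += 2;
            } else if (source[i] < target[j]) {
                d = source[i + 1];
                i += 2;
            } else {
                d = target[j + 1];
                j += 2;
            }
            sum += d * d;
        }
        // Components left over in either vector have no counterpart in the other.
        for (; i < srclen; i += 2) sum += (double) source[i + 1] * source[i + 1];
        for (; j < tgtlen; j += 2) sum += (double) target[j + 1] * target[j + 1];
        return (float) Math.sqrt(sum);
    }

    public static void main(String[] args) {
        int[] a = toSparse(new int[]{0, 3, 0, 0});
        int[] b = toSparse(new int[]{4, 0, 0, 0});
        System.out.println(Arrays.toString(a));                 // prints [1, 3]
        System.out.println(distance(a, a.length, b, b.length)); // prints 5.0
    }
}
```

The merge touches only the stored non-zero entries, so the cost depends on the number of non-zero components rather than on the full dimension (1024 in the example above).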
from simbase.