Better selectivity estimation (cumulusrdf, open issue)

mgaare commented on September 2, 2024
Better selectivity estimation

from cumulusrdf.

Comments (2)

GoogleCodeExporter commented on September 2, 2024
Hi Andreas,
what do you think about using JMX for those (and other similar) purposes?

Andrea

Original comment by [email protected] on 25 Jan 2014 at 8:59

GoogleCodeExporter commented on September 2, 2024
Hi Andrea,

thanks for looking into this.

I probably should have made this issue much more specific. In our current 
implementation we only have heuristic-based selectivity estimation [1]. This 
implementation is mainly based on [2] and takes some ideas from the paper in [3].

Unfortunately, our SPARQL performance is not very good, as our recent 
benchmark [4] points out. So, one way to improve this would be to create better 
query plans via more accurate selectivity estimation.
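
To make the link between selectivity estimates and query plans concrete, here is a minimal sketch of a purely heuristic estimator. It is not the code in [1] or [2]: the `TriplePattern` type, the `UNBOUND_FACTOR` weights and both function names are hypothetical, and the numbers are made up. It only assumes the general idea that a pattern with more bound positions is treated as more selective and is therefore evaluated earlier.

```python
from typing import NamedTuple, Optional


class TriplePattern(NamedTuple):
    """A triple pattern; None marks an unbound position (a variable)."""
    s: Optional[str]
    p: Optional[str]
    o: Optional[str]


# Hypothetical penalty factors for each unbound position (made-up numbers).
# Assumption: a bound subject narrows results most, a bound predicate least,
# so leaving the subject unbound is penalised most heavily.
UNBOUND_FACTOR = {"s": 100.0, "p": 10.0, "o": 50.0}


def estimate(pattern: TriplePattern) -> float:
    """Return a relative cost; smaller means 'assumed more selective'."""
    cost = 1.0
    for position, term in zip("spo", pattern):
        if term is None:
            cost *= UNBOUND_FACTOR[position]
    return cost


def order_patterns(patterns):
    """Put the patterns assumed to be most selective first."""
    return sorted(patterns, key=estimate)


patterns = [
    TriplePattern(None, None, None),
    TriplePattern(None, "rdf:type", "foaf:Person"),
    TriplePattern("ex:alice", "foaf:knows", None),
]
plan = order_patterns(patterns)
```

Note that the estimate never looks at the data. It would rank `(?, rdf:type, foaf:Person)` as fairly selective even if the store held millions of persons, which is the kind of misjudgement that data-driven statistics are meant to avoid.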

In fact, a colleague of mine supervised a master's thesis on this topic, where 
the student implemented a much better estimation for cumulusRDF. However, this 
code is completely untested and done by a student ;) So ... one would have to 
spend some time on it.

The actual problem is how to efficiently create meaningful triple 
pattern (or even join pattern) statistics via Cassandra. There have also been 
some posts on the cassandra mailing list about this, e.g., [5].
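
As a rough illustration of what triple pattern statistics could look like in the simplest case, the sketch below counts how often each term occurs per position and uses those counts as an upper bound on a pattern's result size. Everything here is an assumption for illustration: it runs over an in-memory list, `build_stats` and `estimate_cardinality` are hypothetical names, and it says nothing about how to maintain such counts efficiently in Cassandra, which is the open question.

```python
from collections import Counter


def build_stats(triples):
    """Count term occurrences per position over an iterable of (s, p, o)."""
    stats = {"s": Counter(), "p": Counter(), "o": Counter(), "total": 0}
    for s, p, o in triples:
        stats["s"][s] += 1
        stats["p"][p] += 1
        stats["o"][o] += 1
        stats["total"] += 1
    return stats


def estimate_cardinality(stats, pattern):
    """Upper bound on matches for a pattern; None marks an unbound position.

    With several bound positions, the smallest single-position count is
    still a valid upper bound, although it can be far from exact.
    """
    bounds = [
        stats[position][term]
        for position, term in zip("spo", pattern)
        if term is not None
    ]
    return min(bounds) if bounds else stats["total"]


triples = [
    ("ex:a", "foaf:knows", "ex:b"),
    ("ex:a", "foaf:knows", "ex:c"),
    ("ex:b", "rdf:type", "foaf:Person"),
]
stats = build_stats(triples)
```

Per-position counts like these ignore correlations between positions, so they cannot replace join pattern statistics; they only show the kind of data a statistics-based estimator would need to read cheaply.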

Overall, this is not a trivial problem. However, I think we should target it 
as a long-term goal/issue.

Kind regards
Andreas

[1] edu.kit.aifb.cumulus.store.sel.HeuristicsBasedSelectivityEstimator
[2] org.openrdf.query.algebra.evaluation.impl.EvaluationStatistics
[3] Heuristics-based Query Optimisation for SPARQL
[4] NoSQL Databases for RDF: An Empirical Evaluation
[5] http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Tracking-word-frequencies-td7592285.html

Original comment by andreas.josef.wagner on 26 Jan 2014 at 1:33
