
Comments (3)

almondtools commented on June 18, 2024

Thank you for pointing me to this library, which I did not know about.

The Aho-Corasick variant of stringsearchalgorithms is not the fastest among the implementations known to me. There is at least one library that is faster.

You can see an overview at stringbench. For "128 strings in strings" you will see that the Aho-Corasick implementation of HanKCS is faster (study the benchmark files to get the factor).

I will check the library mentioned above for correctness and performance.

from stringsearchalgorithms.

almondtools commented on June 18, 2024

I added multiple-string-searcher to stringbench, with the following results:

  • the documentation describes incorrectly how to enable Aho-Corasick-Fast (StringSearcherBuilder is hard-coded to work only with standard Aho-Corasick)
  • the standard Aho-Corasick performs poorly in corner cases (like the algorithm of Aho-Corasick.org). For example, the test for alphabet size 2, pattern size 2 and pattern number 2 still has not terminated after 40 seconds. An issue was filed on Aho-Corasick.org but ignored.
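
For readers who want to try the corner-case input in isolation, here is a minimal sketch in Python. It is not the stringbench test and uses neither library's code: `build_automaton`, `search`, `naive_search` and `corner_case` are hypothetical names, the automaton is a plain textbook Aho-Corasick, and the text length and seed are assumptions chosen only for illustration. It generates a text over a two-letter alphabet with two patterns of length 2 and cross-checks the matches against a naive search.

```python
import random
from collections import deque


def build_automaton(patterns):
    """Build a textbook Aho-Corasick automaton: trie, failure links, outputs."""
    goto = [{}]   # goto[state][char] -> next state
    fail = [0]    # failure link of each state
    out = [[]]    # patterns that end in each state
    for p in patterns:
        state = 0
        for ch in p:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].append(p)
    # Breadth-first pass: set failure links and inherit outputs from them.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def search(text, patterns):
    """Return the set of (start_index, pattern) matches found by the automaton."""
    goto, fail, out = build_automaton(patterns)
    state, matches = 0, set()
    for i, ch in enumerate(text):
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        for p in out[state]:
            matches.add((i - len(p) + 1, p))
    return matches


def naive_search(text, patterns):
    """Reference result: try every pattern at every position."""
    return {(i, p) for p in patterns
            for i in range(len(text) - len(p) + 1)
            if text.startswith(p, i)}


def corner_case(seed=0, text_length=1000):
    """Alphabet size 2, pattern size 2, pattern number 2 (sizes from the thread)."""
    rng = random.Random(seed)
    alphabet = "ab"
    text = "".join(rng.choice(alphabet) for _ in range(text_length))
    patterns = ["".join(rng.choice(alphabet) for _ in range(2)) for _ in range(2)]
    return text, patterns


# Cross-check the automaton against the naive search on the corner-case input.
text, patterns = corner_case()
assert search(text, patterns) == naive_search(text, patterns)
```

A correct implementation handles this input in time linear in the text length, so a run that is still going after 40 seconds points at a defect rather than at an inherently hard input.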

To keep the test suite of stringbench lean, the incubation test currently contains only one example of a failing test. Look in the history of the test file if you want to run the whole test.


almondtools commented on June 18, 2024

Please reopen this issue if the reason is not clear.

