Giter Site home page Giter Site logo

about scikit-multilearn about skml HOT 7 CLOSED

christiansch avatar christiansch commented on September 3, 2024 1
about scikit-multilearn

from skml.

Comments (7)

niedakh avatar niedakh commented on September 3, 2024 2

Hi all,

I would more than love to extend the team that's working on scikit-multilearn given that up till now it was mostly myself, Tomasz's student's plus a couple of contributions like @ChristianSch. I do look at Christian's fork and skml repo regularly, but haven't seen any new code there recently. I've also sent Christian an invite to the dev group, but he must've missed it.

I'm more than willing to add you to the scikit-multilearn organization and work together, it is really hard to maintain the library alone. At the moment I'm concentrated on providing multilabel stratification methods for scikit-ml, fixing the code according to feedback from JMLR reviewers and providing a patch for a large scale problem with sparse outputs in scikit-learn - which is a lot of work but also is the crucial point to make the library more optimal - scikit-learn/scikit-learn#8908

In general my vision is for scikit-multilearn to be the library of choice for the multi-label classification problem with scikit-learn. I've sent you invites to the scikit-multilearn slack, let's continue discussion there shall we? The address is: https://scikit-ml.slack.com/

from skml.

ChristianSch avatar ChristianSch commented on September 3, 2024 1

Hey there,

I sadly am just an occasional contributor without any rights in the repo. The status of the projects bugs me a lot as well. I started skml primarily for myself, as the basic methods such as cc and ecc didn't work at the time, hence this project is somewhat suplementary for things I need and implemented myself as I didn't have any hopes that my pull requests would be merged.

I approached the maintainer of scikit-multilearn already about a slack/irc/mailing list communication method, but to no avail. I can't just get in touch with him. Maybe if we join forces we can do something about it.

Some important things from the top of my head:

  • communication (slack, irc, mailing list)
  • a project that manages the repo for easier collaboration distributing the responsibility over the project that actually serves a purpose and is used, with roles and everything
  • automatic testing is a must have (as you noted as well)
  • clean up the base code to adhere to python standards (if not done already)
  • cleaning up the issues and maybe have a TODO file for classifiers/methods that are missing
  • dependency management

I'm not sure where to go from here. Maybe we should look for more interested people, then approach the maintainer and see if we can work something out. If not, we fork it (we can use skml as a name, I'm happy to ditch this project if we get things going) and do it on our own.

What do you think?

Cheers!

from skml.

ljvmiranda921 avatar ljvmiranda921 commented on September 3, 2024

Sure! I'd be happy to help in any way I can.

But maybe just out of respect, we can contact the owner for one last time? It seems that the library has a pre-print in arxiv, and has received grants (and agreements) that might be quite tricky to tread on. Also, the authors may have a vision in the library that they wanted to share.

I agree with the things you pointed out. The style of the codebase seems to have a hint of Java/C++, and we can refactor them to be more Pythonic. In fact, some of the methods and mixins are quite redundant and can be removed.

Personally, I'm not yet that good in Python, but I can help you in refactoring some of the previous code, writing most of the documentation (contribution guidelines, API doc, code of conduct, etc. etc.), and designing new tests. 😄

As for major milestones, we can start things of by supporting those that are dependent only in the Python ecosystem. We can add support to MEKA and graph-tools later on. We can start small, then if we have a working base we can promote this in Reddit or HackerNews to find new contributors and whatnots.

Thank you for your reply, hopefully we get to build this much better!

Cheers!

from skml.

ChristianSch avatar ChristianSch commented on September 3, 2024

Oh, sorry, I didn't meant that we should just skip him! I was just thinking that if we have some things we want and have mapped that out, it's easier for him to help, as he obviously has carried on or doesn't have much time. Of course it would be best to have his blessing and his help. I think having contributor status in the existing project would be optimal, as well as having a method of communication where the maintainer has access as well, if he wants to. Maybe we should find out what he's up to. I'll try to find out his email.
I think we should get in touch with him as a group of "future contributors" or something. Is there anyone else you know about besides you and me?

from skml.

ljvmiranda921 avatar ljvmiranda921 commented on September 3, 2024

Ooops sorry I misunderstood, my bad. Yes you are totally correct! 👍

According to their publication, the emails are:

As of now, I don't know anyone else. I contacted you because you seem to be the most active contributor in the library. 😄

from skml.

ljvmiranda921 avatar ljvmiranda921 commented on September 3, 2024

Awesome @niedakh!

Oops, just one more thing, is it fine to send the invite to [email protected] instead?

from skml.

niedakh avatar niedakh commented on September 3, 2024

Cool, sent it!

from skml.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.