
ml-core's Introduction

ml-core's People

Contributors

glimow, irinakhismatullina, r0maink, vmarkovtsev, zurk


ml-core's Issues

How to extract UAST's path contexts from code?

Given a piece of Python code:

print("hello")

How can I extract its associated UAST path contexts? I explored the sourced library to see whether it has a function for extracting the paths from a piece of code:

In:

from sourced.ml.core.extractors import bags_extractor
bags_extractor.Extractor?

Out:

Init signature: bags_extractor.Extractor(log_level=20)
Docstring:     
Converts a single UAST via `algorithm` to anything you need.
It is a wrapper to use in `Uast2Features` Transformer in a pipeline.
Init docstring:
Class constructor
:param log_level: logging level.
File:           ~/anaconda3/envs/sourced/lib/python3.6/site-packages/sourced_ml_core-0.0.3-py3.6.egg/sourced/ml/core/extractors/bags_extractor.py
Type:           type
Subclasses:     BagsExtractor, RoleIdsExtractor

and

In:

from sourced.ml.core.extractors import Extractor
Extractor.extract?

Out:

Signature: Extractor.extract(self, uast:bblfsh.node.Node)
Docstring: <no docstring>
File:      ~/anaconda3/envs/sourced/lib/python3.6/site-packages/sourced_ml_core-0.0.3-py3.6.egg/sourced/ml/core/extractors/bags_extractor.py
Type:      function

I also tried:

from sourced.ml.core.utils import bblfsh
bblfsh.BblfshClient.parse(filename='/home/user/Downloads/script.py')

But I got:

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-37-9dbc87e2a353> in <module>
      1 code = '''print("hi")'''
----> 2 bblfsh.BblfshClient.parse(filename='/home/user/Downloads/script.py')

TypeError: parse() missing 1 required positional argument: 'self'
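
This TypeError is generic Python behavior rather than anything specific to Babelfish: `parse` is an instance method, and the call above invokes it on the class itself, so nothing is bound to `self`. A minimal sketch of the same situation, using a hypothetical stand-in class (`FakeClient` is not part of bblfsh or sourced, and the endpoint string is only a placeholder):

```python
class FakeClient:
    """Hypothetical stand-in for a client class with an instance method `parse`."""

    def __init__(self, endpoint):
        self.endpoint = endpoint

    def parse(self, filename):
        # A real client would send the file to a server; here we just echo the inputs.
        return (self.endpoint, filename)


# Calling the method on the class itself leaves `self` unbound, which
# produces the same kind of TypeError as in the traceback above.
try:
    FakeClient.parse(filename="script.py")
except TypeError as exc:
    error_message = str(exc)

# Creating an instance first binds `self`, so the call succeeds.
client = FakeClient("0.0.0.0:9432")  # placeholder endpoint, an assumption
result = client.parse(filename="script.py")
```

With the real client, the equivalent would presumably be to construct a `BblfshClient` instance with the address of a running Babelfish server and then call `parse` on that instance.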

After checking the available modules, I did not find a function for extracting UAST path contexts. Is it possible to do this with sourced? I also could not find any documentation; could someone point me to where the docs are?
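
To make the question concrete, here is a rough sketch of what "path contexts" could mean, assuming the code2vec-style definition: a triple of (start leaf token, path through the lowest common ancestor, end leaf token) for every pair of leaves. It uses Python's built-in `ast` module as a stand-in for a Babelfish UAST and is not part of sourced.ml.core; all function names and the `^`/`_` separators for upward and downward steps are illustrative choices.

```python
import ast
from itertools import combinations


def children(node):
    # Skip Load/Store/Del markers: they carry no token of their own.
    return [c for c in ast.iter_child_nodes(node)
            if not isinstance(c, ast.expr_context)]


def leaf_token(node):
    # Use the identifier or literal as the token; fall back to the node type.
    if isinstance(node, ast.Name):
        return node.id
    if isinstance(node, ast.Constant):
        return repr(node.value)
    return type(node).__name__


def root_to_leaf_paths(node, prefix=()):
    """Yield the chain of nodes from the root down to each leaf."""
    prefix = prefix + (node,)
    kids = children(node)
    if not kids:
        yield prefix
    for kid in kids:
        yield from root_to_leaf_paths(kid, prefix)


def path_contexts(source):
    """Return (start_token, path, end_token) triples for every pair of leaves."""
    contexts = []
    paths = list(root_to_leaf_paths(ast.parse(source)))
    for a, b in combinations(paths, 2):
        # Length of the shared prefix; its last node is the lowest common ancestor.
        common = 0
        while common < min(len(a), len(b)) and a[common] is b[common]:
            common += 1
        up = [type(n).__name__ for n in reversed(a[common:])]
        top = type(a[common - 1]).__name__
        down = [type(n).__name__ for n in b[common:]]
        path = "^".join(up + [top]) + "_" + "_".join(down)
        contexts.append((leaf_token(a[-1]), path, leaf_token(b[-1])))
    return contexts


contexts = path_contexts('print("hello")')
```

On Python 3.8 or later, the example above yields a single context that links the `print` name to the string literal through their common `Call` ancestor. A UAST-based version would walk Babelfish nodes instead of `ast` nodes, but the pairing and common-ancestor logic would be the same.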

Switch from BiLSTM to the modern attention architecture

Our current NN splitter is based on a BiLSTM, which has performance problems. We should leverage recent advances in deep learning and implement a new attention-based (seq2seq-like?) architecture for the model.
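
For reference, the building block most attention-based models share is scaled dot-product attention. The issue does not say which variant is intended, so the following is only a generic pure-Python sketch of that operation, not the design of the new splitter; the function names are illustrative.

```python
import math


def softmax(scores):
    # Subtract the maximum for numerical stability before exponentiating.
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(queries, keys, values):
    """Scaled dot-product attention over lists of equal-length vectors."""
    scale = math.sqrt(len(keys[0]))
    output = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        output.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return output


# Two identical keys receive equal weight, so the output is the mean of the values.
mixed = attention([[1.0, 0.0]], [[1.0, 0.0], [1.0, 0.0]], [[2.0], [4.0]])
```

Unlike a BiLSTM, every position here is computed independently of the others, so the loop over queries can run in parallel.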

Stage 1 - research

Follow the paper, take the same dataset, and design the model. Calculate the metrics.

Stage 2 - production

Package the model and publish it on Modelforge.
