Giter Site home page Giter Site logo

src-d / formatml Goto Github PK

View Code? Open in Web Editor NEW
2.0 9.0 2.0 504 KB

Formatting with meta-learning experiments.

License: Apache License 2.0

Makefile 2.19% Python 82.81% Java 0.16% Dockerfile 1.50% HTML 0.22% TypeScript 12.57% CSS 0.54%
mloncode formatting

formatml's Introduction

formatml

Travis build status Code coverage License

Formatting with meta learning experiments.

formatml's People

Contributors

m09 avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

formatml's Issues

Meta-learning experiments

Here are the steps I want to go through to test out the meta-learning idea:

  • Extract format analyzer parser (bblfsh + operator, whitespace and special cases handling)
  • Transform parsed files (virtual nodes + uast) into a graph
  • Create a model made of a GGNN encoder and a LSTM decoder with Deep Graph Library and PyTorch
  • Overfit 1 file formatted by prettier to check that the model is expressive enough to learn the formatting of one file
  • Overfit 1 project formatted by prettier, still to check expressiveness
  • Gather a dataset of diverse and somewhat well maintained (ie formatted) projects to learn from (like @warenlg's top javascript repos dataset)
  • Define an evaluation scheme made of both interpolation (modeling style on training repos) and extrapolation (modeling style on unseen repos)
  • Test 4 approaches to train the model:
    • One model per repository (like style-analyzer)
    • One model for all repositories
    • One model for all repositories with multi-task learning (one task per repository)
    • One model for all repositories with meta-learning (one task per repository + learn to adapt)
  • Plug the system into the visualizer to understand results
  • If results seem promising, evaluate a bit more and report to give input to product

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.