Giter Site home page Giter Site logo

Comments (2)

jotok avatar jotok commented on May 28, 2024

Hi Prateek! You're right, we should add an example to the README. I've created a new issue (#43) specifically for this.

The Runner expects simple field-delimited lines where the default delimiter is a comma. We use String::split to split a line in to fields and Double::parseDouble to parse each field into a numeric value. This approach is not robust at all and will fail on simple things, like if the program you used to create the delimited file put quotes around the numeric field. This is intentional: the code is primarily intended to provide an example of library usage, and we didn't want to take additional dependencies to support proper file parsing. If your input file doesn't fit this restrictive format, you'll either have to transform your data or else write your own small program to parse records and submit them to a forest.

from random-cut-forest-by-aws.

prateekiiest avatar prateekiiest commented on May 28, 2024

Thanks, @jotok for creating the issue.
I agree that an outside file format might be needed later on to reformat any input file in the given format.
Currently, I am able to run pass a single column CSV file and it works fine.

from random-cut-forest-by-aws.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.