Comments (2)
Hi @mdabros
You can keep the OOB indices of records (record indices corresponding to the training set record lines) for each tree in a structure like:
dictionary<tree#,List>
This should keep the memory consumption to a minimum
Thank you and I hope to see this wonderful project continue to flourish!
Best,
Michael
from sharplearning.
Hi @MichaelBenAssor,
Originally I decided to leave this feature out of the RandomForest implementation to keep memory consumption low, and leave the option of getting an unbiased estimate of the model error to the CrossValidation classes. But it is quite a nice feature of the original RandomForest, so it is something that could be cool to have. So if I can find a good solution that is optional, and does not add memory usage or performance degradation when running without it, I can add it.
I have a few other features I want to complete before working on this though, and with this being a spare time effort, it might be a while before I get to it :)
Best regards
Mads
from sharplearning.
Related Issues (20)
- Add parallelism to Bayesian Optimizer. Also allow resampling non-deterministic algorithms HOT 2
- TrimSplitLineTrimColumnsToDictionary throws a "key already exists" exception HOT 1
- Issue with loading model using GenericXmlDataContractSerializer: The deserializer has no knowledge of any type that maps to this name HOT 4
- Order of results from RandomSearch is not deterministic with different iteration counts. HOT 1
- Is there a way to keep textual labels / targets as a part of the trained model? HOT 5
- Serialization Exception HOT 2
- Continuously improving a neural network over time using small batches. HOT 1
- SharpLearning.XGBoost.dll is not compatible with .net core HOT 6
- Exception when serializing neural net to XML HOT 2
- Code sharing HOT 3
- A way to Save Bayesian Optimizer progress and continue later. HOT 1
- how can i train the Neural Network with my own Training Pictures? HOT 1
- HPO wiki page
- How can I be able to learn with your repository HOT 3
- Looking for an example for loading from a Stream source HOT 4
- Error in getting started code example HOT 1
- Unhandled Exception: System.Runtime.Serialization.SerializationException: The internal array cannot expand to greater than Int32.MaxValue elements. HOT 1
- SharpLearning can only load models trained in python with xgboost==0.82 HOT 1
- Monitoring training progress HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sharplearning.