raffaello-camoriano / enitor
Enitor provides the MATLAB implementation of several large-scale kernel methods.
fix asap
formulation = 1
M = K_{mm}
formulation = 2
M = eye(m)
Parameters must be passed as key/value pairs to the kernel, filter and algorithm classes and their subclasses. Otherwise, arguments are easily confused inside the methods. Key/value passing would also make the code much cleaner.
L(y,f) = (-y*f)_+ = max(0, -y*f)
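As a minimal sketch of this loss, assuming labels y in {-1, +1} and real-valued predictions f stored as column vectors, the positive part can be computed elementwise. The handle name positivePartLoss and the toy values are illustrative and not part of Enitor.

```matlab
% Positive part of -y*f, applied elementwise (hypothetical helper name).
positivePartLoss = @(y, f) max(0, -y .* f);

y = [1; -1; 1];          % toy labels
f = [0.5; 0.3; -2];      % toy predictions
losses = positivePartLoss(y, f);   % [0; 0.3; 2]
```

Correctly classified points (y*f > 0) incur zero loss, while misclassified points are penalized linearly in the size of the error.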
if strict == 1 use < or >
if strict == 0 use <= or >=
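One way the strict flag could select the comparison is sketched below. It assumes that a lower performance value is better; for a measure where higher is better, the operators would be > and >= instead. The name isBetter is hypothetical.

```matlab
strict = 1;   % 1: strict inequality, 0: non-strict inequality

% Pick the comparison once, then reuse it in the selection loop.
if strict == 1
    isBetter = @(newPerf, bestPerf) newPerf < bestPerf;
else
    isBetter = @(newPerf, bestPerf) newPerf <= bestPerf;
end

tie = isBetter(0.2, 0.2);       % false when strict == 1
improved = isBetter(0.1, 0.2);  % true
```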
e.g. with an 'errorStorageMode' option, set to 'epoch' (default) or 'iterations'
Now it stops the selection of lambda instead of m.
Refactor code to avoid cell array input arguments.
The training method considers only a single number as the performance measure, even when the number of outputs is larger than 1. An alternative worth implementing is to compute the performance measure for each output and each lambda guess, and then select the best lambda guess according to a customizable aggregation function (e.g. mean or median).
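The per-output selection described above might look like the following sketch. The matrix perf holds made-up validation errors (rows are lambda guesses, columns are outputs), and aggregate is a hypothetical user-supplied handle. Neither name comes from Enitor.

```matlab
% perf(i, j): validation error for lambda guess i on output j (toy values).
perf = [0.30 0.10 0.20;
        0.15 0.20 0.20;
        0.40 0.05 0.60];

% Customizable aggregation across outputs, e.g. @(p) median(p, 2).
aggregate = @(p) mean(p, 2);

score = aggregate(perf);        % one score per lambda guess
[~, bestIdx] = min(score);      % lower error is assumed to be better
```

With the mean, the second lambda guess wins here. Swapping in another aggregation only requires changing the handle.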
Maximum lambda should be <=10e0. Larger lambdas do not make much sense.
Tested with
map = @nystromUniformIncremental;
and Covertype dataset
Numerical attributes (e.g. age) are currently not normalized, scaled or standardized. These attributes may dominate the binary dummy variables that encode categorical attributes (e.g. nationality).
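A minimal sketch of one possible fix is to standardize only the numerical columns and leave the dummy variables untouched. The toy matrix and the variable numericCols are assumptions for illustration. In a real experiment the mean and standard deviation would be estimated on the training split only and reused on the test split.

```matlab
% Toy data: column 1 is a numerical attribute (age), columns 2-3 are dummies.
X = [25 1 0;
     40 0 1;
     55 1 0];
numericCols = 1;                       % indices of numerical attributes

mu = mean(X(:, numericCols), 1);
sd = std(X(:, numericCols), 0, 1);
sd(sd == 0) = 1;                       % guard against constant columns

Xs = X;
Xs(:, numericCols) = bsxfun(@rdivide, ...
    bsxfun(@minus, X(:, numericCols), mu), sd);
```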
it should not be limited to the number of training points.
Make common parent class with abstract compute() method, to avoid repetitions.
Add a "next" method and a cell array of kernel/filter parameter lists, so that the algorithm classes can handle more complex model selection strategies (such as per-dimension sigmas for non-isotropic Gaussian kernels, or gradient descent on sigma/lambda).
Add inputParser to class methods in order to rationalize input parameters.
See http://www.mathworks.it/help/matlab/ref/inputparser-class.html
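As a rough sketch of what inputParser-based key/value parsing could look like, the snippet below parses a cell array standing in for a method's varargin. The option names 'sigma', 'numGuesses' and 'verbose' are hypothetical and are not taken from the Enitor classes.

```matlab
args = {'sigma', 0.5};   % what a caller would pass as varargin

p = inputParser;
p.FunctionName = 'exampleKernel';      % used in error messages
addParameter(p, 'sigma', 1, @(x) isnumeric(x) && isscalar(x) && x > 0);
addParameter(p, 'numGuesses', 10, @(x) isnumeric(x) && isscalar(x) && x >= 1);
addParameter(p, 'verbose', false, @(x) islogical(x) || isnumeric(x));

parse(p, args{:});
opts = p.Results;        % struct with defaults filled in for missing keys
```

Unknown keys and values that fail validation raise an error at parse time instead of silently propagating into the method body.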
Keep previously sampled columns in batch experiments.
Example
loop repetitions
    shuffle dataset
    loop nystrom low-rank, choosing the first m columns
        loop Tikhonov
        end
    end
end
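The loop structure above can be sketched as follows on synthetic data. This is only an illustration under stated assumptions: a Gaussian kernel, the standard Nystrom regularized least squares system with M = K_{mm}, and hypothetical variable names throughout. It does not use the Enitor classes. Because the first m shuffled points are always the centers, a larger m reuses every column sampled for a smaller m.

```matlab
rng(0);
n = 200; d = 5;
X = randn(n, d);  y = sign(X(:, 1));        % synthetic binary problem
numRep = 2;  mGuesses = [10 20 40];  lambdas = [1e-3 1e-1];  sigma = 1;
err = zeros(numRep, numel(mGuesses), numel(lambdas));

for r = 1:numRep                             % loop repetitions
    perm = randperm(n);                      % shuffle dataset
    Xr = X(perm, :);  yr = y(perm);

    % Kernel columns for the largest m, computed once per repetition.
    C = Xr(1:max(mGuesses), :);
    D2 = bsxfun(@plus, sum(Xr.^2, 2), sum(C.^2, 2)') - 2 * (Xr * C');
    KnM = exp(-D2 / (2 * sigma^2));

    for i = 1:numel(mGuesses)                % loop Nystrom rank
        m = mGuesses(i);
        Knm = KnM(:, 1:m);                   % first m columns are kept
        Kmm = KnM(1:m, 1:m);
        for j = 1:numel(lambdas)             % loop Tikhonov
            alpha = (Knm' * Knm + n * lambdas(j) * Kmm) \ (Knm' * yr);
            err(r, i, j) = mean(sign(Knm * alpha) ~= yr);  % training error
        end
    end
end
```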
Datasets such as "sido" might require rebalancing.
Parameters to be CV'd:
losses:
Fix the Hessenberg version of the Tikhonov regularization filter.
It is currently deactivated and generates a warning.
split in numMGuesses+1 parts and disregard the 1st one.
add option 'ordering' for samples ordering:
results with
numRFParGuesses = 10;
and
numRFParGuesses = 1;
do not match.
Dataset: InsuranceCompanyBenchmark