Comments (5)
You can set this param approxQuantileRelativeError, which is the Relative Error for Approximate Quantile Calculation (0 <= value <= 1), default is 0 for calculating the exact value, which would be expensive for large datasets.
from spark-iforest.
Furthermore, ApproxQuantile is used to calculating the threshold of anomaly score for finding anomaly points according to the given param contamination. Regarding ApproxQuantile function, you can find more detail in the spark api doc:
from spark-iforest.
Hi, thanks for your reply. I am still bit confused.
Suppose, the contamination value is 0.1 . In this case, what could be the best value for ApproxQuantile ?
from spark-iforest.
It depends on what the relative error of the quantile value you can accept. The smaller the relative error is, the more expensive and more memory required for calculation. You can try with different values from small to large, so that it can fit your memory consume.
from spark-iforest.
Thanks a lot.
from spark-iforest.
Related Issues (20)
- Data type StringType is not supported. HOT 5
- Caused by: java.lang.ClassNotFoundException: com.sun.jersey.api.client.Client HOT 1
- Error while initializing the model HOT 2
- Getting NoSuchMethodError in IForest.fit HOT 2
- Pyspark Streaming Iforest HOT 5
- The library doesn't work with Spark 3.0.0 HOT 5
- Getting Py4JJavaError HOT 10
- Maven Central Repository HOT 5
- Compile for Pyspark 3.x.x HOT 2
- Parameter approxQuantileRelativeError not usable when running with Spark 3 HOT 1
- "Task not serializable" when loading trained model HOT 1
- issue with saving the model with python on windows HOT 7
- Maven build error HOT 2
- library import
- Cannot load iforest model HOT 1
- No version for spark1.x HOT 3
- load model HOT 2
- How to use on Databricks.
- Unexpected keyword argument `seed` when initializing IForest via Python API HOT 1
- PySpark iforest.fit() method error HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from spark-iforest.