pip freeze -l > requirements.txt
to update requirements.txt
- Project ran on Python 3.12.2 environment
- Run
./setup.sh
- Random Forest classifier for benign (
rf-minimal
) - Random Forest classifier for general class classification
- Random Forest classifier for lexical features
- Random Forest classifier for lexical and trigrams
- DistilBERT Cased
Models for rf-general
, rf-lexical
, rf-minimal
and distilBERT
cased can be found in:
https://drive.google.com/drive/u/0/folders/1v5IFIY7J0EJPg6VVZXGeGp1DY9VGPuvp
https://www.kaggle.com/datasets/sid321axn/malicious-urls-dataset
- Phishing test: https://phishtank.org/developer_info.php
- Benign test: https://www.kaggle.com/datasets/siddharthkumar25/malicious-and-benign-urls