Giter Site home page Giter Site logo

ml_project's Introduction

RadiO gaLaxy classiFication using a ResNet (ROLF)

This project aims to use a neural net to classify galaxies in the Radio Galaxy Dataset [1] created by Griese et al. The goal is to find one of four classes, FR-I, FR-II, compact, or bent, for each galaxy image.

Table of Contents

Dataset

The dataset is a collection of several catalogues using the FIRST radio galaxy survey [2]. To these images, the following license applies:

"Provenance: The FIRST project team: R.J. Becker, D.H. Helfand, R.L. White M.D. Gregg. S.A. Laurent-Muehleisen. Copyright: 1994, University of California. Permission is granted for publication and reproduction of this material for scholarly, educational, and private non-commercial use. Inquiries for potential commercial uses should be addressed to: Robert Becker, Physics Dept, University of California, Davis, CA 95616"

Installation and Usage

Installation

To use the code in this repository, install the conda virtual environment found in environment.yml. An installation of Miniforge3 is recommended since it provides a minimal installation of Python together with the fast and reliable mamba package manager.

The environment can be installed via

$ mamba env create -f environment.yml

and can be activated by calling

$ mamba activate rolf

A dev version of the environment can be installed via the file environment_dev.yml After you installed and activated the environment, please install the rolf package using pip

$ pip install -e .

Additionally, you will need an installation of cuda >= 12.1 to fully utilize PyTorch. Versions lower than 12.1 may work too, but have not been tested. Change the version in the environment file depending on the version installed on your system.

Usage

Make sure you have installed the environment and rolf and have activated it call

$ rolf-info

to print an overview of available commands.

$ rolf-info --tools

will print an overview of all available command-line interface (CLI) tools.

Data Download and Unpacking

Before any model can be trained, please call

$ rolf-data -n -o build

in the root directory of the repository. This will download the data from the list of URLs in urls.toml. Then call

$ rolf-unpack build/galaxy_data_h5.zip -o data

and

$ rolf-unpack build/galaxy_data.zip -o data

to unpack the data to the data directory. The directories are created automatically.

Training

Config files for both the training and the hyperparameter optimization are provided in the configs directory and are loaded per default when calling any of the following CLI tools.

  • The training of ROLF can be started using
$ rolf-train
  • The training of the random forest classifier ROMF can be started using
$ romf-train

Hyperparameter Optimization

  • The hyperparameter optimization for ROLF can be started via
$ rolf-optim
  • The hyperparameter optimization of the random forest classifier can be started by calling
$ romf-optim

All optional arguments of the CLI tools can be printed by adding the --help flag.

Pre-trained Models

We have provided pre-trained models in the trained_models directory. The ROLF model can be loaded and evaluated using the classification_viewer.ipynb notebook found in the notebooks directory. This notebook can also be used to evaluate checkpoints of models trained via rolf-train.

Monitoring

The training progress of ROLF can be monitored using TensorBoard:

$ tensorboard --logdir <path/to/checkpoints/directory>

The hyperparameter optimization can be monitored using Optuna Dashboard:

$ optuna-dashboard <path/to/database.sqlite3>

References

[1] Griese, F., Kummer, J., & Rustige, L., "Radio Galaxy Dataset (v0.1.3)", Zenodo (2022). https://zenodo.org/doi/10.5281/zenodo.7113623

[2] R. H. Becker, R. L. White, D. J. Helfand, "The FIRST Survey: Faint Images of the Radio Sky at Twenty Centimeters", The Astrophysical Journal, Vol. 450, p. 559 (1995).

ml_project's People

Contributors

aknierim avatar char-de avatar

Watchers

 avatar  avatar

ml_project's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.