Giter Site home page Giter Site logo

cufinufft-old's Introduction

cuFINUFFT

[Update] The main repository has moved to https://github.com/flatironinstitute/cufinufft. Please refer to it for latest development.

A GPU implementation of 2, 3 dimension type 1, 2 non-uniform FFT based on FINUFFT.

This is a work from Melody Shih's internship at Flatiron Institute, advised by CCM project leader Alex Barnett.

Installation

  • Get this code and dependency - git clone https://github.com/flatironinstitute/cufinufft.git
  • Review the makefile. - If you need to customize build settings, create and edit a make.inc. Example:
    • To override the standard CUDA /usr/local/cuda location your make.inc should contain: CUDA_ROOT=/your/path/to/cuda.
    • Two examples are provided, one for IBM machines (make.inc.power9), and another for the Courant Institute cluster (make.inc.CIMS).
  • Compile - make all -j
  • Run a test code - ./cufinufft2d1_test 2 128 128 10 1e-6

Interface

cuFINUFFT API contains 5 stages:

  • Set cufinufft default options - int ier=cufinufft_default_opts(type1, dim, &dplan.opts);
  • Make cufinufft plan - ier=cufinufft_makeplan(type1, dim, nmodes, iflag, ntransf, tol, maxbatchsize, &dplan);
  • Set the locations of non-uniform points x,y,z - ier=cufinufft_setNUpts(M, x, y, z, 0, NULL, NULL, NULL, &dplan);
  • Apply the transformation with data c,fk - ier=cufinufft_exec(c, fk, &dplan);
  • Destroy cufinufft plan - ier=cufinufft_destroy(&dplan);

Preprocessors

  • SINGLE - single-precision. Enable by calling PREC=SINGLE make ... or adding to your make.inc.
  • TIME - timing for each stage. Enable by adding "-DTIME" to NVCCFLAGS.
  • SPREADTIME - more detailed timing from spreading and interpolation
  • DEBUG - debug mode outputs all the middle stages' result

Other

  • If you are interested in optimizing for GPU Compute Capability, you may want to specicfy NVARCH=-arch=sm_XX in your make.inc to reduce compile times, or for other performance reasons. See Matching SM Architectures.

cufinufft-old's People

Contributors

garrettwrong avatar janden avatar melodyshih avatar raulppelaez avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

cufinufft-old's Issues

FFTW library is required or not?

Dear Yu-hsuan Shih,

Probably your cufinufft can works without fftw3 as the Makefile below?

# FFTWNAME=fftw3
# FFTW=$(FFTWNAME)$(PRECSUFFIX)
#
# LIBS=-lm -lcudart -lstdc++ -lnvToolsExt -lcufft -lcuda -l$(FFTW)
LIBS=-lm -lcudart -lstdc++ -lnvToolsExt -lcufft -lcuda

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.