ameli / manylinux-cuda

manylinux docker images with CUDA Toolkit

Home Page: https://hub.docker.com/r/sameli/manylinux_2_28_x86_64_cuda_12.3

License: BSD 3-Clause "New" or "Revised" License

Dockerfile 84.41% Starlark 4.33% Shell 11.26%
dockerfile cuda-toolkit manylinux2014 cuda docker manylinux nvidia-cuda python-build python-wheel python-wheels manylinux-2-28

manylinux-cuda's Issues

NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver

When I start a container with "docker run -it sameli/manylinux2014_x86_64_cuda_12.3" and run the nvidia-smi command inside it, the command fails with the following message. Am I missing something here?

[root@24c96d985c30 /]# nvidia-smi
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.

I have also tried "docker run --gpus all sameli/manylinux2014_x86_64_cuda_12.3 nvidia-smi", which produces this output:
docker: Error response from daemon: failed to create task for container: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #0: error running hook: exit status 1, stdout: , stderr: Auto-detected mode as 'legacy'
nvidia-container-cli: initialization error: load library failed: libnvidia-ml.so.1: cannot open shared object file: no such file or directory: unknown.
ERRO[0000] error waiting for container:

Provide full CUDA images as options

To start with, I am from EECS UC Berkeley as well. Go Bears.

I am using PyO3 to build a Python module from Rust that uses CUDA. The Rust crate that calls into CUDA is cust, which specifically requires libcuda.so. The current minimal image does not appear to include this file.

So, I forked this repo and made a one-line change.

RUN yum -y install cuda-${VER}.${ARCH}

https://github.com/l2iterative/manylinux-cuda/blob/main/docker/manylinux2014_x86_64_cuda_12.3/Dockerfile#L33

This works for me. I eventually published a separate image on Docker Hub because my GitHub Actions workflows need one. In my testing, the change proved both necessary and useful.

Nevertheless, I wonder whether you would include such a full version (the image would be larger, at 4.83 GB) among the images you offer. Compared with PyTorch's image, which does not work for me, this repo behaves more predictably and is simpler, so it would be a go-to option.
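
One way to confirm whether a given image actually ships the library is to search its filesystem from inside the container. The following is a minimal sketch, not part of this repo: `find_libcuda` and `has_libcuda` are hypothetical helper names, and the default search root `/usr` is an assumption (the file may live elsewhere depending on how the toolkit was installed).

```shell
# Hypothetical helpers (not from this repo) for locating libcuda.so in an image.

# Print the path of the first libcuda.so* file found under the given root.
# The default root /usr is an assumption; pass another directory if needed.
find_libcuda() {
    local root="${1:-/usr}"
    # -L follows symlinks; -quit (GNU find) stops after the first match.
    find -L "$root" -name 'libcuda.so*' -print -quit 2>/dev/null
}

# Succeed only if some libcuda.so* file exists under the given root.
has_libcuda() {
    [ -n "$(find_libcuda "$1")" ]
}

# Example, run inside the container:
#   has_libcuda /usr && echo "libcuda found" || echo "libcuda missing"
```

Searching by file name keeps the check independent of the linker cache, at the cost of a slower scan on large trees.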

using container in github action

Hi there,

I've been using your container in GitHub Actions like so:

jobs:
  maketest:
    runs-on: ubuntu-latest
    container:
      image: sameli/manylinux2014_x86_64_cuda_11

but recently I have been getting this error:
failed to register layer: write /usr/local/cuda-11.7/targets/x86_64-linux/lib/libcusolver_static.a: no space left on device

Do you know if there is any way to get the container up and running in a GitHub Action?
Did GitHub shrink the available disk space?

Many thanks in advance 👍
Paul
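
The "no space left on device" message indicates that the disk filled up while Docker was unpacking the image layers. A minimal sketch for checking free space before pulling a large image follows. `free_kb` and `has_free_space` are hypothetical helper names, and the 10 GiB threshold is chosen only for illustration. The sketch also assumes the image is pulled from an ordinary step with `docker pull`; a check like this likely cannot run ahead of a job-level `container:` pull.

```shell
# Hypothetical helpers (not from this repo) for checking free disk space.

# Print the free space, in KiB, of the filesystem that holds the given path.
free_kb() {
    local path="${1:-/}"
    # -P forces the POSIX single-line format; -k reports 1024-byte blocks.
    # Column 4 of the second line is the "Available" figure.
    df -Pk "$path" | awk 'NR == 2 { print $4 }'
}

# Succeed only if at least min_kb KiB are free on the filesystem holding path.
has_free_space() {
    local path="$1" min_kb="$2"
    [ "$(free_kb "$path")" -ge "$min_kb" ]
}

# Example (the 10 GiB threshold is an assumption):
#   has_free_space / $((10 * 1024 * 1024)) && docker pull sameli/manylinux2014_x86_64_cuda_11
```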

`manylinux_2_28`?

First of all, thank you very much for putting this together - I had some wheels break many months ago because I couldn't get CUDA installed correctly and these enabled me to resume building those wheels.

But a question/feature request: would it be possible to move to manylinux_2_28 instead of manylinux2014? The reason is that manylinux2014 wheels set _GLIBCXX_USE_CXX11_ABI=0, so libraries in those wheels cannot be linked against by projects that use the new ABI (if 13 years old can be called new...). In addition, CentOS 7 (and manylinux2014 with it) reaches EOL this year, so you're probably going to be forced to bump soon anyway...
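
To see which ABI an already-built shared library was compiled against, one heuristic is to look for the `__cxx11` tag that libstdc++ adds to new-ABI symbols such as `std::__cxx11::basic_string`. A minimal sketch follows. `uses_cxx11_abi` is a hypothetical helper name and `libfoo.so` is a placeholder. A library whose symbols involve no `std::string` or `std::list` will show no tag under either ABI setting, so a negative result is not conclusive.

```shell
# Hypothetical helper (not from this repo): reads a symbol listing on stdin,
# for example the output of `nm -D`, and succeeds if any symbol carries the
# __cxx11 tag. The tag appears in both mangled and demangled names.
uses_cxx11_abi() {
    grep -q '__cxx11'
}

# Example (libfoo.so is a placeholder):
#   nm -D libfoo.so | uses_cxx11_abi && echo "new-ABI symbols present"
```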
