
sdcfinder's Introduction

SDCfinder


This tool aims to detect Silent Data Corruption (SDC) occurring in supercomputers and large datacenters. The current implementation is in C, but we will be adding a Python version soon. We are also planning to cover GPU memory.

The way this tool works is rather simple. At the beginning of the execution, two hexadecimal patterns are selected randomly. The tool then allocates a large buffer out of the available memory and iterates over three actions: write, sleep, and read. When reading, it checks whether any value read back differs from the pattern that was written; if so, it logs it as a detected SDC. On the next iteration the second pattern is used for writing, and the tool keeps switching between the two patterns until the end of the execution.
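The write, sleep, read cycle described above can be sketched roughly as follows. This is a minimal illustration and not the project's actual code: the function names (`write_pattern`, `check_pattern`, `random_pattern`, `scan_memory`), the 64-bit word size, the use of `rand()` for pattern selection, and the POSIX `sleep()` call are all assumptions made for the example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>   /* sleep(): assumes a POSIX system */

/* Fill every 64-bit word of the buffer with the given pattern. */
static void write_pattern(volatile uint64_t *buf, size_t words, uint64_t pattern)
{
    for (size_t i = 0; i < words; i++)
        buf[i] = pattern;
}

/* Read every word back, log each mismatch, and return the mismatch count. */
static size_t check_pattern(const volatile uint64_t *buf, size_t words, uint64_t expected)
{
    size_t errors = 0;
    for (size_t i = 0; i < words; i++) {
        uint64_t seen = buf[i];
        if (seen != expected) {
            fprintf(stderr, "SDC at word %zu: expected 0x%016llx, read 0x%016llx\n",
                    i, (unsigned long long)expected, (unsigned long long)seen);
            errors++;
        }
    }
    return errors;
}

/* Hypothetical helper: build a 64-bit pattern one byte at a time from rand().
 * The caller is expected to seed the generator with srand(). */
static uint64_t random_pattern(void)
{
    uint64_t p = 0;
    for (int i = 0; i < 8; i++)
        p = (p << 8) | (uint64_t)(rand() & 0xFF);
    return p;
}

/* Run `iterations` write/sleep/read cycles over a buffer of `bytes` bytes,
 * alternating between two random patterns. Returns the total number of
 * mismatching words, or (size_t)-1 if the allocation fails. */
static size_t scan_memory(size_t bytes, unsigned iterations, unsigned sleep_seconds)
{
    size_t words = bytes / sizeof(uint64_t);
    assert(words > 0);

    volatile uint64_t *buf = malloc(words * sizeof(uint64_t));
    if (buf == NULL)
        return (size_t)-1;

    uint64_t patterns[2] = { random_pattern(), random_pattern() };
    if (patterns[1] == patterns[0])
        patterns[1] = ~patterns[0];   /* make sure the two patterns differ */

    size_t total = 0;
    for (unsigned it = 0; it < iterations; it++) {
        uint64_t p = patterns[it % 2];          /* switch pattern each iteration */
        write_pattern(buf, words, p);
        sleep(sleep_seconds);                   /* give corruption time to occur */
        total += check_pattern(buf, words, p);
    }

    free((void *)buf);
    return total;
}
```

A caller would seed the generator once with `srand()` and then invoke, for example, `scan_memory(1UL << 30, 10, 60)` to scan 1 GiB for ten iterations with a one-minute sleep. The buffer is declared `volatile` in this sketch so that the compiler does not optimize away the read-back comparison.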

sdcfinder's People

Contributors

kellekai, undertherain, leobago, hopobcn



sdcfinder's Issues

CMAKE version (2.6) required is too old

CMake version 2.6 is too old, and accessing the CMake documentation for versions older than 3.0 is a bit tricky.

List of CMake versions in some BSC machines:

  • Marenostrum: 3.5.2 (default), 3.8.2, 3.9.2, 3.9.6
  • Minotauro: 2.8.5 (default), 3.2.1, 3.6.2, 3.7.0, 3.7.2, 3.9.6
  • Nord3: 2.6 (default), 2.8.10, 2.8.12, 3.2.3, 3.5.0, 3.6.2, 3.7.1, 3.10.0
  • CTE-Power8: 3.8.0, 3.8.1, 3.9.6

For everyone's sanity, I would advise raising the minimum required CMake version to at least 3.0.

MemoryReliability binary installation directory

A general Unix convention is to install binaries in bin directories. However, the current CMake behaviour installs the MemoryReliability binary into /usr/local/ instead of /usr/local/bin:

$ make install
[ 83%] Built target MRlib
[100%] Built target MemoryReliability
Install the project...
-- Install configuration: ""
-- Installing: /usr/local/MemoryReliability

Is this an error or on purpose?
