Unnamed namespace causes ODR violations in C++

fastmod

A header file for fast 32-bit division remainders on 64-bit hardware.

How fast? Faster than your compiler can do it!

Compilers cleverly replace divisions by multiplications and shifts, if the divisor is known at compile time. In a hashing benchmark, our simple C code can beat state-of-the-art compilers (e.g., LLVM clang, GNU GCC) on a recent Intel processor (Skylake).

Usage

We support all major compilers (LLVM's clang, GNU GCC, Visual Studio). This library only makes sense when compiling 64-bit binaries.

It is a header-only library but we have unit tests. Assuming a Linux/macOS setting:

make
./unit

The tests are exhaustive and take some time.

You can also build the tests using cmake which will work nearly everywhere (including under Windows).

cmake -B build

To enable the exhaustive tests, do...

cmake -B build -D FASTMOD_EXHAUSTIVE_TESTS=ON

Under Windows, you can run tests as follows:

cmake --build build --config Release
cd build
ctest . --config Release

Users of Visual Studio need to compile to a 64-bit binary, typically by selecting x64 or ARM64 in the build settings. Visual Studio should default on 64-bit builds on 64-bit systems.

Code samples

In C, you can use the header as follows.

#include "fastmod.h"

// unsigned...

uint32_t d = ... ; // divisor, should be non-zero
uint64_t M = computeM_u32(d); // do once

fastmod_u32(a,M,d);// is a % d for all 32-bit unsigned values a.

fastdiv_u32(a,M);// is a / d for all 32-bit unsigned values a, d>1.


is_divisible(a,M);// tells you if a is divisible by d

// signed...

int32_t d = ... ; // should be non-zero and between [-2147483647,2147483647]
int32_t positive_d = d < 0 ? -d : d; // absolute value
uint64_t M = computeM_s32(d); // do once

fastmod_s32(a,M,positive_d);// is a % d for all 32-bit a

fastdiv_s32(a,M,d);// is a / d for all 32-bit a,  d must not be one of -1, 1, or -2147483648

In C++, it is much the same except that every function is in the fastmod namespace so you need to prefix the calls with fastmod:: (e.g., fastmod::is_divisible).

Go version

There is a Go version of this library: https://github.com/bmkessler/fastdiv

(Speculative work) 64-bit benchmark

It is an open problem to derive 64-bit divisions that are faster than what the compiler can produce for constant divisors. For comparisons to native % and / operations, as well as bitmasks, we have provided a benchmark with 64-bit div/mod. You can compile these benchmarks with make benchmark. These require C++11. It is not currently supported under Visual Studio.

Label	128-97	96-64	64-33	32-1
M			h	l
a			0	a
out1			[la	la]
out2		[ha	ha]
out3		[0	0]
out4	[0	0]
result	0	high32(h*a)	high32(la) + low32(ha)	low32(l*a)
		^ What we want

	FASTMOD_API uint64_t mul128_u32(uint64_t lowbits, uint32_t d) {
	return ((__uint128_t)lowbits * d) >> 64;
	}

	// fastdiv computes (a / d) given precomputed M for d>1
	FASTMOD_API uint32_t fastdiv_u32(uint32_t a, uint64_t M) {
	return (uint32_t)(mul128_u32(M, a));
	}

lemire / fastmod Goto Github PK

fastmod's Introduction

fastmod

Usage

Code samples

Go version

(Speculative work) 64-bit benchmark

fastmod's People

Stargazers

Watchers

Forkers

fastmod's Issues

Recommend Projects

Recommend Topics

Recommend Org