Checks <input type="c

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

And furthermore, we actually optimize (x / c).floor()</code

<a class="user-mention notranslate" data-hovercard-type="user" data-hover

Wrong results from `floordiv` about polars HOT 9 CLOSED

knl commented on June 20, 2024

Wrong results from `floordiv`

from polars.

Comments (9)

orlp commented on June 20, 2024 3

After some more internal discussion we've decided that we want to keep the current behavior, where x // y for floats in Polars is defined as (x / y).floor(), and when y is a scalar constant as (x * (1/y)).floor(), even if this subtly diverges from what Python does. The primary rationale is simple: speed. What we do is much faster (up to 25x on one AMD machine, compared to numpy). It is unfortunate that it doesn't match Python or numpy's behavior, but I think it's worth it.

Python implements (as far as I can see) true flooring division in a single logical step, whereas we first do a division followed by a flooring operation. This can be subtly different. Note that in general introducing hard discontinuous points with floating point will almost always result in surprising behavior. For example in Python:

>>> 6.0 / 0.1
60.0
>>> 6.0 // 0.1
59.0

This is because 0.1 isn't exactly representable but in reality is the value 0.1000000000000000055511151231257827021181583404541015625, and thus if you perform the flooring division technically speaking the 'correct' result is ever so slightly smaller than 60, and thus rounds down to 59. Whereas with an intermediate step, where you first divide and then floor you get 60.0, like in Polars:

>>> pl.Series([6.0]) // 0.1
shape: (1,)
Series: '' [f64]
[
        60.0
]

In general in Polars we have the following guideline for floating point operations:

Polars always attempts to provide reasonably accurate results for floating point computations but does not provide guarantees on the error unless mentioned otherwise. Generally speaking 100% accurate results are infeasibly expensive to acquire (requiring much larger internal representations than 64-bit floats), and thus some error is always to be expected.

I think (x / y).floor() is a reasonable enough implementation for flooring division, it just happens that by the very nature of a discontinuous operation like floor that any tiny errors can be immensely magnified. But one also has to ask, what is the error on the input of the operation? Usually one does not read in 100% accurate floating point numbers, do one operation, and expect to write out 100% accurate floating point numbers. There generally is a data pipeline which introduces some sort of floating point error along its path, which reduces the value added in having a fused divide-and-floor operation.

Take your motivating example. You start with the integer value 1391332687909454210 and cast it to a float. But this conversion in itself already introduces error:

>>> 1391332687909454210 - int(float(1391332687909454210))
-126

For your specific example of converting integer nanosecond timestamps to a whole number of 'jiffies', I'd suggest the following using entirely integer arithmetic, making sure not to overflow since your example values are very close to the limits of a 64-bit signed integer:

JIFFIES_PER_SECOND = 65536
NANOSECONDS_PER_SECOND = 10**9

# Logical calculation: (pl.col.ts / NANOSECONDS_PER_SECOND * JIFFIES_PER_SECOND).floor()
# Calculation without risk of overflow, or using floating point:
whole_seconds = pl.col.ts // NANOSECONDS_PER_SECOND
rest_nanosec = pl.col.ts % NANOSECONDS_PER_SECOND
jiffies = whole_seconds * JIFFIES_PER_SECOND + rest_nanosec * JIFFIES_PER_SECOND // NANOSECONDS_PER_SECOND

from polars.

MarcoGorelli commented on June 20, 2024

Thanks for the report

Labelling as high prio as silently incorrect results can potentially break analyses

from polars.

ritchie46 commented on June 20, 2024

@orlp can you take a look?

from polars.

orlp commented on June 20, 2024

First I note that 1e9 / JIFFIES_IN_ONE_SECOND is the value 15258.7890625 exactly (this can be exactly represented by a f64 without loss of precision).

Unfortunately 1391332687909454210 can not be exactly represented by a f64. The nearest representable f64 is 1391332687909454336, which Polars converts to before doing a float-float flooring division (and I believe Python to do the same).

So in reality you are calculating 1391332687909454336 / 15258.7890625. If we put this into WolframAlpha we find the full correct answer:

Now, in Polars we compute floating point flooring division as (x / y).floor(). Unfortunately the closest floating point number to 91182379034833 + 1951883/1953125 is 91182379034834, after which the floor has no effect. This can also be seen in Python:

>>> 1391332687909454210 / (1e9 / JIFFIES_IN_ONE_SECOND)
91182379034834.0
>>> 1391332687909454210 // (1e9 / JIFFIES_IN_ONE_SECOND)
91182379034833.0

Python computes floating point flooring division as (x - x % y) / y followed by a bunch of ifs to fix a variety of corner cases. This is overall more accurate, but I'm not entirely convinced it's worth the performance penalty.

from polars.

orlp commented on June 20, 2024

And furthermore, we actually optimize (x / c).floor() where c is a constant to (x * (1/c)).floor(). This is an optimization I'm particularly unwilling to let go of.

import timeit
import polars as pl
import numpy as np

a = np.random.rand(10**7)
pa = pl.Series(a)
print(timeit.timeit(lambda: a // 42.0, number=100))   # 2.65
print(timeit.timeit(lambda: pa // 42.0, number=100))  # 0.83

from polars.

mcrumiller commented on June 20, 2024

we actually optimize (x / c).floor() where c is a constant to (x * (1/c)).floor()

Does 1/c have a fast path in floating point arithmetic? Curious how two ops is faster than one otherwise. Edit: I see, precompute 1/c and then it's a single mult for all elements.

from polars.

knl commented on June 20, 2024

I understand your point and I see how it matches the rest of the rationale for handling floats.

However, I would still kindly ask for a change in the documentation for the function, as it says:

Method equivalent of integer division operator expr // other.

clearly, they are not equivalent as these two give different results. I was already computing correct jiffies in python using \\, and was quite surprised that floordiv diverges.

Thanks for the suggestion, I actually started using it after discovering the issue. The way I got there is to convert seconds to jiffies, that take the rest, as it doesn’t overflow.

from polars.

orlp commented on June 20, 2024

However, I would still kindly ask for a change in the documentation for the function

@knl I will open an issue to improve the documentation of the floor division.

I was already computing correct jiffies in python using //, and was quite surprised that floordiv diverges.

Well, if you were using n // (1e9 / JIFFIES_IN_ONE_SECOND) in Python, I'm afraid that also wasn't correct. Take for example 4611686021 seconds:

>>> 4611686021 * 65536  # Clearly correct.
302231455072256
>>> 4611686021000000000 // (1e9 / JIFFIES_IN_ONE_SECOND)  # Oops.
302231455072255.0

If you want exact integer results for exact integer inputs, you're better off avoiding floating-point arithmetic unless you're very careful about error bounds.

from polars.

knl commented on June 20, 2024

@knl I will open an issue to improve the documentation of the floor division.

thanks a lot @orlp. And many thanks for the explanation of the issue, it’s quite educational!

Well, if you were using n // (1e9 / JIFFIES_IN_ONE_SECOND) in Python, I'm afraid that also wasn't correct. Take for example 4611686021 seconds:
>>> 4611686021 * 65536  # Clearly correct.
302231455072256
>>> 4611686021000000000 // (1e9 / JIFFIES_IN_ONE_SECOND)  # Oops.
302231455072255.0
If you want exact integer results for exact integer inputs, you're better off avoiding floating-point arithmetic unless you're very careful about error bounds.

hah, true! I’m avoiding floats as much as possible exactly for the reasons you mention. Dealing with these timestamps produces a lot of surprises when there are silent conversions to floats. It was painful with pandas to do a simple shift, as that would turn the missing element to null, turning the whole column to float and rounding to possible float values. One of the things I like with polars is that integers are explicit and sneaking in null values doesn’t produce silent conversions to floats.

from polars.

Wrong results from `floordiv` about polars HOT 9 CLOSED

Comments (9)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent