On an M1 on macOS, when profiling with Tracy, ReaderWriterQueue's try_enqueue / try_dequeue sometimes seem to cause spikes of several milliseconds (readerwriterqueue issue, closed)

IliasBergstrom commented on June 3, 2024
On an M1 on macOS, when profiling with Tracy, ReaderWriterQueue's try_enqueue / try_dequeue sometimes seem to cause spikes of several milliseconds.

from readerwriterqueue.

Comments (6)

cameron314 commented on June 3, 2024

There's no loop in try_dequeue, so whatever is causing the 1 ms delay is not executing code. Perhaps the thread was pre-empted?
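To illustrate what "no loop" means for a call like try_dequeue, here is a minimal sketch of a single-producer, single-consumer ring buffer. It is not readerwriterqueue's implementation; the class name SpscRing and its fixed-capacity layout are assumptions made only for illustration. Each call performs a bounded number of steps and returns, so under these assumptions a multi-millisecond pause cannot come from the call spinning or waiting.

```cpp
#include <array>
#include <atomic>
#include <cstddef>

// Hypothetical fixed-capacity SPSC ring buffer, for illustration only.
// This is NOT the code used by readerwriterqueue.
// One slot is kept empty, so the usable capacity is N - 1.
template <typename T, std::size_t N>
class SpscRing {
public:
    // Call from the producer thread only. Returns false when the ring is full.
    bool try_enqueue(const T& value) {
        const std::size_t tail = tail_.load(std::memory_order_relaxed);
        const std::size_t next = (tail + 1) % N;
        if (next == head_.load(std::memory_order_acquire)) {
            return false;  // full: fail immediately instead of waiting
        }
        slots_[tail] = value;
        tail_.store(next, std::memory_order_release);  // publish the element
        return true;
    }

    // Call from the consumer thread only. Returns false when the ring is empty.
    bool try_dequeue(T& out) {
        const std::size_t head = head_.load(std::memory_order_relaxed);
        if (head == tail_.load(std::memory_order_acquire)) {
            return false;  // empty: fail immediately instead of waiting
        }
        out = slots_[head];
        head_.store((head + 1) % N, std::memory_order_release);  // free the slot
        return true;
    }

private:
    std::array<T, N> slots_{};
    std::atomic<std::size_t> head_{0};  // next index to read
    std::atomic<std::size_t> tail_{0};  // next index to write
};
```

Neither function contains a loop, a lock, or a system call, so any long stall observed around such a call has to come from outside it, for example from the scheduler taking the thread off the core.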

IliasBergstrom commented on June 3, 2024

Thank you Cameron for the quick reply!

That's a very good suggestion - I was assuming that these lock-free calls would always be fast irrespective of the thread they're invoked in (real-time audio thread or "worker"), but I think I'm setting myself up for false positives there. I will make the needed changes to my test setup so that only invocations from the real-time thread are registered!

IliasBergstrom commented on June 3, 2024

I have now ensured that I only log from within the real-time (audio) thread, and indeed there are far fewer spikes!

I still see some spikes, though, so I will test on an Intel Mac tomorrow. I could also run Tracy directly against your benchmark on the M1, to separate the queue code from my own.

[Image: Screenshot 2022-11-29 at 16 34 13]
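One way to restrict measurements to the real-time thread, as described in the comment above, is a per-thread flag checked by a scoped timer. The sketch below uses only the C++ standard library rather than Tracy's API, and every name in it (g_is_realtime_thread, LatencyLog, ScopedRealtimeTimer) is hypothetical.

```cpp
#include <array>
#include <chrono>
#include <cstddef>
#include <cstdint>

// Hypothetical per-thread flag: set it to true once, at the start of the
// audio (real-time) thread. Every other thread keeps the default of false.
thread_local bool g_is_realtime_thread = false;

// Hypothetical fixed-size, per-thread sample store. It never allocates,
// so recording a sample is safe inside a real-time callback.
struct LatencyLog {
    static constexpr std::size_t kCapacity = 1024;
    std::array<std::int64_t, kCapacity> nanos{};
    std::size_t count = 0;

    void record(std::int64_t ns) {
        if (count < kCapacity) {
            nanos[count++] = ns;  // extra samples are dropped once full
        }
    }
};

thread_local LatencyLog g_log;

// Measures the lifetime of the enclosing scope, but only on a thread that
// has set g_is_realtime_thread. On other threads it does nothing.
class ScopedRealtimeTimer {
    using Clock = std::chrono::steady_clock;

public:
    ScopedRealtimeTimer() : active_(g_is_realtime_thread) {
        if (active_) {
            start_ = Clock::now();
        }
    }

    ~ScopedRealtimeTimer() {
        if (active_) {
            const auto elapsed = Clock::now() - start_;
            g_log.record(
                std::chrono::duration_cast<std::chrono::nanoseconds>(elapsed)
                    .count());
        }
    }

private:
    bool active_;
    Clock::time_point start_{};
};
```

Wrapping a queue call in a block that declares a ScopedRealtimeTimer records a sample only when the call runs on the flagged thread, so pre-emption of lower-priority worker threads does not show up as a false positive.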

cameron314 commented on June 3, 2024

The microsecond range can be explained by uncached (or cached but requiring cross-core synchronization) memory accesses, but the millisecond range has to be something else.

IliasBergstrom commented on June 3, 2024

I have now run the same code on an Intel Mac. I haven't yet managed to isolate the Tracy logging to the "real-time" audio thread only, but I can already confirm that the behaviour is vastly different: the timings are indeed in the microsecond or nanosecond range.

For the comparison to be direct, I still need to restrict logging to the real-time thread on the Intel Mac as well. I also still need to run the test with Tracy on your benchmark code, to rule out user error, which is of course still the most likely cause.

IliasBergstrom commented on June 3, 2024

The cause was indeed that some threads were being aggressively pre-empted on the M1. By invoking the following in each such thread:
pthread_set_qos_class_self_np(QOS_CLASS_USER_INTERACTIVE, 0);
I got performance comparable to the Intel Mac I tested on.
Some performance issues still remain, but I would be very surprised if they are caused by ReaderWriterQueue. If they seem to be, I'll re-open this, but for now it's best to close. Thank you!
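For readers who want to try the same fix, the call can be wrapped in a small helper and invoked at the top of each affected thread's entry function. This is a minimal sketch: the helper name raise_current_thread_qos is hypothetical, and the non-Apple branch is only a placeholder so that the code compiles elsewhere.

```cpp
#if defined(__APPLE__)
#include <pthread.h>
#include <pthread/qos.h>
#endif

// Hypothetical helper. Requests the user-interactive QoS class for the
// calling thread and returns true if the request succeeded.
// It affects only the thread that calls it, so each thread that must not be
// aggressively pre-empted calls it once at the start of its entry function.
bool raise_current_thread_qos() {
#if defined(__APPLE__)
    // pthread_set_qos_class_self_np returns 0 on success.
    return pthread_set_qos_class_self_np(QOS_CLASS_USER_INTERACTIVE, 0) == 0;
#else
    // Placeholder: QoS classes are an Apple-specific mechanism, so this
    // sketch reports failure on other platforms.
    return false;
#endif
}
```

Checking the return value is worthwhile, because a silently failed request would leave the thread at its default QoS class and the spikes would remain.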
