orottier / web-audio-api-rs

A Rust implementation of the Web Audio API, for use in non-browser contexts

Home Page: https://docs.rs/web-audio-api/

License: MIT License

Rust 100.00%
audio rust dsp web-audio-api webaudio

web-audio-api-rs's People

Contributors

b-ma, corvusprudens, dependabot[bot], jerboas86, ork821, orottier, uklotzde


web-audio-api-rs's Issues

Make AudioBuffer more compliant

There are some differences between the Rust AudioBuffer API and the Web Audio spec's AudioBuffer API (see the spec for more details).

Possible solution:

  • Replace the current implementation with the following interface: AudioBuffer::sample_rate(&self) -> f32
  • Rename the sample_len() function to length()
  • Rename the channel_data function to get_channel_data
  • Rename the channel_data_mut function to get_channel_data_mut
  • Make the from_channels function private and add a public copy_to_channel function
  • Add a public copy_from_channel function

There is also the concept of "acquiring the content" in the spec (see the spec for more details). I think it could be emulated by systematically using move semantics with AudioBuffer. What do you think?
If so, it should be documented.
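
A rough sketch of the proposed spec-aligned surface (the field layout is hypothetical and only illustrates the renamed methods):

pub struct AudioBuffer {
    channels: Vec<Vec<f32>>,
    sample_rate: f32,
}

impl AudioBuffer {
    pub fn sample_rate(&self) -> f32 {
        self.sample_rate
    }

    /// Spec name for the former `sample_len()`
    pub fn length(&self) -> usize {
        self.channels.first().map(Vec::len).unwrap_or(0)
    }

    /// Spec name for the former `channel_data()`
    pub fn get_channel_data(&self, index: usize) -> &[f32] {
        &self.channels[index]
    }

    pub fn get_channel_data_mut(&mut self, index: usize) -> &mut [f32] {
        &mut self.channels[index]
    }

    /// copyToChannel in the spec: copy `source` into channel `index`
    pub fn copy_to_channel(&mut self, source: &[f32], index: usize) {
        let dest = &mut self.channels[index];
        let len = dest.len().min(source.len());
        dest[..len].copy_from_slice(&source[..len]);
    }

    /// copyFromChannel in the spec: copy channel `index` into `dest`
    pub fn copy_from_channel(&self, dest: &mut [f32], index: usize) {
        let src = &self.channels[index];
        let len = dest.len().min(src.len());
        dest[..len].copy_from_slice(&src[..len]);
    }
}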

Prevent allocations on every quantum in AudioParamProcessor

I profiled heap allocations with heaptrack and found that the bottleneck is the AudioParamProcessor::tick method, which seems to allocate on every call (at least for A-rate parameters).

impl AudioParamProcessor {
    pub fn value(&self) -> f32 {
        if self.value.is_nan() {
            self.default_value
        } else {
            self.value.clamp(self.min_value, self.max_value)
        }
    }

    fn tick(&mut self, ts: f64, dt: f64, count: usize) -> Vec<f32> {
        // store incoming automation events in sorted queue
        for event in self.receiver.try_iter() {
            self.events.push(event);
        }

        // setup return value buffer
        let a_rate = self.automation_rate == AutomationRate::A;
        let mut result = if a_rate {
            // empty buffer
           ////////// I think this is where the allocation happens //////////////////////////
            Vec::with_capacity(count)
        } else {
            // filling the vec already, no expensive calculations are performed later
            vec![self.value(); count]
        };

I am learning the profiling tools in the process, so what I am saying should be double-checked to be sure.
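
A sketch of one way to avoid the per-call allocation: give the processor a scratch buffer that is allocated once and reused every quantum (the field and the changed return type are assumptions, not the current code):

struct AudioParamProcessor {
    // ... existing fields ...
    /// hypothetical scratch buffer, allocated once and reused every quantum
    buffer: Vec<f32>,
}

impl AudioParamProcessor {
    fn tick(&mut self, _ts: f64, _dt: f64, count: usize) -> &[f32] {
        // reuse the pre-allocated storage instead of building a new Vec
        self.buffer.clear();
        self.buffer.resize(count, 0.0); // no allocation once capacity >= count

        // ... fill self.buffer with the automation values as before ...

        &self.buffer
    }
}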

Originally posted by @Jerboas86 in #15 (comment)

Support zero delay in DelayNode

The current implementation does not allow for zero delay. The minimum delay is one render quantum because we split the DelayNode up front into a DelayWriter and a DelayReader.

As discussed in #70 :

The idea would be to create a dummy connection between DelayReader and DelayWriter in the DelayNode::new constructor. This connection would do almost nothing (the writer does not use its output and the reader does not use its input) except guarantee the processing order when the delay is not in a loop. During graph processing, if the node is found in a cycle, the graph could simply delete this connection and flag the reader as "in_cycle" so that it knows it must clamp its minimum delay to the quantum duration. (I guess that's not far from what is described in the spec, and I don't even think this needs to be reversible, e.g. once in a cycle the node behaves like that forever even if the cycle is broken later, which is a weird edge case anyway where users should know what they are doing.)
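
A minimal sketch of the reader-side clamping described above (field and parameter names are hypothetical):

/// Hypothetical reader state: the graph sets `in_cycle` when it breaks the
/// dummy writer -> reader connection because the delay sits inside a cycle.
struct DelayReader {
    in_cycle: bool,
    delay_time: f64, // seconds, as set by the user
}

impl DelayReader {
    fn effective_delay(&self, quantum_duration: f64) -> f64 {
        if self.in_cycle {
            // inside a cycle the delay can never be shorter than one render quantum
            self.delay_time.max(quantum_duration)
        } else {
            self.delay_time
        }
    }
}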

How to deal with related web concepts (<audio> element, WebRTC, MediaCapture)

As mentioned in #82:

To be fair I'm struggling a bit with the 'boundaries' of the Web Audio API also in other cases. For #77 I was playing around with microphone input and playback and realized I needed to have something like https://developer.mozilla.org/en-US/docs/Web/API/MediaRecorder

There are a few conflicting goals:

I will create a separate issue for this. It's probably best to steer away from names such as MediaStream and MediaElement because, as you pointed out, these are different from the w3c specs. I do think we should include them in the library, because you would need them in many applications anyway.

Originally posted by @orottier in #82 (comment)

Add `AudioContext::with_capacity` and strictly deny new allocations

The gold standard in audio processing is that the render thread should be 100% allocation-free.
This can be achieved by allocating up front and (optionally) denying any new allocations.

What I think we need is a

struct AudioContextCapacity {
    max_nodes: usize,
    max_audio_quanta_in_flight: usize,
    max_control_messages: usize,
}

max_nodes should be used as the capacity of the Graph fields:

  • nodes (a HashMap currently)
  • ordered, marked, marked_temp, in_cycle (Vec)
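
A sketch of pre-sizing those Graph containers from max_nodes (the constructor and field layout are hypothetical; names follow the bullets above):

use std::collections::HashMap;

// Hypothetical graph storage, pre-sized from AudioContextCapacity so the
// render thread never has to grow these containers.
struct GraphStorage {
    nodes: HashMap<u64, ()>, // placeholder value type
    ordered: Vec<u64>,
    marked: Vec<u64>,
    marked_temp: Vec<u64>,
    in_cycle: Vec<u64>,
}

impl GraphStorage {
    fn with_capacity(max_nodes: usize) -> Self {
        Self {
            nodes: HashMap::with_capacity(max_nodes),
            ordered: Vec::with_capacity(max_nodes),
            marked: Vec::with_capacity(max_nodes),
            marked_temp: Vec::with_capacity(max_nodes),
            in_cycle: Vec::with_capacity(max_nodes),
        }
    }
}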

max_audio_quanta_in_flight should be used as the capacity of the AllocInner.pool Vec.

The meaning of "audio quanta in flight" is a bit hard to describe; technically it is the sum of all output port counts * channel counts of the leaf nodes of the audio graph.
For example, if your graph is entirely linear (e.g. buffersource -> biquad -> gain -> speakers) and emits on 2 channels, a max of 2 AudioRenderQuantumChannels should suffice.
If your graph forms a perfectly balanced binary tree with 10 leaves, each having 2 outputs with 2 channels, it could theoretically have 40 AudioRenderQuantumChannels concurrently in flight.
(I realize this is not quite true because output buffers are not freed after each individual processor runs; todo.)

max_control_messages is the Control-Render communication channel capacity.

We could add a cfg(debug) assertion, using the alloc_counter crate (or equivalent), that no new allocations are made. This will probably directly surface some allocator calls we need to address:

  • Graph.add_node allocates Vecs for the input and output containers (this can be fixed by allocating in the control thread and doing an unsafe transfer to the render thread)
  • Node.outgoing_edges is a smallvec now with a stack capacity of 2 items; todo: figure out a plan here
  • When a node is decommissioned, some containers are deallocated. Idea: ship them back to the control thread?

Add channelCount Constraints to ChannelConfigOptions

Channel configuration is constrained for some nodes (more details in the spec). I ran into this problem while trying to implement the ConvolverNode API.

Currently, these constraints do not seem to be implemented.

I think the most compliant implementation would be to panic when building the constrained node. Another solution would be to catch the constraints in the type system. What do you think?
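
As an illustration of the panicking approach, a sketch of the check the ConvolverNode constructor could perform (type names are simplified stand-ins; the limits, channelCount of 1 or 2 and a channelCountMode other than "max", come from the spec):

// Hypothetical option types, only to illustrate where the check would live.
#[derive(PartialEq)]
enum ChannelCountMode {
    Max,
    ClampedMax,
    Explicit,
}

struct ChannelConfigOptions {
    count: usize,
    mode: ChannelCountMode,
}

fn validate_convolver_channel_config(opts: &ChannelConfigOptions) {
    // the spec mandates a NotSupportedError here; panicking is the most
    // direct Rust translation until a proper error type exists
    assert!(
        opts.count == 1 || opts.count == 2,
        "NotSupportedError: ConvolverNode channelCount must be 1 or 2"
    );
    assert!(
        opts.mode != ChannelCountMode::Max,
        "NotSupportedError: ConvolverNode channelCountMode cannot be 'max'"
    );
}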

Possible memory leak?

Hi,

First, thanks for your work on this promising project.

I was playing with your library to create some Node.js bindings (using https://napi.rs/) and, after some minimal work to wrap your code, I wrote this small JavaScript test. However, the sound rapidly starts to be very noisy (a kind of nice noise, but not the kind we would expect...), with the script rapidly growing to take almost 100% of a processor:

function triggerSine(audioContext) {
  const sine = audioContext.createOscillator();
  sine.connect(audioContext.destination);

  const now = audioContext.currentTime;
  sine.start(now);
  sine.stop(now + 0.03);
}

(async function() {
  const audioContext = new AudioContext();
  await audioContext.resume();
  setInterval(() => triggerSine(audioContext), 0.05 * 1000);
}());

While I agree this may appear as a kind of "aggressive" test, this kind of pattern is quite common, e.g. for granular synthesis or for generative music.

To make sure the problem does not come from my own code, I rewrote the test in pure Rust but ran into the exact same issue:

use web_audio_api::context::{AsBaseAudioContext, AudioContext};
use web_audio_api::node::{
  AudioNode, AudioScheduledSourceNode, OscillatorType, PeriodicWaveOptions,
};
use std::{thread, time};

fn trigger_sine(audio_context: &AudioContext) {
  let osc = audio_context.create_oscillator();
  osc.connect(&audio_context.destination());

  let now = audio_context.current_time();
  osc.start_at(now);
  osc.stop_at(now + 0.03)
}

fn main() {
  let audio_context = AudioContext::new();

  // mimic setInterval
  loop {
    trigger_sine(&audio_context);
    thread::sleep(time::Duration::from_millis(50));
  }
}

As I'm a newbie with Rust, I didn't try to dig more deeply into your code, but I would be happy to help the best I can.

Cheers,
Benjamin

Finalize all node constructor structs (e.g. ConstantSourceOptions)

For the v1.0 release, structs with public fields must be finalized since altering them would mean a breaking API change.

  • decide on types (f64 for all float values?)
  • add all fields to all constructor structs, even if the implementation is not yet using them
  • add all enum variants (e.g. PanningModelType) even if the implementation is not yet using them
  • how to handle optional fields?
  • add a Default, Clone and Debug implementation on all structs
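
As an illustration, a sketch of what a finalized ConstantSourceOptions could look like, with the derives from the last bullet and the spec default offset of 1 (whether fields end up f32 or f64 is exactly the first open question):

/// Sketch of a finalized constructor struct
#[derive(Clone, Debug)]
pub struct ConstantSourceOptions {
    pub offset: f32,
}

impl Default for ConstantSourceOptions {
    fn default() -> Self {
        // spec default: offset = 1
        Self { offset: 1. }
    }
}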

MergerNode not working correctly

Executing the following code

use web_audio_api::context::{AsBaseAudioContext, AudioContext};
use web_audio_api::node::{AudioNode, AudioScheduledSourceNode};

fn main() {
    let context = AudioContext::new();

    // Create an oscillator
    let left = context.create_oscillator();

    //Create an oscillator
    let right = context.create_oscillator();
    // set a different frequency to distinguish left from right osc
    right.frequency().set_value(1000.);

    // Create a merger
    let merger = context.create_channel_merger(2);

    // connect left osc to left input of the merger
    left.connect_at(&merger, 0, 0).unwrap();
    // connect right osc to the right input of the merger
    right.connect_at(&merger, 0, 1).unwrap();

    // Connect the merger to speakers
    merger.connect(&context.destination());

    // Start the oscillators
    left.start();
    right.start();

    // connect left osc to splitter

    // enjoy listening
    std::thread::sleep(std::time::Duration::from_secs(4));
}

Expected:

440Hz on the left speaker
1000Hz on the right speaker

Observed:

440Hz + 1000Hz on the left speaker
440Hz + 1000Hz on the right speaker

Possible cause:

I think the root cause is the Graph::add_node() function, which always allocates mono AudioBuffers for the outputs:

    pub fn add_node(
        &mut self,
        index: NodeIndex,
        processor: Box<dyn AudioProcessor>,
        inputs: usize,
        outputs: usize,
        channel_config: ChannelConfig,
    ) {
        // todo, allocate on control thread, make single alloc..?
        let inputs = vec![AudioBuffer::new(self.alloc.silence()); inputs];
        let outputs = vec![AudioBuffer::new(self.alloc.silence()); outputs];

        [...]
    }

The majority of audio nodes have 1 output, including MergerNode, but MergerNode has N channels in this output stream,
N corresponding to its number of inputs.

There is a single output whose audio stream has a number of channels equal to the number of inputs when any of the inputs is actively processing.

There is hidden state in the spec that makes it difficult to read; my understanding is that a node has:

  • Inputs and/or Outputs

  • Inputs and Outputs are streams

  • A stream is composed of one or more channels

  • A channel is a mono audio data array

  • numberOfInputs == number of input streams

  • numberOfOutputs == number of output streams

  • channelCount is NOT number of input channels

  • channelCount is NOT number of output channels

  • computedChannelCount is the number of input channels

  • computedChannelCount is computed from channelCount, channelCountMode, and channelInterpretation (see the spec)

  • [hidden] computedOutputChannelCount == number of output channels. In the case of AudioWorklet, the number of output channels is not computed but explicitly specified by the outputChannelCount parameter.

  • computedOutputChannelCount depends on the node type and "actively processing" state of the node

Possible solution:

  • Add computed_channel_count()
  • Add computed_output_channel_count()
    take a look at the AudioNode trait, the Node struct, and channel_config
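
For reference, a sketch of the spec rule that the proposed computed_channel_count() would implement (the function signature is hypothetical):

enum ChannelCountMode {
    Max,
    ClampedMax,
    Explicit,
}

/// computedNumberOfChannels per the spec: derived from the channel counts of
/// all connections to an input, the node's channelCount and its channelCountMode.
fn computed_channel_count(
    input_connection_channels: &[usize],
    channel_count: usize,
    mode: ChannelCountMode,
) -> usize {
    // with no connections the input is effectively silent; default to mono here
    let max_input = input_connection_channels.iter().copied().max().unwrap_or(1);
    match mode {
        ChannelCountMode::Max => max_input,
        ChannelCountMode::ClampedMax => max_input.min(channel_count),
        ChannelCountMode::Explicit => channel_count,
    }
}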

Improve error messages in decode_audio_data

One small thing is that the error messages are a bit confusing. Nothing really important, but maybe it would be better to override them with some generic message (on the other hand, that would possibly make further debugging more complicated, I don't really know):

> --------------------------------
> Error decoding audio file: "samples/empty_2c.wav"
> IoError(Custom { kind: UnexpectedEof, error: "end of stream" })
> --------------------------------
> --------------------------------
> Error decoding audio file: "samples/corrupt.wav"
> IoError(Custom { kind: UnexpectedEof, error: "end of stream" })
> --------------------------------
> --------------------------------
> Error decoding audio file: "samples/sample.aiff"
> DecodeError("mp3: invalid mpeg audio layer")
> --------------------------------
> --------------------------------
> Error decoding audio file: "samples/sample.webm"
> DecodeError("mp3: invalid main_data_begin")
> --------------------------------

Originally posted by @b-ma in #87 (comment)

Handle non-trivial tail time for some nodes

Nodes:

  • DelayNode
  • BiquadFilterNode
  • IIRFilterNode
  • WaveShaperNode
  • DynamicsCompressorNode
  • (wip) ConvolverNode

From the spec at https://webaudio.github.io/web-audio-api/#iirfilternode

tail-time: Yes. Continues to output non-silent audio with zero input. Since this is an IIR filter, the filter produces non-zero input forever, but in practice, this can be limited after some finite time where the output is sufficiently close to zero. The actual time depends on the filter coefficients.

Originally posted by @orottier in #27 (comment)

Support all AudioParam automation methods

File: param.rs
Relevant section in spec: https://webaudio.github.io/web-audio-api/#AudioParam

Implement automation methods

  • SetValueAtTime
  • LinearRampToValueAtTime
  • ExponentialRampToValueAtTime
  • SetTargetAtTime
  • SetValueCurveAtTime
  • CancelScheduledValues
  • CancelAndHoldAtTime
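
For reference, a sketch of the intended usage once these are implemented; the snake_case names are assumed to mirror the spec methods listed above:

use web_audio_api::context::{AsBaseAudioContext, AudioContext};
use web_audio_api::node::{AudioNode, AudioScheduledSourceNode};

fn main() {
    let context = AudioContext::new();
    let osc = context.create_oscillator();
    osc.connect(&context.destination());
    osc.start();

    let freq = osc.frequency();
    let now = context.current_time();

    // names assumed to mirror the spec methods listed above
    freq.set_value_at_time(440., now);
    freq.linear_ramp_to_value_at_time(880., now + 1.);
    freq.exponential_ramp_to_value_at_time(220., now + 2.);
    freq.set_target_at_time(440., now + 2., 0.1);

    std::thread::sleep(std::time::Duration::from_secs(3));
}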

Improvements / Spec

  • spec - All panic calls should be replaced by clean and compliant Errors
  • spec - Automation methods should return self or Result(self, Error) to allow method chaining
  • spec - Sanity check all method arguments (e.g. times, time_constant)
  • refactoring - Review all AudioNode::new to remove set_value or set_value_at_time; default values should be properly initialised at instantiation without explicitly calling an automation method (this may be a bad idea as it would create confusion between the param default value and a user-defined default value)
  • perf - set_target_at_time can run forever as it has no "real" end time or value; a possible performance improvement would be to internally cancel the event when intrinsic_value is close enough to the target (note that Chrome implements such a strategy)

To be discussed further

  • Maintain the AutomationEventQueue in the control thread instead of the audio thread (e.g. Arc<Mutex<AutomationEventQueue>>?) - this may be necessary for proper Error handling (cf. discussion in #62 (comment))
  • Possibly review the set_value_curve_at_time signature to use Vec<f32> instead of &[f32] (cf. discussion in #62 (comment))

Increase margins in tests when checking signal values

On some hardware, our test suite fails when checking the values of the output signal. We're using float_eq with ulps margins, but are now considering testing with absolute value margins.

Differences of up to 1e-7 will absolutely not be audible, so this is a margin we can stick to in tests.

See the discussion by @Jerboas86 in #52 (comment)

Suggestions:

  • Maybe we should switch to an absolute tolerance comparison. ULPs are difficult to reason about, and since we are using values from a small interval, absolute error should be fine.
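
A sketch of the absolute-tolerance comparison, assuming the float_eq crate already used by the test suite:

use float_eq::assert_float_eq;

/// Hypothetical helper checking a rendered quantum against the expected
/// signal with the 1e-7 absolute margin discussed above.
fn assert_signal_close(rendered: &[f32], expected: &[f32]) {
    for (r, e) in rendered.iter().zip(expected.iter()) {
        // absolute tolerance instead of ulps: differences below 1e-7 are inaudible
        assert_float_eq!(*r, *e, abs <= 1e-7_f32);
    }
}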

Decide on convention for optional fields and optional/default arguments

Optional fields from the spec

// This specifies options for constructing a DelayNode. All members are optional;
// if not given, the node is constructed using the normal defaults.

dictionary DelayOptions : AudioNodeOptions {
  double maxDelayTime = 1;
  double delayTime = 0;
};

Default and optional arguments from the spec

  undefined start (optional double when = 0,
                   optional double offset,
                   optional double duration);
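
One possible Rust convention, sketched for DelayOptions: mirror the dictionary with a plain struct whose Default impl carries the spec defaults, and express truly optional arguments with Option:

/// Rust mirror of the DelayOptions dictionary above
#[derive(Clone, Debug)]
pub struct DelayOptions {
    pub max_delay_time: f64,
    pub delay_time: f64,
}

impl Default for DelayOptions {
    fn default() -> Self {
        // spec defaults: maxDelayTime = 1, delayTime = 0
        Self {
            max_delay_time: 1.,
            delay_time: 0.,
        }
    }
}

// For default/optional arguments such as start(when, offset, duration), one
// option (hypothetical signature) is to make the truly optional parts Option:
// fn start_at_with_offset(&self, when: f64, offset: f64, duration: Option<f64>)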

Add AudioContextOptions to AudioContext constructor

In the spec, an AudioContext can be built with an optional AudioContextOptions argument:

dictionary AudioContextOptions {
  (AudioContextLatencyCategory or double) latencyHint = "interactive";
  float sampleRate;
};

I think we should extend these options like this:

dictionary AudioContextOptions {
  (AudioContextLatencyCategory or double) latencyHint = "interactive";
  float sampleRate;
  unsigned long channels = 2;
};

This way, the user can specify the number of output channels they would like. The current behavior is that the user only has access to the default configuration given by CPAL.

Should AudioContext::new(options: Option<AudioContextOptions>) be fallible?

My opinion is that it should not. The method should fall back to the CPAL default config if the user-requested config fails,
and provide the supported config in an error message.
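
A sketch of what the Rust-side options could look like (names mirror the dictionary above; the latencyHint union is simplified to an enum here):

/// Rust version of the extended dictionary proposed above
pub enum AudioContextLatencyCategory {
    Interactive,
    Balanced,
    Playback,
}

pub struct AudioContextOptions {
    pub latency_hint: AudioContextLatencyCategory,
    pub sample_rate: Option<f32>, // None = use the CPAL default
    pub channels: u32,            // proposed extension, default 2
}

impl Default for AudioContextOptions {
    fn default() -> Self {
        Self {
            latency_hint: AudioContextLatencyCategory::Interactive,
            sample_rate: None,
            channels: 2,
        }
    }
}

// Infallible constructor as argued above: if the requested config is not
// supported, fall back to the CPAL default and report the supported config.
// pub fn new(options: Option<AudioContextOptions>) -> AudioContext { ... }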

related to #28
related to #32

Context::current_time is not updated correctly

Context::current_time always returns 0

Run the following example to reproduce:

use web_audio_api::context::{AsBaseAudioContext, AudioContext};
use web_audio_api::node::{AudioNode, AudioScheduledSourceNode};

fn main() {
    let context = AudioContext::new();

    let osc1 = context.create_oscillator();
    osc1.connect(&context.destination());
    osc1.frequency().set_value_at_time(440., 0.);
    osc1.start();

    loop {
        println!("Time: {}", context.current_time());
    }
}

The example keeps printing 0:

Time: 0
Time: 0
Time: 0
Time: 0

from web audio api spec:

Thus, for a running context, currentTime increases steadily as the system processes audio blocks, and always represents the time of the start of the next audio block to be processed.

Decide how to deal with fallible operations (e.g. exceptions thrown)

In the spec, exceptions are thrown all over the place (when creating nodes, when updating attributes, when calling methods..)

There are a few options for the Rust version:

  • panic on exception (this is what we do currently)
  • all fallible operations must return a Result<..>
  • all operations must return a Result<..> (for consistency, even infallible ones)
  • panic on exception but offer a try_... method for fallible operations returning a Result
  • ignore errors (log them)

All variants have pros and cons. We need to strike a balance between correctness and pragmatism.

e.g. https://docs.rs/curl/0.4.41/curl/ returns a Result for every operation, meaning your code will be littered with unwraps.

The rust stdlib strikes a reasonable balance between (well documented) panics and offering try_... functions, in my opinion (e.g. https://doc.rust-lang.org/std/cell/struct.RefCell.html#method.try_borrow )
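
A sketch of that last pattern applied to node creation (the error type and the numberOfInputs limit are illustrative, not the library's actual API):

// Hypothetical error type and node-creation pair, only to illustrate the pattern.
#[derive(Debug)]
pub enum AudioError {
    IndexOutOfBounds(usize),
}

pub struct ChannelMergerNode;

pub fn try_create_channel_merger(inputs: usize) -> Result<ChannelMergerNode, AudioError> {
    if inputs == 0 || inputs > 32 {
        return Err(AudioError::IndexOutOfBounds(inputs));
    }
    Ok(ChannelMergerNode)
}

pub fn create_channel_merger(inputs: usize) -> ChannelMergerNode {
    // panicking variant, documented like the stdlib does for borrow()
    try_create_channel_merger(inputs).expect("invalid numberOfInputs")
}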

linear_ramp_to_value_at_time applies the new value immediately at the end time

Observed:

linear_ramp_to_value_at_time(v, end) sets the parameter value immediately at the time specified by the end argument.

Expected:

From the web audio api spec (https://www.w3.org/TR/webaudio/):

linearRampToValueAtTime(value, endTime)

Schedules a linear continuous change in parameter value from the previous scheduled parameter value to the given value

Example to reproduce

Run

use web_audio_api::context::{AsBaseAudioContext, AudioContext};
use web_audio_api::node::{AudioNode, AudioScheduledSourceNode};

fn main() {
    let context = AudioContext::new();

    let osc1 = context.create_oscillator();
    osc1.connect(&context.destination());
    osc1.frequency().set_value_at_time(440., 0.);
    osc1.start();

    osc1.frequency().linear_ramp_to_value_at_time(220., 0.4);
    std::thread::sleep(std::time::Duration::from_millis(1000));
}

And run

use web_audio_api::context::{AsBaseAudioContext, AudioContext};
use web_audio_api::node::{AudioNode, AudioScheduledSourceNode};

fn main() {
    let context = AudioContext::new();

    let osc1 = context.create_oscillator();
    osc1.connect(&context.destination());
    osc1.frequency().set_value_at_time(440., 0.);
    osc1.start();

    osc1.frequency().set_value_at_time(220., 0.4);
    std::thread::sleep(std::time::Duration::from_millis(1000));
}

No difference in the scheduling of the parameter change is observed.

Spurious test failure `src/media.rs - media::MediaElement (line 74)`

The following test fails sometimes. We should probably use some locks or a sleep to make sure output is generated before it is read.

test src/media.rs - media::MediaElement (line 74) ... FAILED

failures:

---- src/media.rs - media::MediaElement (line 74) stdout ----
Test executable failed (exit code 101).

stderr:
thread 'main' panicked at 'called `Result::unwrap()` on an `Err` value: BufferDepletedError', src/media.rs:29:13
note: run with `RUST_BACKTRACE=1` environment variable to display a backtrace

Optimization in audio graph traversal

Some improvements could be made in graph.rs when rendering an audio quantum:

  • Store edges more efficiently (currently a single HashMap that needs to be iterated many times)
  • Use a specialized container type for the nodes. e.g. https://crates.io/crates/intmap
  • Avoid the remove and insert calls of the currently processing node. That was necessary for borrow reasons. Wrap nodes in Cell or equivalent
  • Clear the input buffers when processing of a Node is done (we don't need them anymore and this way they can be reused)
  • When a Node has only one outgoing connection in the graph, its outputs could be moved instead of copied to the next Node's inputs. (This is not useful because the inputs are immutable anyway and need to be copied/mutated to outputs.)
  • Decouple the graph-topology code from the audio-specific code

Rewrite `CpalBackend::build_input` to make it similar to `build_output`

Connection to the audio hardware is made through io::build_output and io::build_input functions.

  • build_output instantiates the output stream.
  • build_input instantiates the input stream.

A rewrite of io::build_output has been made by PR #36. This PR makes it possible for the end user to change output stream parameters like:

  • number of output channels
  • latency settings
  • sample rate
    But this PR stopped there, and the work has not been done for the build_input function.

A rewrite of io::build_input should be made to make stream configuration coherent between the build_output and build_input functions. It would also be helpful if both functions used a similar type system to define their configuration.

Possible panic in race condition in graph cleanup

C = control thread, R = render thread

  1. C: A GainNode goes out of scope. Its fields are dropped in order of their definition [1], so the registration field is dropped before the gain: AudioParam field
  2. C: registration is dropped which sends a FreeWhenFinished event for the Gain Processor
  3. R: a render quantum starts, the event queue so far is read
  4. C: the gain field is dropped, so a FreeWhenFinished event is sent for the Audio Param processor, which will be picked up in the next quantum
  5. R: render quantum ends. If the gain node has no inputs left, it is removed from the graph. From the logic you mentioned, the audio param is removed too
  6. R: next render quantum: the FreeWhenFinished event for the gain param is handled, but it was already removed!

Cool stuff. The fix will be simple (just ignore the case where the node is missing)
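
A sketch of that fix (container and id types are simplified):

use std::collections::HashMap;

struct Graph {
    nodes: HashMap<u64, ()>, // placeholder node type
}

impl Graph {
    fn handle_free_when_finished(&mut self, node_id: u64) {
        // the node may already have been dropped by the cleanup pass of the
        // previous quantum (the race described above), so ignore a missing entry
        let _ = self.nodes.remove(&node_id);
    }
}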

[1] https://github.com/rust-lang/rfcs/blob/master/text/1857-stabilize-drop-order.md

Originally posted by @orottier in #90 (comment)

Support sub-quantum scheduling

The following processors only support scheduling at the granularity of a BUFFER_SIZE quantum; this should be improved:

  • ConstantSourceRenderer
  • AudioBufferSourceRenderer
  • OscillatorRenderer
  • DelayRenderer
  • MediaStreamRenderer (this does not make much sense)

E.g. in the current implementation only the timestamp at the start of the quantum is evaluated:

// todo, sub-quantum start/stop
match self.scheduler.state(timestamp) {
    ScheduledState::Active => (),
    ScheduledState::NotStarted => {
        output.make_silent();
        return true; // will output in the future
    }
    ScheduledState::Ended => {
        output.make_silent();
        return false; // can clean up
    }
}

Enhancement: Rendering oscillator with wavetable synthesis

The current implementation of OscillatorRenderer is based on the transcendental function sin().

An alternative implementation based on wave table may lead to:

  • Better performance
  • No aliasing for non-band-limited signal types like square, triangle, and sawtooth

The current implementation is non-compliant due to this aliasing behavior (see the web audio spec).

Solution:

I already have a prototype working for the sine type.
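
For illustration, a minimal sketch of the wavetable approach for the sine type (table size and linear interpolation are implementation choices, not taken from the actual prototype):

/// Pre-computed single-cycle sine table, filled once at construction time.
struct WavetableOscillator {
    table: Vec<f32>,
    phase: f32, // in [0, 1)
}

impl WavetableOscillator {
    fn new(table_size: usize) -> Self {
        let table = (0..table_size)
            .map(|i| (2. * std::f32::consts::PI * i as f32 / table_size as f32).sin())
            .collect();
        Self { table, phase: 0. }
    }

    /// Produce one sample and advance the phase; linear interpolation between
    /// adjacent table entries replaces the per-sample sin() call.
    fn tick(&mut self, frequency: f32, sample_rate: f32) -> f32 {
        let len = self.table.len();
        let pos = self.phase * len as f32;
        let i0 = pos as usize % len;
        let i1 = (i0 + 1) % len;
        let frac = pos - pos.floor();
        let sample = self.table[i0] + frac * (self.table[i1] - self.table[i0]);

        self.phase += frequency / sample_rate;
        self.phase -= self.phase.floor(); // wrap into [0, 1)
        sample
    }
}

Square, triangle and sawtooth would need one band-limited table per octave range (or a similar scheme) to actually remove the aliasing mentioned above.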

May I pursue work in this direction?

[TAUDIO] Audio Streams under Flutter and React Native

Hi everybody,

This is not an issue. Just an information.
We are now working on a port of the Web Audio API for Flutter, React Native and Microsoft MAUI.
For that we need a native implementation of the Web Audio API on several platforms:

  • iOS
  • Android
  • Linux (optional)
  • MacOS (optional)
  • Windows (optional)

We found two existing libraries which can be used:


Your library seems very interesting. We appreciate that it depends on CPAL, and that CPAL depends on Oboe.
This seems like a good design.

Unfortunately we are not experts with Rust.
We will probably need a little help from you to start.

Good luck with your library.

`AudioBuffer`s remarks, questions and some proposals

Hey, I'm opening this issue because, reading the code and trying to understand it, there are still some aspects I struggle with that I think could be clarified and maybe improved.

Disclaimer: I'm just going through my understanding to try to be clear; I don't pretend anything here is ground truth.

So, my impression is that there should be 3 different types of audio buffers while there are only 2 for now:

  1. The buffers used to compute blocks of signal (i.e. alloc::AudioBuffer). Nothing special to say about this one except that it could be renamed to AudioBlock, AudioBus or AudioRenderQuantum to avoid confusion with the AudioBuffer exposed by the API (which would also lead to renaming BUFFER_SIZE to BLOCK_SIZE or RENDER_QUANTUM_SIZE for consistency)

  2. The AudioBuffer as defined by the Web Audio API, which is only consumed by the AudioBufferSourceNode and the ConvolverNode, or returned through a Promise by OfflineContext.startRendering. This one should be completely loaded in memory for very fast and accurate access by the nodes that consume it; audioContext.decodeAudioData should therefore work from an asset (e.g. some audio file) completely loaded in memory (retrieved from the network (an XHR call in JS land) or the file system) and perform the following steps:

      1. decode to PCM data
      2. resample to the actual audioContext.sampleRate
      3. return an AudioBuffer that can be consumed without further processing by e.g. the AudioBufferSourceNode
  3. Some other audio buffer which is more related to a streaming paradigm (data is received on the fly, with questions of buffering, back pressure, etc.) and can be piped to some WebAudio nodes, but is defined by or related to other specs such as WebRTC (microphone, audio streams from the network and MediaStreamAudioSourceNode) or HTML (the <audio> tag and MediaElementAudioSourceNode). In that case decoding and resampling can only be done on the fly when the chunks are received, and the WebAudio nodes do not expose any start, stop or seek possibilities.

In the current implementation, my impression is that the AudioBuffer defined in buffer.rs tries to handle both paradigms 2 and 3 in the same way, and ends up closer to 3 and not really optimized for 2.

Does this make sense or am I mistaken somewhere? If you think it's a good idea, I would be happy to try to propose something really dedicated to 2.

Related issues

Showcase example: OutputBuilder::build panics with ALSA device

Running the showcase example on an ALSA device, OutputBuilder::build panics:

thread 'main' panicked at 'called `Result::unwrap()` on an `Err` value: BackendSpecific { err: BackendSpecificError { description: "ALSA function 'snd_pcm_hw_params_set_buffer_size' failed with error 'EINVAL: Invalid argument'" } }', src/io.rs:81:10
stack backtrace:
   0: rust_begin_unwind
             at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/std/src/panicking.rs:515:5
   1: core::panicking::panic_fmt
             at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/core/src/panicking.rs:92:14
   2: core::result::unwrap_failed
             at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/core/src/result.rs:1355:5
   3: core::result::Result<T,E>::unwrap
             at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/core/src/result.rs:1037:23
   4: web_audio_api::io::OutputBuilder::build
             at ./src/io.rs:64:22
   5: web_audio_api::context::AudioContext::new
             at ./src/context.rs:293:22
   6: showcase::main
             at ./examples/showcase.rs:7:19
   7: core::ops::function::FnOnce::call_once
             at /rustc/a178d0322ce20e33eac124758e837cbd80a6f633/library/core/src/ops/function.rs:227:5
note: Some details are omitted, run with `RUST_BACKTRACE=full` for a verbose backtrace.

If I set BUFFER_SIZE to 256, the program doesn't panic anymore.

But another bug appears: an assertion fails in debug mode (underflow in release mode):

thread 'cpal_alsa_out' panicked at 'assertion failed: `(left == right)`
  left: `78`,
 right: `0`', src/graph.rs:117:9

I think that, due to #495, build_output_stream is not called at a fixed rate on ALSA devices.

Sample rate high on ALSA

context.sample_rate() displays a sample rate of 384 000 Hz on my ALSA system.
This default sample rate is coming from the CPAL default output config.

Possible solution: try to build a stream with a lower sample rate before falling back to the CPAL default sample rate.
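
A sketch of that fallback logic (the supported-rate check would go through CPAL's supported-config query; the helper below is hypothetical):

/// Pick a sensible sample rate: prefer common audio rates that the device
/// supports, otherwise fall back to the device default (e.g. 384 000 Hz here).
fn choose_sample_rate(supported: impl Fn(u32) -> bool, device_default: u32) -> u32 {
    const PREFERRED: [u32; 3] = [44_100, 48_000, 96_000];
    PREFERRED
        .iter()
        .copied()
        .find(|&rate| supported(rate))
        .unwrap_or(device_default)
}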

Add BiquadFilterNode

Add the BiquadFilterNode to web-audio-api-rs to be compliant with the web audio api spec.

Solution:
I have not yet written a prototype, but I think I can reach at least partial compliance for this node.
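
For reference, the core of the node is the standard second-order difference equation from the spec; a direct form I sketch:

/// Direct form I biquad: coefficients are assumed already normalised by a0,
/// as the Web Audio spec defines them per filter type and parameter set.
struct Biquad {
    b0: f32, b1: f32, b2: f32,
    a1: f32, a2: f32,
    // previous inputs and outputs
    x1: f32, x2: f32,
    y1: f32, y2: f32,
}

impl Biquad {
    fn process_sample(&mut self, x: f32) -> f32 {
        // y[n] = b0*x[n] + b1*x[n-1] + b2*x[n-2] - a1*y[n-1] - a2*y[n-2]
        let y = self.b0 * x + self.b1 * self.x1 + self.b2 * self.x2
            - self.a1 * self.y1 - self.a2 * self.y2;
        self.x2 = self.x1;
        self.x1 = x;
        self.y2 = self.y1;
        self.y1 = y;
        y
    }
}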

May I begin to work on it?

Hide unfinished functionality behind a `nightly` feature flag

Some functionality was added but is subject to change. Let's introduce an opt-in nightly feature flag and make unstable functionality only available there.

  • decodeAudioData (finished now)
  • MediaElement (dropped for now)
  • public methods in src/spatial.rs
  • Resampler
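
A sketch of the opt-in gating in Cargo terms (feature name as proposed in the title):

// In Cargo.toml (shown as a comment, since this block is Rust):
//
//   [features]
//   nightly = []
//
// Unfinished items are then compiled only when the flag is enabled:
#[cfg(feature = "nightly")]
pub mod spatial {
    // public methods stay gated here until the API is stabilised
}

#[cfg(feature = "nightly")]
pub struct Resampler;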

Update AudioParam automation rate

https://www.w3.org/TR/webaudio/#dom-audioparam-automationrate

The automation rate, barring some constraints, can be changed by the control thread.

  • change AudioParam automation_rate to something like an AtomicBool
  • add set_automation_rate to impl AudioParam
  • how to deal with constraints? We can panic for now, but where do we encode this logic?
  • fetch automation_rate in the Renderer and use it accordingly
  • reconsider the signature of AudioParamValues.get(..). It currently always returns a slice of full render quantum length, but the spec says it can be of length 1 too (for k-rate params and for constant pieces) [1]. For type safety we could also make it return an enum.
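
A sketch of the shared, atomically updatable automation rate (the AtomicBool idea above, widened to an AtomicU8 so the two variants stay explicit; all names are hypothetical):

use std::sync::atomic::{AtomicU8, Ordering};
use std::sync::Arc;

#[derive(Clone, Copy, Debug)]
enum AutomationRate {
    A,
    K,
}

/// Shared between the control-side AudioParam and its render-side processor.
struct SharedAutomationRate(Arc<AtomicU8>);

impl SharedAutomationRate {
    fn new(rate: AutomationRate) -> Self {
        Self(Arc::new(AtomicU8::new(rate as u8)))
    }

    /// Control thread: set_automation_rate (constraint checks would live here)
    fn store(&self, rate: AutomationRate) {
        self.0.store(rate as u8, Ordering::Release);
    }

    /// Render thread: fetch the rate at the start of each quantum
    fn load(&self) -> AutomationRate {
        match self.0.load(Ordering::Acquire) {
            0 => AutomationRate::A,
            _ => AutomationRate::K,
        }
    }
}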

[1] https://www.w3.org/TR/webaudio/#audioworkletprocess-callback-parameters

Check current API surface

API surface: cargo doc --lib --open

Check for

  • consistency
  • internals accidentally marked pub
  • commit to the current mod layout?
  • commit to the location of the Errors and constants?
  • happy with the BaseAudioContext and AsBaseAudioContext?
  • AudioContext::new(AudioContextOptions) (not wrapped in an Option)
  • remove Scheduler and Controller from public interface, change traits (AudioScheduledSourceNode etc)
  • Check all node constructor options
  • Check all node methods
  • remove public spatial mod
  • mark functions that should return a Promise but currently don't with a _sync suffix
  • rename AudioContextOptions.channels -> number_of_channels
  • create_delay and create_channel_splitter/merger take an optional argument in the spec, but we make it required. Same for create_periodic_wave, but there we take an options struct.
  • Can we drop AudioNode's channel_config_raw and channel_config_cloned?
  • Consistency in AudioNode optional arg: disconnect() , disconnect_from(node)
  • Remove AudioNodeId from public interface
  • stream node constructors only on online AudioContext
