AudioEncoderConfig.latencyMode (or similar) #371

chcunningham · 2021-09-29T21:02:43Z

VideoEncoderConfig has latencyMode (#269).

AudioEncoder has similar concerns around latency (realtime) vs quality. For example, Chrome's opus encoder currently uses 60ms buffers (framesize) to maximize quality. This is in keeping with our default philosophy of "don't constrain features/quality unless the user asks for it". But we should offer realtime users a way to choose a lower-latency value.

An alternative would be to specify a more granular knob for just framesize. Or do both.

stefanholmer · 2022-02-08T08:13:35Z

60 ms is too high as a default for RTC. There's a quite noticeable difference to use 20 ms frames.

I think it could be useful with a frame size knob instead of a latencyMode that implicitly controls the frame size. It can be useful to increase the frame size to 60-120 ms when the bitrate is low (10-16 kbps) in order to reduce packet overhead. It's possible to also do that in the transport layer though, by aggregating frames into a single packet, so the question is if there are compression gains of using larger frame sizes.

tguilbert-google · 2022-09-01T16:54:45Z

Here are my proposals with some context.

Do not add AudioEncoderConfig.frameSize

frameSize is a well defined Opus concept, but it isn't generic enough to fit all codec use cases. For example, FLAC uses block size instead of frame size, and we would need to convert from milliseconds to a number of samples. AAC has a fixed frame size, but has some configurable amount of silent priming frames that affect encoder delay.

Add OpusEncoderConfig.frameSize

From @stefanholmer's comment, having a fine grained tuning knob to experiment with is desirable.
OpusEncoderConfig is already defined, and will likely be expanded when Fixed audio chunk size support #405 is resolved (EDIT: opus configuration (e.g. in-band fec) #244 was the issue I had in mind). Adding an extra field now shouldn't be an issue.
Developers that are already interested in advanced use cases will be able to handle the extra complexity.
Limited risk of breaking changes once this knob is introduced; we wouldn't remove it, and it directly exposes an parameter with no nuance.
The range of valid values will be defined in the codec registry; suggesting frame size for various scenarios might not be as immediately discoverable to developers as a latencyMode flag.
Adding this knob does not prevent us from adding an AudioEncoderConfig.latencyMode in the future.

Do not add AudioEncoderConfig.latencyMode (yet)

A latencyMode flag that mirrors the one we have for video decoders would offer some nice symmetry with VideoEncoderConfig.
latencyMode would choose between two sets of sensible defaults parameters for each codec (frameSize for OPUS, blockSize for FLAC, priming frames for AAC, etc).
The current language behind latencyMode for video is flexible, but we might need to explicitly choose default per-codec parameters to ensure cross-platform and cross-browser consistency. Choosing those defaults could be problematic.
This simple flag might abstract away a lot of details and nuance, which could ultimately surprise developers.
We can always add this flag later, if any usability complaints are surfaced.

I'd love to hear if there are objections to adding OpusEncoderConfig.frameSize right now, and whether anyone thinks we should also (or instead) prioritize adding AudioEncoderConfig.latencyMode.

bdrtc · 2022-09-02T02:42:05Z

Here are my proposals with some context.

Do not add AudioEncoderConfig.frameSize

frameSize is a well defined Opus concept, but it isn't generic enough to fit all codec use cases. For example, FLAC uses block size instead of frame size, and we would need to convert from milliseconds to a number of samples. AAC has a fixed frame size, but has some configurable amount of silent priming frames that affect encoder delay.

Add OpusEncoderConfig.frameSize

From @stefanholmer's comment, having a fine grained tuning knob to experiment with is desirable.

OpusEncoderConfig is already defined, and will likely be expanded when Fixed audio chunk size support #405 is resolved. Adding an extra field now shouldn't be an issue.

Developers that are already interested in advanced use cases will be able to handle the extra complexity.

Limited risk of breaking changes once this knob is introduced; we wouldn't remove it, and it directly exposes an parameter with no nuance.

The range of valid values will be defined in the codec registry; suggesting frame size for various scenarios might not be as immediately discoverable to developers as a latencyMode flag.

Adding this knob does not prevent us from adding an AudioEncoderConfig.latencyMode in the future.

Do not add AudioEncoderConfig.latencyMode (yet)

A latencyMode flag that mirrors the one we have for video decoders would offer some nice symmetry with VideoEncoderConfig.

latencyMode would choose between two sets of sensible defaults parameters for each codec (frameSize for OPUS, blockSize for FLAC, priming frames for AAC, etc).

The current language behind latencyMode for video is flexible, but we might need to explicitly choose default per-codec parameters to ensure cross-platform and cross-browser consistency. Choosing those defaults could be problematic.

This simple flag might abstract away a lot of details and nuance, which could ultimately surprise developers.

We can always add this flag later, if any usability complaints are surfaced.

I'd love to hear if there are objections to adding OpusEncoderConfig.frameSize right now, and whether anyone thinks we should also (or instead) prioritize adding AudioEncoderConfig.latencyMode.

we recommend use frameDuration/ptime in miiliseconds is more suitable, The parameter we need actually is duration per frame, this called ptime（packet time）in sdp rfc8866 6.4 ,
ptime is widely used in sip base voip system include webrtc(opus use ptime=10 as sdp attribute by default), and frameDuration is correspond to duration attribute in audioData struct，we think either one is ok and more understandable.

tguilbert-google · 2022-09-02T16:12:09Z

Thanks! I did intend frameSize to be in milliseconds, but never mentioned it. The Opus spec does mention frame duration.

Naming the parameter OpusEncoderConfig.frameDuration also seems good to me. It also clearly conveys a unit of time.

I am not an Opus expert though. If others have a strong preference for one or the other, both seem fine to me.

bdrtc · 2022-09-05T02:28:29Z

Thanks! I did intend frameSize to be in milliseconds, but never mentioned it. The Opus spec does mention frame duration.

Naming the parameter OpusEncoderConfig.frameDuration also seems good to me. It also clearly conveys a unit of time.

I am not an Opus expert though. If others have a strong preference for one or the other, both seem fine to me.

for opus encoder, the frameSize actually is encoder buffer size , it's determined by sampleRate channel numbers and frameDuration.
frameSize = numberOfChannels*sampleRate * frameDuration / 1000
here sampleRate for opus maybe: 8000, 12000, 16000, 24000, 48000, and this parameter exist in spec already.
and numberOfChannels is also exist in spec audioEncoderConfig.
frameDuation is in milliseconds here , maybe 2.5, 5, 10, 20, 40, 60 or 120 ms etc.
we can make a PR for this :>

tguilbert-google · 2022-09-17T00:13:51Z

From TPAC: No pushback on this issue. Adding frameDuration seems good. I would let the spec editors adopt this PR.

chcunningham mentioned this issue Sep 29, 2021

AudioEncoder, AudioDecoder, Serialize to JSON, Deserialize JSON to EncodedAudioChunk, Play Music chcunningham/wc-talk#2

Closed

chcunningham added the extension Interface changes that extend without breaking. label Oct 11, 2021

onthegit mentioned this issue Jun 29, 2022

How to properly copy and restore AudioData? #502

Closed

chcunningham added the TPAC 2022 Issues to discuss in upcoming TPAC meeting label Jul 27, 2022

bdrtc mentioned this issue Sep 5, 2022

Add frameDuration attribute to OpusEncoderConfig #551

Merged

tguilbert-google mentioned this issue Sep 14, 2022

opus configuration (e.g. in-band fec) #244

Closed

tguilbert-google closed this as completed in #551 Oct 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AudioEncoderConfig.latencyMode (or similar) #371

AudioEncoderConfig.latencyMode (or similar) #371

chcunningham commented Sep 29, 2021

stefanholmer commented Feb 8, 2022

tguilbert-google commented Sep 1, 2022 •

edited

bdrtc commented Sep 2, 2022

Do not add AudioEncoderConfig.frameSize

Add OpusEncoderConfig.frameSize

Do not add AudioEncoderConfig.latencyMode (yet)

tguilbert-google commented Sep 2, 2022 •

edited

bdrtc commented Sep 5, 2022 •

edited

tguilbert-google commented Sep 17, 2022

AudioEncoderConfig.latencyMode (or similar) #371

AudioEncoderConfig.latencyMode (or similar) #371

Comments

chcunningham commented Sep 29, 2021

stefanholmer commented Feb 8, 2022

tguilbert-google commented Sep 1, 2022 • edited

Do not add AudioEncoderConfig.frameSize

Add OpusEncoderConfig.frameSize

Do not add AudioEncoderConfig.latencyMode (yet)

bdrtc commented Sep 2, 2022

Do not add AudioEncoderConfig.frameSize

Add OpusEncoderConfig.frameSize

Do not add AudioEncoderConfig.latencyMode (yet)

tguilbert-google commented Sep 2, 2022 • edited

bdrtc commented Sep 5, 2022 • edited

tguilbert-google commented Sep 17, 2022

tguilbert-google commented Sep 1, 2022 •

edited

tguilbert-google commented Sep 2, 2022 •

edited

bdrtc commented Sep 5, 2022 •

edited