generate downmixed stereo buffer for Windows #668

theeternalsw0rd · 2025-07-27T20:07:49Z

Always feed cava a buffer of at most 2 channels, regardless of how many channels the output device uses. There may still be edge cases that need to be covered where the code has the comment // Unsupported format, handle error (line 405). This handles 16-bit PCM multichannel, 32-bit PCM multichannel, and 32-bit floating-point multichannel.
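For readers unfamiliar with the idea, here is a minimal sketch of downmixing interleaved 32-bit float frames to 16-bit stereo. It is not the code from this PR: the function names are hypothetical, and the mixing rule (channel 0 to left, channel 1 to right, every remaining channel mixed equally into both, then normalized) is an assumption chosen for simplicity rather than a proper surround downmix matrix.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: clamp a float sample to [-1, 1] and scale to s16. */
static int16_t float_to_s16(float v) {
    if (v > 1.0f)
        v = 1.0f;
    if (v < -1.0f)
        v = -1.0f;
    return (int16_t)(v * 32767.0f);
}

/*
 * Hypothetical sketch, not the PR's implementation.
 * in:  interleaved float frames with `channels` samples per frame
 * out: interleaved stereo s16, must hold 2 * frames samples
 * Assumption: channels 0 and 1 are front left/right; all others are
 * shared equally between both outputs.
 */
void downmix_f32_to_stereo_s16(const float *in, size_t frames, unsigned channels,
                               int16_t *out) {
    assert(channels >= 2);
    /* Normalize by the number of inputs summed into each output channel. */
    float norm = 1.0f / (1.0f + (float)(channels - 2));
    for (size_t f = 0; f < frames; f++) {
        const float *frame = in + f * channels;
        float shared = 0.0f;
        for (unsigned c = 2; c < channels; c++)
            shared += frame[c];
        out[2 * f] = float_to_s16((frame[0] + shared) * norm);
        out[2 * f + 1] = float_to_s16((frame[1] + shared) * norm);
    }
}
```

With 2 input channels the normalization factor is 1, so stereo input passes through unchanged apart from the float to s16 conversion.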


theeternalsw0rd · 2025-07-28T02:01:37Z

After checking on my second computer, I saw that 24-bit seems to be common as well, so I added downmixing for that format.

In further testing there was inconsistency if cava was not fed 16-bit, so I added conversion for mono and stereo 24 and 32-bit as well.
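As an illustration of what a 24-bit to 16-bit conversion involves, here is a minimal sketch. It assumes packed little-endian samples of 3 bytes each; devices that deliver 24-bit audio in 32-bit containers would need different handling. The function names are hypothetical and this is not the code from this PR.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: decode one packed little-endian 24-bit sample
 * and keep its 16 most significant bits. */
int16_t s24le_to_s16(const uint8_t *b) {
    int32_t v = (int32_t)b[0] | ((int32_t)b[1] << 8) | ((int32_t)b[2] << 16);
    if (v & 0x800000)
        v -= 0x1000000; /* sign-extend from 24 bits */
    /* Drop the low byte without relying on right-shifting a negative value. */
    int32_t low = v & 0xFF;
    return (int16_t)((v - low) / 256);
}

/* Hypothetical sketch: convert `samples` packed 24-bit samples (any channel
 * layout, since samples are converted one by one) to s16. */
void convert_s24le_to_s16(const uint8_t *in, size_t samples, int16_t *out) {
    assert(in != NULL && out != NULL);
    for (size_t i = 0; i < samples; i++)
        out[i] = s24le_to_s16(in + 3 * i);
}
```

Truncating the low byte is the simplest choice; dithering or rounding would be alternatives if the lost precision mattered for the visualizer.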

Another thing I learned is that Dolby Atmos, DTS:X, and basically anything with spatial processing enabled is not available via loopback. Getting that to work would require either developing an audio driver to ship with cava or using third-party software to handle audio mixing and virtual cables.

As it stands right now, cava freezes if fed Dolby Atmos. It seems to handle a failure properly and print that the device is not supported but then locks up. I'll see if I can determine more at a later time.

karlstav · 2025-07-30T07:29:05Z

hi @theeternalsw0rd.

A lot of stuff going on here I can see! I'm afraid I don't have time to go through it in detail, but from browsing through it, it looks like you have added a lot of conversion code that I don't understand. cava is supposed to handle s8, s16, s24 and float, in both mono and stereo. But you have added stuff like "convert_mono_f32_to_s16". Maybe the mono processing in cava is not working?

If cava is not working when not fed 16 bit stereo, then that needs to be fixed in cava.

down/up mixing:

https://github.com/karlstav/cava/blob/master/input/common.c#L21

stereo/mono handling is happening around here:

https://github.com/karlstav/cava/blob/master/cavacore.c#L362

theeternalsw0rd · 2025-07-30T18:10:29Z

I've pulled the stereo and mono conversion out. It turns out that my previous refactor, which resolved the issue of the graph freezing before reaching zero level when going from audible audio to silence, also resolved the issue with non-16-bit audio. Quite bizarre.

karlstav · 2025-07-31T18:49:36Z

input/winscap.c

+
+	write_to_cava_input_buffers(silent_channels, (unsigned char *)silence, audio);
+	pCapture->lpVtbl->ReleaseBuffer(pCapture, numFramesAvailable);
+	pCapture->lpVtbl->GetNextPacketSize(pCapture, &packetLength);


is it maybe just packetLength here, without the &?

or maybe better remove the * from the write_silent_frame packetLength function argument and remove the & when calling the function.
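To make the pointer-level mismatch concrete: when packetLength is already a UINT32 * parameter, writing &packetLength produces a UINT32 **, which is not what GetNextPacketSize expects. The sketch below shows the first option (forwarding the pointer unchanged) using a mock in place of the WASAPI call so it compiles anywhere; the mock, its return value of 480, and the function names are hypothetical.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for IAudioCaptureClient::GetNextPacketSize,
 * which writes the packet size through a pointer to a 32-bit unsigned. */
static void mock_get_next_packet_size(uint32_t *frames_out) {
    *frames_out = 480; /* arbitrary value for illustration */
}

/* packet_length is already a pointer, so it is forwarded as-is.
 * Passing &packet_length here would hand over a uint32_t ** instead. */
void write_silent_frame_sketch(uint32_t *packet_length) {
    assert(packet_length != NULL);
    mock_get_next_packet_size(packet_length);
}
```

Note that if the parameter were changed to a plain value, the write would only update the function's local copy, so the caller would have to query the next packet size itself.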

theeternalsw0rd · 2025-07-31T19:21:56Z

That’s probably it. I think I already corrected that in another method using the former solution. I’ll make sure it’s consistent using the latter.

@karlstav commented on this pull request, in input/winscap.c:

@@ -132,6 +304,44 @@ struct {
     {AUDCLNT_E_UNSUPPORTED_FORMAT, L"Requested sound format unsupported"},
 };
+void write_silent_frame(struct audio_data *audio, IAudioCaptureClient *pCapture,
+                        UINT32 numFramesAvailable, UINT32 *packetLength) {
+	// Send one silent frame to the spectrometer
+	int silent_channels = audio->channels;
+	int silent_bytes = silent_channels * sizeof(int16_t); // 16-bit PCM
+	int16_t silence[2] = {0};
+
+	write_to_cava_input_buffers(silent_channels, (unsigned char *)silence, audio);
+	pCapture->lpVtbl->ReleaseBuffer(pCapture, numFramesAvailable);
+	pCapture->lpVtbl->GetNextPacketSize(pCapture, &packetLength);

theeternalsw0rd · 2025-07-31T22:18:03Z

It seems 20-bit audio is also used. I've set up a separate branch for that since I cannot test it currently. I'll try to locate some hardware to test on. I think the virtual cable I have at work supports it, so I can check that tomorrow. The changes for the use of packetLength have been pushed.


theeternalsw0rd · 2025-08-01T16:25:32Z

I did find a device that supports 20-bit audio, but that format is not selectable in Windows 11 sound settings. Windows itself may have removed support completely, 20-bit may only be available via direct hardware access, or the audio driver may not be properly reporting supported formats to Windows. I will not worry about it at this point. It's available on my repo under the 20-bit branch if anyone should need it. I am assuming anything with 20-bit support also supports other formats.

karlstav · 2025-08-01T19:56:31Z

just use the .clang-format file to make the linter happy. don't worry about the commits, I will squash them when i merge

theeternalsw0rd · 2025-08-01T23:00:27Z

Hopefully the linter is happy now. I made the handling of overflowing lines as consistent as possible across the board too.

theeternalsw0rd · 2025-08-02T16:25:01Z

Sorry, didn't realize the linter failure actually generates a patch that it would be happy with. I'll get that applied. I guess the reason for the inconsistent long line formatting is the linter itself being at fault.

theeternalsw0rd · 2025-08-02T16:41:05Z

The linter supplied patch has been applied. Hopefully now it should have no reason to complain.

add stereo downmixing for Windows to apply if audio device is greater…

798b4f6

… than 2 channels

theeternalsw0rd mentioned this pull request Jul 27, 2025

Logitech Pro X (and 2) Wireless output exceed channel limits #664

Closed

add 24-bit downmix and conversion for stereo and mono

8f63267

theeternalsw0rd added 2 commits July 29, 2025 14:55

mono should only ever use mono_buffer

bdef09f

refactored to debug an issue and the problem went away, go figure

09c57ac

refactoring also resolved issues with non-16-bit audio. weird

f31ca5b

karlstav reviewed Jul 31, 2025


theeternalsw0rd added 2 commits July 31, 2025 17:41

Merge remote-tracking branch 'upstream/master'

2675d8f

correct usage of packetLength in method calls and definitions

f29f65e

add some more error handling to eliminate warnings shown by Visual St…

08d984e

…udio

make linter happy and handling of long lines consistent

a141430

apply linter supplied patch

5d6b788

karlstav merged commit 1f8c348 into karlstav:master Aug 3, 2025
6 checks passed
