Speech gate #48

pragmatrix · 2025-08-01T14:48:06Z

Azure speech detection seems to be very sensitive. It detects very faint speech which makes it to also detect echos from speakers, for example when speech detection is active and someone else speaks on the other line.

This PR adds a speech gate that is loosely configured to dampen low volume audio, noise and also whispering sounds. The algorithm was mostly done by Claude Sonnet 3.7 and the parameterization was derived from iterating on two samples (normal speech and echo speech).

Copilot

Pull Request Overview

This PR implements a "speech gate" to address Azure speech detection sensitivity issues by filtering out low-volume audio, noise, and echo detection. The gate uses configurable parameters to dampen unwanted audio while preserving normal speech.

Key changes:

Adds a new speech gate module with RMS-based filtering algorithm
Integrates the speech gate into Azure transcription service
Includes a standalone test utility for audio processing validation

Reviewed Changes

Copilot reviewed 8 out of 8 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
core/src/speech_gate.rs	Implements the main speech gate algorithm with multiple variants (hard/soft/RMS-based)
core/src/lib.rs	Exports the speech gate module
src/lib.rs	Exports the speech gate processor function
services/azure/src/transcribe.rs	Integrates speech gate into Azure transcription pipeline
filter-test/	Adds a standalone CLI tool for testing speech gate on audio files
Cargo.toml	Adds fundsp dependency and filter-test workspace member

Comments suppressed due to low confidence (1)

filter-test/Cargo.toml:4

The Rust edition "2024" does not exist. The latest stable edition is "2021". Change this to "2021".

edition = "2024"

core/src/speech_gate.rs

filter-test/src/main.rs

Co-authored-by: Copilot <[email protected]>

pragmatrix added 9 commits July 31, 2025 16:51

Intermediate commit

f3d54d9

Add a speech gate processor

f664e8c

Speech gate hard and soft

518a64a

filter-test: Support processing multiple files

654cd51

speech gate: Use energy instead of absolute values

07d5a29

Collect reasonable values for the speech gate processor (soft variant)

4e4097e

wrap

132c21a

Add soft rms speech gate

4bd34a1

Integrate speech gate processor into azure speech detection

8452a09

pragmatrix force-pushed the speech-gate branch from 3ceafc3 to 8452a09 Compare August 1, 2025 14:50

pragmatrix marked this pull request as ready for review August 1, 2025 14:50

pragmatrix requested a review from Copilot August 1, 2025 14:50

Copilot AI reviewed Aug 1, 2025

View reviewed changes

core/src/speech_gate.rs Show resolved Hide resolved

filter-test/src/main.rs Show resolved Hide resolved

filter-test/src/main.rs Outdated Show resolved Hide resolved

pragmatrix and others added 3 commits August 1, 2025 16:55

Update filter-test/src/main.rs

a02cff9

Co-authored-by: Copilot <[email protected]>

Remove fundsp dependency

4267d6d

Bump versions

d499752

pragmatrix merged commit 93998af into master Aug 1, 2025
6 checks passed

pragmatrix deleted the speech-gate branch August 1, 2025 15:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Speech gate #48

Speech gate #48

Uh oh!

pragmatrix commented Aug 1, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Speech gate #48

Speech gate #48

Uh oh!

Conversation

pragmatrix commented Aug 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pragmatrix commented Aug 1, 2025 •

edited

Loading