Windows audio drivers development

jotalanusse · March 18, 2024, 7:29pm

Hello everyone! I’m working on a project where I need to capture input from a computer’s microphone. I want to process this input to merge it with another audio stream. This way, I can play the combined audio as if it’s coming directly from the user’s microphone.

So far, I’ve attempted to capture microphone input, merge it with the other audio stream, and then output it using virtual cables. However, we’re encountering significant latency issues. Our product is built in C#, and we currently use NAudio for our audio processing. Despite trying every available API, we’re struggling to reduce the input delay.
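
As a rough illustration of the pipeline described above (not the actual NAudio code), the mixing step and the delay added by each buffer can be sketched in C++. The names `MixBuffers` and `BufferLatencyMs` are hypothetical, and the sketch assumes both streams are already 32-bit float PCM with the same sample rate and channel layout. It needs C++17 for `std::clamp`.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Mix `extra` into `mic` sample by sample and clamp the sum to [-1, 1].
// Assumption: both buffers hold float PCM in the same format.
std::vector<float> MixBuffers(const std::vector<float>& mic,
                              const std::vector<float>& extra,
                              float extraGain) {
    std::vector<float> out(mic.size());
    for (std::size_t i = 0; i < mic.size(); ++i) {
        // If the extra stream runs out, pass the microphone through unchanged.
        const float e = i < extra.size() ? extra[i] * extraGain : 0.0f;
        out[i] = std::clamp(mic[i] + e, -1.0f, 1.0f);
    }
    return out;
}

// Delay contributed by one buffer of `frames` frames at `sampleRate` Hz.
double BufferLatencyMs(std::size_t frames, double sampleRate) {
    assert(sampleRate > 0.0);
    return 1000.0 * static_cast<double>(frames) / sampleRate;
}
```

Every buffering stage in the chain (capture, the application's own mixing, the virtual cable, the consuming application) adds roughly its buffer duration, so a 480-frame buffer at 48 kHz accounts for 10 ms on its own, and several such stages in series add up quickly.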

After some research, I discovered that certain software solutions tackle this problem differently. They install a specific driver on the user’s microphone, which seemingly enables them to modify the microphone’s input by adding extra data or sounds. This approach eliminates the necessity of performing all the processing within the application itself. Instead, the driver handles receiving the additional stream and outputs it through the microphone.

At least, that is how it appears to work. I’ve been exploring various resources to understand how these software solutions accomplish this, and it appears that Windows Audio Processing Objects (APOs) might be what I’m searching for, although I’m not entirely certain yet.

I came here to ask the following questions:

Are Windows APOs the solution for this? Am I on the right track?

While I consider myself to be a fairly seasoned developer, I’m completely new to the world of Windows ecosystem development, especially regarding drivers. Any help or guidance would be greatly appreciated.

jotalanusse · March 20, 2024, 2:26pm

After further research, I found that another piece of software does indeed use an APO to accomplish what I’m trying to do. It appears they register an Endpoint Effect (EFX) APO on one or more microphones, which then enables them to modify the device’s input.

If anyone is familiar with audio driver development, I would greatly appreciate some guidance on how to start implementing something like this. Thanks!

Tim_Roberts · March 20, 2024, 7:21pm

There are sample APOs in the enormous SYSVAD sample driver. APOs are user-mode DLLs that run in the special Audio Engine process, and run in real-time. Usually, the trickiest part is getting them installed. Microsoft doesn’t really want independent third-party APOs at all; they believe an APO should be part of the hardware/driver pair, and should be installed with the driver.
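
Because this code runs in real time, the processing path generally cannot afford to block on locks or allocate memory. A minimal sketch of one common way to hand extra samples to such a path is a single-producer, single-consumer ring buffer. This is an assumption for illustration, not how any particular product does it: `SampleRing` is a hypothetical name, nothing here uses the real APO interfaces, and it only covers two threads in one process. A real APO lives in a different process from the application, so the hand-off would also need some inter-process transport, which is not shown.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <vector>

// Lock-free ring of float samples: one writer thread, one reader thread.
class SampleRing {
public:
    // Capacity must be a power of two so indices can be wrapped with a mask.
    explicit SampleRing(std::size_t capacityPow2)
        : buf_(capacityPow2), mask_(capacityPow2 - 1) {
        assert(capacityPow2 >= 2 && (capacityPow2 & mask_) == 0);
    }

    // Producer side: copies up to `count` samples, returns how many fit.
    std::size_t Write(const float* src, std::size_t count) {
        const std::size_t head = head_.load(std::memory_order_relaxed);
        const std::size_t tail = tail_.load(std::memory_order_acquire);
        const std::size_t space = buf_.size() - (head - tail);
        const std::size_t n = count < space ? count : space;
        for (std::size_t i = 0; i < n; ++i) buf_[(head + i) & mask_] = src[i];
        head_.store(head + n, std::memory_order_release);
        return n;
    }

    // Consumer side (the real-time path): adds whatever is available onto
    // the samples in `io`, in place. Never blocks and never allocates.
    std::size_t MixInto(float* io, std::size_t count) {
        const std::size_t tail = tail_.load(std::memory_order_relaxed);
        const std::size_t head = head_.load(std::memory_order_acquire);
        const std::size_t avail = head - tail;
        const std::size_t n = count < avail ? count : avail;
        for (std::size_t i = 0; i < n; ++i) io[i] += buf_[(tail + i) & mask_];
        tail_.store(tail + n, std::memory_order_release);
        return n;
    }

private:
    std::vector<float> buf_;
    const std::size_t mask_;
    std::atomic<std::size_t> head_{0};  // total samples ever written
    std::atomic<std::size_t> tail_{0};  // total samples ever consumed
};
```

If the ring runs dry, `MixInto` simply leaves the remaining microphone samples untouched, so an underrun degrades to plain pass-through rather than a stall.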

The cool audio driver kids all hang out in the [wdmaudiodev] list server at https://www.freelists.org/list/wdmaudiodev . Several very helpful members of the Microsoft Audio team monitor that list and respond to the trickier questions.

jotalanusse · March 21, 2024, 1:18am

Hi Tim, thanks for your response. I’ll definitely check out wdmaudiodev to see if anyone on there can help me out.
