-
Notifications
You must be signed in to change notification settings - Fork 313
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
100% cpu usage with I2C audio hats #491
Comments
Are you using 32bit or 64bit Raspberry Pi OS? Can you please post your Direwolf configuration? Can you also please try the newest version of Direwolf from the Dev branch as Dev-A is rather old now |
Hey David, 32bit Bullseye thanks, |
Using "-n 2" on the direwolf command line with no other options and no config file fixes the problem. cpu usage is normal (17%) related unresolved thread, which also uses an i2c sound device , |
Generally speaking, Direwolf really wants a configuration file to read so I recommend you enable that. Without specifying the "-n 2" item, can you copy/paste in what you're seeing when you start Direwolf from the command line and we can see what it's assuming? |
replicated on Buster, direwolf 1.6 |
MYCALL KM6LYW-2 export ALSA_CARD=0 Dire Wolf version 1.6 Reading config file /run/direwolf.tnc.conf Now connected to IGate server noam.aprs2.net (44.25.16.4) [ig] # aprsc 2.1.14-g5e22b37 |
Using "-r 22000" also fixes the 100% cpu thread.
Channel 0: 1200 baud, AFSK 1200 & 2200 Hz, E+, 22000 sample rate. edit: anything but 44100 and 48000 seems to keep direwolf from consuming an entire cpu core |
Interesting.. btw, the Fe-Pi now the wb7fhc Nexus DR-X sound hat offers more sampling rates than most sound devices. You can get a full list of it's sampling rates by running: sudo lsusb -vv | grep -e Audio -e tSamFreq | grep -v -e Descriptor -e bInterfaceClass Anyway.. it's strange that the common sampling rates create high load for you on this truly stereo input / output sound device. We will have to wait to hear back from WB2OSZ on this one. |
Thanks David, this isn't a USB device (I2C device), and yah, it has something to do with the supported hardware rates for sure. 32K seems to be a hardware rate as well Running stack trace on the audio thread is even more revealing, During normal cpu usage (44001 for example) we see device polling During high cpu usage (44100, 48000, 32000) we don't see the poll call at all, and we see a nanosleep instead |
One thing I noticed here is that the direwolf startup output is using the "default" audio device which is a PulseAudio device which I don't recommend. PulseAudio can do all kinds of unwanted audio processing things to the audio as well as doing silent resampling for non-native sound device sampling rates which will increase CPU load. In your direwolf.conf file, instead try specifying a native ALSA device name such as "ADEVICE plughw:1,0". Btw, also in your startup, I see you're using the shell environment variable "export ALSA_CARD=0" but I don't believe Direwolf uses that at all. If you're doing other ALSA tricks using DMIX, etc. please disable all of this and retest to see if that might be impacting Direwolf. Ultimately, Direwolf is a ALSA-native application and while it works, it's not a PulseAudio or PIpewire native application. |
Understood. No Pulse Audio -- ever. libasound2 honors "ALSA_CARD", so direwolf does by extension. I get the same result when specifying an audio device in direwolf.conf (ADEVICE plughw:0,0) no dmix afaik |
Increasing frames per period in audio.c resolves the issue.
Unintended consequences, besides latency, unknown. ./src/direwolf -t 0 -r 44100 Reading config file direwolf.conf
|
I don't have one of these devices so I can't do any experiments. All I know is what is found here. https://github.com/NexusDR-X/nexus-audio/blob/main/README.md Your test results are very perplexing. 44000 works (14% cpu) direwolf treats the sound device very simplistically. I picked the default of 44100 because that is the CD rate and seems to be supported by all hardware. The USB audio devices also do 48000, which I believe is the DVD rate. I don't know the internals of the audio device driver, but one might reasonably expect that it would instruct the hardware to operate at the requested sample rate if available. That would be most efficient with no rate conversion necessary. If the desired rate is not available in hardware, it would pick some higher rate, possibly an integer multiple or at least a rational number, if possible, to make rate conversion easier/more efficient. That's not what we see at all in the test results. It could be something weird going on in the device driver. I added a hack, oops, I mean heuristic, for ARM processors on the assumption they would be much much slower than x86 machines. Downsampling/decimation by a factor of 3 is applied if the sample rate is much higher than needed for good AFSK decoding. Nothing fancy; it just averages 'n' incoming samples together. An old single core RPi model 1 is more than adequate with this little trick and the user not having to do anything special. You will see this change to -D2 or disapear as the incoming rate decreases. Trying to use ALSA to map two independent virtual mono audio channels adds another layer of complexity and strange possible cases. What happens if the two derived virtual channels want different rates. Would they fight over the hardware rates or it just use 96000 and burn up CPU resources downsampling it. The buffer size is a tradeoff. The specific direwolf version shouldn't matter. I don't think the audio input processing has been touched for a long time. You mentioned bullseye. Have you tried bookworm. direwolf compiles OK (newer compile usually means new warnings) on the 64 bit version on an RPi 3 and passes the self tests. That's as far as I got. Maybe it has a better I2C audio driver. Steve Magnuson, AG7GN should be brought into this conversation. 73, |
Thanks for the info. If we're dividing sample rate by 3 on ARM, is there a downside to simply specifying a rate of "14700"? That yields 10% cpu on a Pi3 and would not introduce latency. New observation, 90 percent of CPU time is spent in a system call (not user time) when using 44100/48000. I'm still curious about nanosleep vs polling in the audio thread. Nanosleep is only seen when using 44100/4800 rates. We see poll() otherwise. It's like nanosleep doesn't sleep. |
Bookworm observations. 32bit/lite, Direwolf 1.6, Linux 6.1.0 CPU usage is much better than bullseye, but significantly higher when using 44100 or 48000 44101 12% (10.0user, 2.3sys) time direwolf -r44100 time direwolf -r 44101 Like with bullseye we still see nanosleeps when using 44100/48000, and we see polling with any other rate strace -Ttf direwolf -r 44101 Bullseye invokes clock_nanosleep_time64() |
dbgsym version of alsa-libs compiled, this is where the direwolf process is spending time, still need to see which driver the "sys" time is being spent... 22.82% direwolf direwolf [.] demod_afsk_process_sample |
time being spent in snd_pcm driver, function snd_pcm_stream_unlock_irq()
|
patch for audio.c, testing okay at this point. Bumping up the frames per period seems alleviate the snd_pcm driver problem with a minimal increase in latency. In fact, the buffer/period is similar to what it comes up with for a USB/ic705 device. CPU usage goes from 100% to 16% on Raspberry Pi Zero 2W with this patch. pi@digipi4:~/git/direwolf/clean/direwolf/build $ ./src/direwolf -t 0 -r 48000 Reading config file direwolf.conf pi@digipi4:~/git/direwolf/direwolf/build $ sudo cat /proc/asound/card0/pcm0p/sub0/hw_params audio.c
|
I'm observing direwolf with 100% cpu usage when using I2C audio boards, specifically
observed on Pi 3A+, Pi Zero2W, Raspberry Pi OS bullseye
not observed on the original Pi Zero.
not observed with USB audio dongle
arecord and mpg123 do not generate significant cpu usage with the above i2c boards
observed on direwolf 1.6 and 1.7-dev-A
-craig
KM6LYW
The text was updated successfully, but these errors were encountered: