Rawaudio dev#3653
Conversation
|
I'd prefer not to check for the Jamulus version number but rather based on capabilities - we don't have 4.0.0 out yet and it might break during the dev process. |
I wanted to reuse information already available as much as possible so I just added the code where there were version checks already implemented. (For sequence number and pan feature) |
|
Tested it and yes, the noise would be unacceptable. What is our fallback if max is selected but the server doesn't support it? |
I just noticed that if you connect to a server with Max selected you get the noise unless you switch audio quality again while connected. The server code is fine and doesn't need changes, I misplaced the check for my introduced bRawAudioSupported in the client code. I'll have a closer look |
|
Adding a slot to the client to reinit when it receives the server version seems to have fixed the noise issue. |
|
We still get crashes on windows, especially when using more coplex setups including audio routing software. Linux, Mac and android builds work fine so far. Sounds great but still needs more testing and fixes |
|
The last commits fixed the crash on windows and make the client fall back to opus reliably. This is now ready to be tested thoroughly. |
|
A buffersize of 256 on Max quality setting gives garbled audio and the packet sizes seem wrong and contain blocks of zeroes. Only that particular setting is affected. Opus still works |
I plan to try out this enhancement over the next few days. I've had a look through the diffs so far. Could you specify exactly the steps to produce this error? |
This is reproducible with a buffersize of 256 samples only. Edit: The packets become bigger than the MTU allows for on 256 samples buffersize and get fragmented once I corrected the calculation of the packet sizes. Does this mean we need to disable raw audio for buffersizes of 256 or is there some mechanism to receive fragmented packets? |
|
I've just tried a build of Using a buffer size of 10.67ms (256) results in each packet containing two frames of audio, each with its own sequence number. In that setting, I was seeing one packet every 10.67ms coming from the Windows client, but still one packet every 5.33ms coming back from the server. They alternated between having zeros in the first frame and zeros in the second frame. So it could possibly be some issue in Note that the client will encode according to the settings in the Client Settings dialog, but the server will encode according to the information in received in the Talking of which, the codec field in the Lines 484 to 492 in 849e823 So when sending props for raw encoding, it should either use |
This build is not taking into account |
|
Ah, so the issue is that the client is not sending enough data to satisfy the server, and the server is therefore adding in packets of zeros to maintain the data rate. Fragmentation should not be an issue, at least with IPv4, as fragmentation and re-assembly happens transparently at the IP layer. In fact, I don't think it will occur anyway, as the traffic from the server is not fragmented. We should just get packets from the client at 5.33ms instead of 10.67ms. In fact, I've been doing some tests with Wireshark of all the various data rates, qualities and mono/stereo, and it seems that the packet interval is normally half the buffer time specified in the Client Settings. Except when "Small buffers" is not checked, and then 2.67 (64) is exactly the same as 5.33 (128). |
Yes please - I'm building directly from your |
|
I think in I don't have any more time today to try it... |
| } | ||
|
|
||
| const int iOffset = iB * SYSTEM_FRAME_SIZE_SAMPLES * vecNumAudioChannels[iChanCnt]; | ||
| // Recognise a raw audio packet by its size |
There was a problem hiding this comment.
I think it would be better to recognise the audio frame by a sentinel byte. Protocol frames begin with 00 00 and must have a good checksum. Otherwise they are considered to be audio. Opus frames always begin with 00 for mono and 04 for stereo. So maybe for raw audio, the audio data could be prepended with a byte of f0 for mono and f4 for stereo? Then it could be recognised unambiguously. Both client and server need to recognise the format of a received frame correctly without relying on an out-of-band context.
There was a problem hiding this comment.
As far as I understood these sentinel bytes come from the opus codec itself and are not deliberately set by Jamulus as a message ID of sorts. I'd have to overwrite actual audio bytes for that to work with my code. Or am I wrong here?
|
I had misunderstood the packet size calculation and it seems fixed with the last commit. |
I think OPUS and OPUS64 only refer to the setting of small network buffers. It isn't related to the actual opus coding. |
Maybe - I hadn't got around to examining how the value was used in the code. It just felt wrong for the message to state OPUS when it wasn't, and maybe a specific value could also be useful to the server. |
The server isn't aware of the opus quality setting. It is only being sent the packet size and feeds that into the opus codec. There is currently no other way for the server than to determine the codec (or none) by the expected packet sizes for rawaudio. OPUS and OPUS64 refer to 128 or 64 samples internal buffering, no relations to audio quality settings. |
|
Just tried the latest build. It's looking good in Wireshark and sounding good too. No fragmentation either, as the max packet size is only 1068 for stereo, max quality, 10.67ms(256). |
|
Have you guys also tested with small network buffers? This seems to work as well for me and with a huge improvement on latency as well (as expected). Tested on arm64 ubuntu server side and win11 for the client. |
|
Hi Nils, I have some significant suggestions to give you, which are too involved for just review comments. I have created my own branch from your |
@dingodoppelt - https://github.com/softins/jamulus/tree/rawaudio-dev-softins is the link to my branch. What I've done so far:
I have built and tested it as server and client, and it seems to work as expected. I have other suggestions yet to be coded, but I thought you might like to see these first. |
That's great! Good to hear the code is in good hands now ;) |
|
In the meantime, I've just updated my Wireshark dissector to understand raw audio and the |
Perfect. That's way better than the version check. |
@softins |
I think of it more as a refactor because I was already using a protocol message, only a different one. This is rather a new mechanism for a feature check which was previously done via the server version message and I feel the other version checks should be refactored to use this message as well to save on messages being sent and to not have the same functionality split over different parts of the code. |
|
Using the version number to enable the rawaudio had a couple of drawbacks:
We could possibly turn the If we did introduce a Jamulus has always placed great importance on backward compatibility in both directions, and I don't think that will be able to change with the release of 4.0.0. I also don't think that can apply to those who have been using the 3.11.1 versions. The takeup of that was quick and enthusiastic, but those users would by their nature very easily act on a notice that they needed to update their client or server to use the new mechanism instead of the version check. We also shouldn't let the enthusiasm for an important new feature hasten us to set something in stone without taking the time to get it right for the long term. Refactoring / trimming-down of the Jamulus code could certainly happen in some places, but that can't be at the expense of backward compatibility. We can't remove protocol messages that older clients or servers rely on. |
|
As the author of Jamalizer i appreciate you guys giving a thought about visibility of raw audio servers, but i second @softins assumption that this will only be a temporary issue until the next major release when all (most) servers and clients update. Until then, i'm happy with parsing some more well-known raw audio versions to highlight them. After 4.0 it's redundant anyway and might be removed on Jamalizer because it no longer distinguishes servers. |
|
I echo @foobarth position and will look for other ways to highlight supported servers on jamulusjams.com if possible. On the server side is the intention to bake the PCM support in, or have a compile time option, or have a cli arg to enable it? |
443c92b to
bd7992e
Compare
|
Ok, this is now pushed and the old clients won't work on new servers and vice versa. I'd ask everybody involved to spread the word that there is a new client and server. |
|
Great news! I have a better idea than the welcome message. The way that the jamulus-php Explorer backend fetches the welcome message is to send a single frame of audio and to then wait for the CHAT_TEXT message. A new rawaudio server will send RAWAUDIO_SUPPORTED before the CHAT_TEXT. So tomorrow I will update the backend to listen out for RAWAUDIO_SUPPORTED and report its receipt with a new item in the JSON it returns. |
It will be a standard feature in the upcoming 4.0.0, always enabled, subject to testing of forward and backward compatibility. The intention is to get this feature into |
|
@dingodoppelt I forgot to push the changes I made to |
|
Although already mentioned that it was intentional, I do find it a bit counterintuitive for some types of users that an upgrade to 3.12.0dev from an older 3.11.0dev client will transparely fail to deliver the higher quality uncompressed audio on any server not running 3.12.0dev as well (even though there are technically quite alot of 3.11.1dev builds in the wild right now running and perfectly capable of supporting raw audio to a newer client. Is this really the intention? Isn't there a more elegant way to handle this? Is there any existing way for the server to push a message to a client that has been downgraded to high quality because of the missing protocol handshake at connection init? Some sort of alert that they need to upgrade their to achieve max quality on this server ? (Just thinking out loud)... I do agree with @softins that most of the early adopters are mostly staying tuned into the developments happening over here on this branch but it would still be "nice to have" if there was any easy way to implement? |
|
No. This never was an official release. Thus there's no backward compatibility requirement IMO. |
This now done and live. The JSON returned from https://explorer.jamulus.io/servers.php?query=host:port will include the item I've also updated Jamulus Explorer to show a banner above the welcome message if the server has advertised raw audio support as above. |
f9b0514 to
583c61b
Compare
Done! (After quite an intense time with git ;) |

Add a new "raw" audio quality setting
This PR adds uncompressed audio ("raw") to the quality settings so there is no Opus compression along the way
Discussion in #3654
This feature improves latency as well. I gained 2ms by using uncompressed audio while having a better audio quality.
CHANGELOG: Add uncompressed audio transmission - dedicated to the memory of Hans Petter Selasky (1982 - 2023)
Does this change need documentation? What needs to be documented and how?
Corresponding PR in jamulussoftware/jamuluswebsite #1133
Checklist