pipewire

mirror of https://gitlab.freedesktop.org/pipewire/pipewire.git synced 2025-10-31 22:25:38 -04:00

Author	SHA1	Message	Date
Wim Taymans	0c8f803d59	audioconvert: optimise f32d to/from s16s conversions They are mostly used for sending and receiving RTP so it might be worth to make them a bit faster.	2025-01-24 10:37:02 +01:00
Roman Lebedev	7c40cafa7c	audioconvert: avoid even more precision loss in F32 to S32 conversion This is somewhat similar to the S32->F32 conversion improvements, but here things a bit more tricky... The main consideration is that the limits to which we clamp must be valid 32-bit signed integers, but not all such integers are exactly losslessly representable in `float32_t`. For example it we'd clamp to `2147483647`, that is actually a `2147483648.0f`, and `2147483648` is not a valid 32-bit signed integer, so the post-clamp conversion would basically be UB. We don't have this problem for negative bound, though. But as we know, any 25-bit signed integer is losslessly round-trippable through float32_t, and since multiplying by 2 only changes the float's exponent, we can clamp to `2147483520`! The algorithm of selection of the pre-clamping scale is unaffected. This additionally avoids right-shift, and thus is even faster. As `test_lossless_s32_lossless_subset` shows, if the integer is in the form of s25+shift, the maximal absolute error is finally zero. Without going through `float`->`double`->`int`, i'm not sure if the `float`->`int` conversion can be improved further.	2024-06-27 19:41:20 +03:00
Roman Lebedev	f4c89b1b40	audioconvert: avoid even more precision loss in S32 to F32 conversion There's really no point in doing that s25_32 intermediate step, to be honest i don't have a clue why the original implementation did that \_(ツ)_/¯. Both `S25_SCALE` and `S32_SCALE` are powers of two, and thus are both exactly representable as floats, and reprocial of power-of-two is also exactly representable, so it's not like that rescaling results in precision loss. This additionally avoids right-shift, and thus is even faster. As `test_lossless_s32_lossless_subset` shows, if the integer is in the form of s25+shift, the maximal absolute error became even lower, but not zero, because F32->S32 still goes through S25 intermediate. I think we could theoretically do better, but then the clamping becomes pretty finicky, so i don't feel like touching that here.	2024-06-27 19:41:20 +03:00
Roman Lebedev	c517865864	audioconvert: somewhat avoid precision loss in S32 to F32 conversion At the very least, we should go through s25_32 intermediate instead of s24_32, to avoid needlessly loosing 1 LSB precision bit. That being said, i suspect it's still not doing the right thing. Why are we silently dropping those 7 LSB bits? Is that really the way to do it?	2024-06-27 19:41:20 +03:00
Roman Lebedev	175d533b56	audioconvert: somewhat avoid precision loss in F32 to S32 conversion At the very least, we should go through s25_32 intermediate instead of s24_32, to avoid needlessly loosing 1 LSB precision bit. FIXME: the noise codepath is not covered with tests.	2024-06-27 19:41:20 +03:00
Wim Taymans	1ae4374ccf	Fix compilation with -Werror=float-conversion Better make the conversions explicit so that we don't get any surprises. Fixes #4065	2024-06-18 12:17:56 +02:00
Wim Taymans	7ecea07a63	audioconvert: use alternative store to avoid ASAN errors See #3572	2023-10-16 12:59:41 +02:00
Wim Taymans	16ad067cc9	audioconvert: use spa_write_unaligned Use a macro to write out unaligned data to avoid ASAN errors. See #3572	2023-10-16 12:21:33 +02:00
Wim Taymans	fdc1391b19	audioconvert: avoid unaligned reads using memcpy See #3572	2023-10-15 22:09:44 +02:00
Wim Taymans	20b336b1d7	audioconvert: avoid unaligned writes See #3572	2023-10-15 21:03:52 +02:00
Wim Taymans	f3942fca29	audioconvert: fix unaligned writes Use 16 bits writes for 16 bits samples or else we might be doing unlaigned writes. See #3572	2023-10-15 19:10:45 +02:00
Barnabás Pőcze	8c17a6626d	treewide: mark some functions `static` These were found by enabling the "missing-declarations" warning.	2023-07-03 19:40:31 +02:00
Barnabás Pőcze	934ab3036e	treewide: use SPDX tags to specify copyright information SPDX tags make the licensing information easy to understand and clear, and they are machine parseable. See https://spdx.dev for more information.	2023-02-16 10:54:48 +00:00
Wim Taymans	a145c42ec4	audioconvert: make separate noise functions So that we can reuse optimized versions in unoptimized noise functions. Do allocation a little different so that we can align everything from the start.	2022-09-29 21:36:06 +02:00
Wim Taymans	14028c67cd	audioconvert: some optimizations	2022-09-29 21:36:06 +02:00
Wim Taymans	c743f4936a	audioconvert: use calculated random value Use the calculated random value instead of making a new one.	2022-09-29 09:27:28 +02:00
Wim Taymans	ada39f3048	audioconvert: improve noise bits Make a new noise method called PATTERN and use it to add a slow (every 1024 samples) repeating pattern of -1, 0. Only use this method when we don't already use triangular dither. See #2540	2022-07-18 11:41:57 +02:00
Wim Taymans	a4db745a7e	audioconvert: improve noise shaping Reorganize things a little so we can add more noise shapers. Add sloped triangular noise. Add wanamaker3 noise shaping.	2022-07-15 12:36:15 +02:00
Wim Taymans	0ba3e7c5db	audioconvert: round instead of truncate, to reduce distortion See #2543	2022-07-13 20:56:13 +02:00
Wim Taymans	7745346292	audioconvert: add sse2 s16 dither functions	2022-07-12 10:34:13 +02:00
Wim Taymans	c31928c5f0	audioconvert: add and use CLAMP macros	2022-07-12 10:33:37 +02:00
Wim Taymans	c35006f040	audioconvert: move scaling to setup	2022-07-11 17:50:20 +02:00
Wim Taymans	68f883ff77	audioconvert: fix dither scale Rectangular dither should be [-0.5, 0.5] Triangular dither should be [-1.0, 1.0] Noise should add extra bits.	2022-07-11 17:19:28 +02:00
Wim Taymans	e313149f7f	audioconvert: improve SSE2 dither generation	2022-07-11 16:41:12 +02:00
Wim Taymans	277addcca6	audioconvert: add triangular dither	2022-07-11 16:28:51 +02:00
Wim Taymans	dd1d5960b4	audioconvert: implement f64s Add swapping functions for f64s. Fix the awkward interleave/deinterleave names for 32s.	2022-07-11 10:58:51 +02:00
Wim Taymans	0343e0da73	fmt-ops: fix some missing shifts and min/max	2022-07-07 18:57:03 +02:00
Wim Taymans	b8a4bf880f	audioconvert: use 24 bits for 32 formats The float only preserves 24 bits so use this. Otherwise we get overflows and errors in clang.	2022-07-07 18:38:32 +02:00
Wim Taymans	22317da685	audioconvert: tweak conversion constants Tweak the conversion constants a bit so that they handle the extreme ranges a bit better. Align the C and vector instructions. Reactivate the unit test asserts when a conversion fails.	2022-07-05 12:20:02 +02:00
Wim Taymans	817d5bd7a4	audioconvert: simplify 24 bits handling Make a new uint42_t and int24_t type and use that to handle 24 bits samples. This makes it easier because we can iterate and copy the structs like other types.	2022-07-01 12:25:58 +02:00
Wim Taymans	06b1cf8663	audioconvert: implement (de)interleave with existing functions	2022-06-30 18:24:05 +02:00
Wim Taymans	938f2b123e	audioconvert: improve format conversion Make dither noise as a value between -0.5 and 0.5 and add this to the scaled samples. For this, we first need to do the scaling and then the CLAMP to the target depth. This optimizes to the same code but allows us to avoid under and overflows when we add the dither noise. Add more dithering methods. Expose a dither.method property on audioconvert. Disable dither when the target depth > 16.	2022-06-29 14:10:15 +02:00
Wim Taymans	d23b96b033	audioconvert: fix some compiler warnings	2022-06-29 14:08:30 +02:00
Wim Taymans	abe87f8008	audioconvert: use unaligned loads We collect 4 channels from the source. When the number of channels is not a multiple of 4, we need to do unaligned reads. See #2221	2022-03-17 10:02:07 +01:00
Barnabás Pőcze	0f28309f07	audioconvert: fix sse2 deinterleave_32 Fixes #2213	2022-03-16 10:54:27 +01:00
Wim Taymans	734470f0cd	audioconvert: add SSE (de)interleave_32(s) versions	2022-03-11 15:02:02 +01:00
Wim Taymans	a520335581	fmt-ops: add sse2 version of f32 to s16 planar/interleaved	2020-07-03 11:05:26 +02:00
Wim Taymans	3a911dfe3b	fmt-ops: add avx2 optimized version Only one optimized version but the sse2 version are compiled with the avx2 flags so that they get optimized better.	2020-03-16 16:11:29 +01:00
Wim Taymans	fd3dd14ebe	fmt-ops: add optimized f32 to s16 stereo conversion	2020-03-16 13:04:21 +01:00
Wim Taymans	0548fbf690	fmt-ops: fix bug in s32 to f32 conversion	2020-03-16 12:57:21 +01:00
Wim Taymans	cb7bfdf98a	sprinkly SPA_LIKELY/UNLIKELY around	2020-03-16 12:52:28 +01:00
Wim Taymans	3c9aa2e96c	fmt: add sse2 optimized s32->f32d	2019-10-24 11:06:04 +02:00
Wim Taymans	ae3e5787af	audioconvert: unroll some loop	2019-04-10 12:53:25 +02:00
Wim Taymans	d8e399dee9	audioconvert: pass state to functions Pass some state to convert and channelmix functions. This makes it possible to select per channel optimized convert functions but also makes it possible to implement noise shaping later. Pass the channelmix matrix and volume in the state. Handle specialized 2 channel s16 -> f32 conversion	2019-03-29 17:39:59 +01:00
Wim Taymans	e7ef13e310	audioconvert: add avx optimizations	2019-03-28 16:45:57 +01:00
Wim Taymans	7f041f4098	audioconvert: improve benchmark Also include the simd versions in the benchmark Fix some issues found by new test	2019-03-28 13:26:06 +01:00
Wim Taymans	c8d3d475bb	audioconvert: handle more optimizations Compile an optimized library for the given CPU with the right flags, then link it with the main library.	2019-03-27 17:58:48 +01:00
Wim Taymans	ecc3e78ad4	fmt-ops: use faster f32 -> s32 conversion	2019-03-26 17:24:14 +01:00
Wim Taymans	c9bd95ae5b	audioconvert: small optimization	2019-03-20 15:57:13 +01:00
Wim Taymans	b265081d6d	audioconvert: use macro to transpose	2019-03-20 13:37:20 +01:00

1 2

56 commits