linux

mirror of https://github.com/torvalds/linux.git synced 2026-05-20 21:08:08 +02:00

Christophe Leroy 3af722cb73 powerpc/net: Implement powerpc specific csum_shift() to remove branch	2022-03-11 10:57:22 +00:00

Today's implementation of csum_shift() leads to branching based on
parity of 'offset'

	000002f8 <csum_block_add>:
	     2f8:	70 a5 00 01 	andi.   r5,r5,1
	     2fc:	41 a2 00 08 	beq     304 <csum_block_add+0xc>
	     300:	54 84 c0 3e 	rotlwi  r4,r4,24
	     304:	7c 63 20 14 	addc    r3,r3,r4
	     308:	7c 63 01 94 	addze   r3,r3
	     30c:	4e 80 00 20 	blr

Use first bit of 'offset' directly as input of the rotation instead of
branching.

	000002f8 <csum_block_add>:
	     2f8:	54 a5 1f 38 	rlwinm  r5,r5,3,28,28
	     2fc:	20 a5 00 20 	subfic  r5,r5,32
	     300:	5c 84 28 3e 	rotlw   r4,r4,r5
	     304:	7c 63 20 14 	addc    r3,r3,r4
	     308:	7c 63 01 94 	addze   r3,r3
	     30c:	4e 80 00 20 	blr

And change to left shift instead of right shift to skip one more
instruction. This has no impact on the final sum.

	000002f8 <csum_block_add>:
	     2f8:	54 a5 1f 38 	rlwinm  r5,r5,3,28,28
	     2fc:	5c 84 28 3e 	rotlw   r4,r4,r5
	     300:	7c 63 20 14 	addc    r3,r3,r4
	     304:	7c 63 01 94 	addze   r3,r3
	     308:	4e 80 00 20 	blr

Seems like only powerpc benefits from a branchless implementation.
Other main architectures like ARM or X86 get better code with
the generic implementation and its branch.

Signed-off-by: Christophe Leroy <christophe.leroy@csgroup.eu>
Signed-off-by: David S. Miller <davem@davemloft.net>
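
The two csum_shift() variants contrasted in the commit message above can be sketched in portable C. This is an illustration under stated assumptions, not the kernel source: `uint32_t` stands in for the kernel's `__wsum` type, and the names `csum_shift_branch`, `csum_shift_branchless`, `fold16` and `count_mismatches` are hypothetical. It shows why rotating left by 8 instead of right by 8 "has no impact on the final sum": the two rotations differ by a 16-bit rotation, which only swaps the two halves that the final 16-bit fold adds together.

```c
#include <stdint.h>

/* Rotate a 32-bit value left by n bits (n is reduced modulo 32). */
static uint32_t rol32(uint32_t x, unsigned n)
{
    n &= 31;
    return (x << n) | (x >> ((32 - n) & 31));
}

/* Rotate a 32-bit value right by n bits (n is reduced modulo 32). */
static uint32_t ror32(uint32_t x, unsigned n)
{
    n &= 31;
    return (x >> n) | (x << ((32 - n) & 31));
}

/* Branching variant: rotate right by 8 only when 'offset' is odd. */
static uint32_t csum_shift_branch(uint32_t sum, int offset)
{
    if (offset & 1)
        return ror32(sum, 8);
    return sum;
}

/* Branchless variant: bit 0 of 'offset', times 8, is the left-rotate count. */
static uint32_t csum_shift_branchless(uint32_t sum, int offset)
{
    return rol32(sum, (unsigned)(offset & 1) << 3);
}

/* 32-bit ones' complement addition with end-around carry. */
static uint32_t csum_add(uint32_t a, uint32_t b)
{
    uint32_t s = a + b;
    return s + (s < b);
}

/* Fold a 32-bit partial sum to 16 bits (no final complement here). */
static uint16_t fold16(uint32_t sum)
{
    sum = (sum & 0xffff) + (sum >> 16);
    sum = (sum & 0xffff) + (sum >> 16);
    return (uint16_t)sum;
}

/* Count inputs for which the two variants fold to different 16-bit sums. */
int count_mismatches(void)
{
    uint32_t a = 0x12345678u, b = 0x9abcdef0u;
    int bad = 0;

    for (int i = 0; i < 1000; i++) {
        for (int offset = 0; offset < 4; offset++) {
            uint16_t x = fold16(csum_add(a, csum_shift_branch(b, offset)));
            uint16_t y = fold16(csum_add(a, csum_shift_branchless(b, offset)));
            bad += (x != y);
        }
        /* Simple linear congruential steps to vary the inputs. */
        a = a * 1664525u + 1013904223u;
        b = b * 22695477u + 1u;
    }
    return bad;
}
```

The intermediate 32-bit values of the two variants differ for odd offsets; only the folded 16-bit results agree, which is the property the commit relies on.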
acpi	USB/Thunderbolt changes for 5.17-rc1	2022-01-12 11:27:57 -08:00
asm-generic	bitmap patches for 5.17-rc1	2022-01-23 06:20:44 +02:00
clocksource
crypto	lib/crypto: blake2s: avoid indirect calls to compression function for Clang CFI	2022-02-04 19:22:32 +01:00
drm
dt-bindings	Fixes for omaps	2022-02-07 17:42:44 +01:00
keys
kunit	kunit: replace kernel.h with the necessary inclusions	2022-01-20 08:52:54 +02:00
kvm	RISCV:	2022-01-16 16:15:14 +02:00
linux	net/mlx5: Parse module mapping using mlx5_ifc	2022-03-10 23:38:25 -08:00
math-emu
media
memory
misc
net	powerpc/net: Implement powerpc specific csum_shift() to remove branch	2022-03-11 10:57:22 +00:00
pcmcia
ras	mm/hwpoison: remove MF_MSG_BUDDY_2ND and MF_MSG_POISONED_HUGE	2022-01-15 16:30:31 +02:00
rdma
scsi	Merge branch 'akpm' (patches from Andrew)	2022-01-22 11:28:23 +02:00
soc	Merge git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net	2022-03-03 11:55:12 -08:00
sound	ASoC: Fixes for v5.17	2022-02-01 16:52:54 +01:00
target
trace	mptcp: add tracepoint in mptcp_sendmsg_frag	2022-03-08 22:06:10 -08:00
uapi	net: openvswitch: fix uAPI incompatibility with existing user space	2022-03-10 20:14:52 -08:00
vdso
video
xen	xen/gnttab: fix gnttab_end_foreign_access() without page specified	2022-03-07 09:48:55 +01:00