openbsd/src/lib/libcrypto/arch/amd64/Makefile.inc, branch OPENBSD_7_8

openbsd/src/lib/libcrypto/arch/amd64/Makefile.inc, branch OPENBSD_7_8_BASE A mirror of https://github.com/libressl/openbsd.git https://git.lua4.win/openbsd/atom?h=OPENBSD_7_8_BASE 2025-08-14T15:12:53+00:00 Hook additional s2n-bignum routines to the amd64 build. 2025-08-14T15:12:53+00:00 jsing 2025-08-14T15:12:53+00:00 urn:sha1:07bac2b524750c4dfd196516fe31b97d149acde4 Rework gcm128 implementation selection for amd64/i386. 2025-06-28T12:39:10+00:00 jsing 2025-06-28T12:39:10+00:00 urn:sha1:48723f4db60f6f8a8ad8424ffe5e0262d30f397c Provide gcm128_amd64.c and gcm128_i386.c, which contain the appropriate gcm128 initialisation and CPU feature tests for the respective platform. This allows for all of the #define spagetti to be removed from gcm128.c and removes one of the two remaining consumers of crypto_cpu_caps_ia32(). ok tb@ Integrate AES-NI into the AES code. 2025-06-15T15:11:50+00:00 jsing 2025-06-15T15:11:50+00:00 urn:sha1:51f804b282da1b0813454f9f6027d3726deea4c7 Currently, the AES-NI code is only integrated into EVP - add code to integrate AES-NI into AES. Rename the assembly provided functions and provide C versions for the original names, which check for AES-NI support and dispatch to the appropriate function. This means that the AES_* public API will now use AES-NI, if available. ok tb@ Stop defining OPENSSL_IA32_SSE2 on amd64. 2025-06-09T13:51:48+00:00 jsing 2025-06-09T13:51:48+00:00 urn:sha1:083258dce472c3642c1fc04f51099b226a1db816 This no longer does anything on this architecture. ok bcook@ beck@ Remove BS-AES and VP-AES from EVP. 2025-04-18T13:19:39+00:00 jsing 2025-04-18T13:19:39+00:00 urn:sha1:710fa3f5ab1ebef2c8c4b9bfdefe9a64dbaef0e5 The bitsliced and vector permutation AES implementations were created around 2009, in attempts to speed up AES on Intel hardware. Both require SSSE3 which existed from around 2006. Intel introduced AES-NI in 2008 and a large percentage of Intel/AMD CPUs made in the last 15 years include it. AES-NI is significantly faster and requires less code. Furthermore, the BS-AES and VP-AES implementations are wired directly into EVP (as is AES-NI currently), which means that any consumers of the AES_* API are not able to benefit from acceleration. Removing these greatly simplifies the EVP AES code - if you just happen to have a CPU that supports SSSE3 but not AES-NI, then you'll now use the regular AES assembly implementations instead. ok kettenis@ tb@ Replace Makefile based SHA*_ASM defines with HAVE_SHA_* defines. 2025-02-14T12:01:58+00:00 jsing 2025-02-14T12:01:58+00:00 urn:sha1:a89810379a758c9cd27af2462547dc646dcfaa61 Currently, SHA{1,256,512}_ASM defines are used to remove the C implementation of sha{1,256,512}_block_data_order() when it is provided by assembly. However, this prevents the C implementation from being used as a fallback. Rename the C sha*_block_data_order() to sha*_block_generic() and provide a sha*_block_data_order() that calls sha*_block_generic(). Replace the Makefile based SHA*_ASM defines with two HAVE_SHA_* defines that allow these functions to be compiled in or removed, such that machine specific verisons can be provided. This should effectively be a no-op on any platform that defined SHA{1,256,512}_ASM. ok tb@ Provide a readable assembly implementation for MD5 on amd64. 2025-01-24T13:35:04+00:00 jsing 2025-01-24T13:35:04+00:00 urn:sha1:e645f65a85d604ca35a8889b91950b72ea837f74 This appears to be about 5% faster than the current perlasm version on a modern Intel CPU. While here rename md5_block_asm_data_order to md5_block_data_order, for consistency with other hashes. ok tb@ Provide a SHA-1 assembly implementation for amd64 using SHA-NI. 2024-12-06T11:57:18+00:00 jsing 2024-12-06T11:57:18+00:00 urn:sha1:52aaf400e5a619fcab8ae52524e5aaf96e0b0894 This provides a SHA-1 assembly implementation for amd64, which uses the Intel SHA Extensions (aka SHA New Instructions or SHA-NI). This provides a 2-2.5x performance gain on some Intel CPUs and many AMD CPUs. ok tb@ Provide a replacement assembly implementation for SHA-1 on amd64. 2024-12-04T13:13:33+00:00 jsing 2024-12-04T13:13:33+00:00 urn:sha1:2de3ee8c5940ebad54feb4303f6fc816daca784b As already done for SHA-256 and SHA-512, replace the perlasm generated SHA-1 assembly implementation with one that is actually readable. Call the assembly implementation from a C wrapper that can, in the future, dispatch to alternate implementations. On a modern CPU the performance is around 5% faster than the base implementation generated by sha1-x86_64.pl, however it is around 15% slower than the excessively complex SSSE2/AVX version that is also generated by the same script (a SHA-NI version will greatly outperform this and is much cleaner/simpler). ok tb@ Provide a SHA-256 assembly implementation for amd64 using SHA-NI. 2024-11-16T15:31:36+00:00 jsing 2024-11-16T15:31:36+00:00 urn:sha1:a06b7340f2af374d4581bec7db2775ae686ce1ab This provides a SHA-256 assembly implementation for amd64, which uses the Intel SHA Extensions (aka SHA New Instructions or SHA-NI). This provides a 3-5x performance gain on some Intel CPUs and many AMD CPUs. ok tb@