40 Commits

Author SHA1 Message Date
Jan
dc5b071e25 Create add-x_x_x-il_1_2-adds-x_x_x-TP.S 2022-02-10 14:04:04 +01:00
Jan
fa6d5fb400 Merge pull request #1 from RRZE-HPC/NEON (Neon) 2022-01-31 18:16:21 +01:00
JanLJL
dd60865b31 adjusted Makefile for master branch 2022-01-31 12:09:30 -05:00
JanLJL
ca87e93fd6 unified callee save register handling 2022-01-31 12:04:28 -05:00
JanLJL
97bfe42eac added callee save 2022-01-31 08:53:42 -05:00
JanLJL
299b17e460 removed duplicates 2022-01-31 08:53:27 -05:00
JanLJL
1dbe352401 changed Makefiles for ARM 2022-01-31 08:08:35 -05:00
JanLJL
3cf9c93c20 fixed bugs from merge 2022-01-31 08:08:00 -05:00
JanLJL
f1a0fcf3e8 Merge branch 'A64FX' into NEON 2022-01-31 04:44:03 -05:00
JanLJL
6a915eb8b7 new instructions 2022-01-31 04:09:42 -05:00
JanLJL
ce05692884 benchmarks for A64FX 2020-08-12 19:29:01 +02:00
JanLJL
de3bda1e3c added sample SVE instruction 2020-07-24 14:56:13 +02:00
JanLJL
686f44be7a more benchmarks 2020-07-11 12:57:35 +00:00
JanLJL
d9e607d25c one more str benchmark 2020-06-11 15:23:00 +02:00
JanLJL
b808cdc09f more instrs 2020-06-11 15:21:07 +02:00
JanLJL
863b0b4c41 more arm instructions 2020-06-11 15:04:00 +02:00
JanLJL
1491d8cd40 Merge branch 'master' of github.com:hofm/ibench 2020-06-09 15:13:35 +02:00
JanLJL
cd5d4631fa more arm benchmarks 2020-06-09 15:13:24 +02:00
JanLJL
1bf5191f31 moved benchmark creation from OSACA to ibench 2019-10-24 10:52:30 +02:00
Johannes Hofmann
0d23cce999 remove test benchmarks 2019-03-29 17:13:24 +01:00
Johannes Hofmann
a8b004c9e2 reorganize 2019-03-22 12:44:03 +01:00
Johannes Hofmann
f8f004d575 mark registers before using them 2019-03-22 12:43:44 +01:00
Johannes Hofmann
6e2d119109 remove results from repo 2019-03-13 10:35:33 +01:00
Johannes Hofmann
8bab9d3d45 add results 2019-03-11 12:31:21 +01:00
Johannes Hofmann
2909771fe8 TP and LAT for sqrt; NB: subtract add lat from sqrt lat! 2019-03-06 15:29:52 +01:00
Johannes Hofmann
37a86cbb02 add include_NEON.mk 2019-03-06 15:04:18 +01:00
Johannes Hofmann
903de0bb73 add div and reciprocal 2019-03-06 14:26:30 +01:00
Johannes Hofmann
1d15a4bc76 validated TP and LAT benchmarks for add, sub, and fmla 2019-03-06 13:58:38 +01:00
Johannes Hofmann
988cf6ccd7 add first set of ARM NEON instructions 2019-03-05 13:34:04 +01:00
Johannes Hofmann
b4c33a7963 adjust *.mk files for new directory names 2018-01-19 13:05:41 +01:00
Johannes Hofmann
2b798b825c group assembly files based on SIMD extension 2018-01-19 13:03:05 +01:00
Johannes Hofmann
22bfaf31fa bring directiory names in line with constants 2018-01-19 12:53:50 +01:00
Johannes Hofmann
599fc97ac1 Merge branch 'master' of github.com:hofm/ibench 2018-01-08 12:47:07 +01:00
Johannes Hofmann
b8a664920a add SSE load/store instructions 2018-01-08 12:46:51 +01:00
Johannes Hofmann
a0b2c199c3 add sqrt and rsqrt14 for AVX-512 2017-06-06 10:20:44 +02:00
Johannes Hofmann
5c4db847bd add sqrt to AVX benchmarks 2017-06-06 10:19:25 +02:00
Johannes Hofmann
7c5b855eaa remove immintrin.h header 2017-05-29 04:25:58 -07:00
Johannes Hofmann
af07f9cf6f add AVX load/store templates 2017-05-23 07:41:54 +02:00
Johannes Hofmann
b99cacd528 initial import 2017-05-19 12:18:17 +02:00
Johannes Hofmann
ccf4f3333d Initial commit 2017-05-19 10:02:29 +02:00