Jan
|
dc5b071e25
|
Create add-x_x_x-il_1_2-adds-x_x_x-TP.S
|
2022-02-10 14:04:04 +01:00 |
|
Jan
|
fa6d5fb400
|
Merge pull request #1 from RRZE-HPC/NEON
Neon
|
2022-01-31 18:16:21 +01:00 |
|
JanLJL
|
dd60865b31
|
adjusted Makefile for master branch
|
2022-01-31 12:09:30 -05:00 |
|
JanLJL
|
ca87e93fd6
|
unified callee save register handling
|
2022-01-31 12:04:28 -05:00 |
|
JanLJL
|
97bfe42eac
|
added callee save
|
2022-01-31 08:53:42 -05:00 |
|
JanLJL
|
299b17e460
|
removed duplicates
|
2022-01-31 08:53:27 -05:00 |
|
JanLJL
|
1dbe352401
|
changed Makefiles for ARM
|
2022-01-31 08:08:35 -05:00 |
|
JanLJL
|
3cf9c93c20
|
fixed bugs from merge
|
2022-01-31 08:08:00 -05:00 |
|
JanLJL
|
f1a0fcf3e8
|
Merge branch 'A64FX' into NEON
|
2022-01-31 04:44:03 -05:00 |
|
JanLJL
|
6a915eb8b7
|
new instructions
|
2022-01-31 04:09:42 -05:00 |
|
JanLJL
|
ce05692884
|
benchmarks for A64FX
|
2020-08-12 19:29:01 +02:00 |
|
JanLJL
|
de3bda1e3c
|
added sample SVE instruction
|
2020-07-24 14:56:13 +02:00 |
|
JanLJL
|
686f44be7a
|
more benchmarks
|
2020-07-11 12:57:35 +00:00 |
|
JanLJL
|
d9e607d25c
|
one more str benchmark
|
2020-06-11 15:23:00 +02:00 |
|
JanLJL
|
b808cdc09f
|
more instrs
|
2020-06-11 15:21:07 +02:00 |
|
JanLJL
|
863b0b4c41
|
more arm instructions
|
2020-06-11 15:04:00 +02:00 |
|
JanLJL
|
1491d8cd40
|
Merge branch 'master' of github.com:hofm/ibench
|
2020-06-09 15:13:35 +02:00 |
|
JanLJL
|
cd5d4631fa
|
more arm benchmarks
|
2020-06-09 15:13:24 +02:00 |
|
JanLJL
|
1bf5191f31
|
moved benchmark creation from OSACA to ibench
|
2019-10-24 10:52:30 +02:00 |
|
Johannes Hofmann
|
0d23cce999
|
remove test benchmarks
|
2019-03-29 17:13:24 +01:00 |
|
Johannes Hofmann
|
a8b004c9e2
|
reorganize
|
2019-03-22 12:44:03 +01:00 |
|
Johannes Hofmann
|
f8f004d575
|
mark registers before using them
|
2019-03-22 12:43:44 +01:00 |
|
Johannes Hofmann
|
6e2d119109
|
remove results from repo
|
2019-03-13 10:35:33 +01:00 |
|
Johannes Hofmann
|
8bab9d3d45
|
add results
|
2019-03-11 12:31:21 +01:00 |
|
Johannes Hofmann
|
2909771fe8
|
TP and LAT for sqrt; NB: subtract add lat from sqrt lat!
|
2019-03-06 15:29:52 +01:00 |
|
Johannes Hofmann
|
37a86cbb02
|
add include_NEON.mk
|
2019-03-06 15:04:18 +01:00 |
|
Johannes Hofmann
|
903de0bb73
|
add div and reciprocal
|
2019-03-06 14:26:30 +01:00 |
|
Johannes Hofmann
|
1d15a4bc76
|
validated TP and LAT benchmarks for add, sub, and fmla
|
2019-03-06 13:58:38 +01:00 |
|
Johannes Hofmann
|
988cf6ccd7
|
add first set of ARM NEON instructions
|
2019-03-05 13:34:04 +01:00 |
|
Johannes Hofmann
|
b4c33a7963
|
adjust *.mk files for new directory names
|
2018-01-19 13:05:41 +01:00 |
|
Johannes Hofmann
|
2b798b825c
|
group assembly files based on SIMD extension
|
2018-01-19 13:03:05 +01:00 |
|
Johannes Hofmann
|
22bfaf31fa
|
bring directiory names in line with constants
|
2018-01-19 12:53:50 +01:00 |
|
Johannes Hofmann
|
599fc97ac1
|
Merge branch 'master' of github.com:hofm/ibench
|
2018-01-08 12:47:07 +01:00 |
|
Johannes Hofmann
|
b8a664920a
|
add SSE load/store instructions
|
2018-01-08 12:46:51 +01:00 |
|
Johannes Hofmann
|
a0b2c199c3
|
add sqrt and rsqrt14 for AVX-512
|
2017-06-06 10:20:44 +02:00 |
|
Johannes Hofmann
|
5c4db847bd
|
add sqrt to AVX benchmarks
|
2017-06-06 10:19:25 +02:00 |
|
Johannes Hofmann
|
7c5b855eaa
|
remove immintrin.h header
|
2017-05-29 04:25:58 -07:00 |
|
Johannes Hofmann
|
af07f9cf6f
|
add AVX load/store templates
|
2017-05-23 07:41:54 +02:00 |
|
Johannes Hofmann
|
b99cacd528
|
initial import
|
2017-05-19 12:18:17 +02:00 |
|
Johannes Hofmann
|
ccf4f3333d
|
Initial commit
|
2017-05-19 10:02:29 +02:00 |
|