pleroy
91da9a311a
Upper case the argument to the --syntax flag, otherwise 'att' means 'intel' :-/
2025-03-12 00:35:01 +01:00
pleroy
0c201be10e
Revert 62908f3b8f
and fix a failure in tests.test_cli.TestCLI.test_without_arch while preserving the possibility to try more archs than the detected one.
2025-03-11 23:34:36 +01:00
JanLJL
9c2f559983
black formatting
2025-03-05 10:20:47 +01:00
JanLJL
02716e7b41
flake8 formatting
2025-03-05 10:19:10 +01:00
JanLJL
62908f3b8f
fix bug when no micro arch was given
2025-03-04 17:42:05 +01:00
pleroy
1a7c1588f6
Add support for the Intel syntax supported by MSVC and ICC
2025-02-02 14:02:16 +01:00
JanLJL
9b9d7f8649
changed default ARCHs to SPR/V2
2024-10-11 16:29:13 +02:00
JanLJL
3059952025
formatting
2024-09-05 10:44:23 +02:00
JanLJL
a731954166
fixed YAML output
2024-09-03 14:22:19 +02:00
JanLJL
2fcc62ae1a
initial support ZEN 4
2024-08-30 17:41:45 +02:00
JanLJL
2286da45b7
Merge branch 'master' into feat/spr
2024-05-02 21:19:10 +02:00
JanLJL
764b22cebe
initial support for SPR
2024-03-06 00:52:06 +01:00
JanLJL
3435641451
initial support Neoverse V2
2024-03-04 20:45:48 +01:00
stefandesouza
ec798f61b2
More formatting
2024-01-10 13:26:50 +01:00
stefandesouza
4647615c5c
Merge remote-tracking branch 'origin/master' into InstrucForm
2023-12-16 12:14:36 +01:00
JanLJL
f3b50b93f5
added M1 arch
2023-12-12 18:33:24 +01:00
stefandesouza
93ae586745
Fixed semantic and marker tests. Now only dump needs to be adjusted
2023-12-03 16:49:33 +01:00
stefandesouza
14a2aa0b52
Changed style to conform to PEP-8 conventions; Added source and destination attributes to parent Operand class
2023-10-29 13:52:49 +01:00
stefandesouza
615ef82f04
Changes to accomodate the new OO style
2023-08-28 15:19:46 +02:00
JanLJL
ab10febe74
enhanced YAML output to include all kernel objects and no ruamel.yaml-specific data types
2023-08-15 14:01:11 +02:00
Stephen Nicholas Swatman
e1ce402133
Add support for structured YAML output
...
This commit adds a new `--yaml-out` flag to OSACA which allows the user
to dump the results of an analysis to a YAML file, allowing them to
analyse the file more easily. I have tried to make the output as
comprehensive as possible.
2023-08-12 19:43:43 +02:00
JanLJL
eab6907c82
added support for optional condition flag dependency analysis
2023-03-14 17:00:49 +01:00
JanLJL
7724ce27c7
added Zen3 support
2022-09-27 18:39:14 +02:00
JanLJL
671f7f5591
added ICX architecture
2022-08-29 11:14:56 +02:00
Qingcai Jiang
7194e79beb
simple implement for TSV110
2021-11-06 16:04:16 +08:00
JanLJL
9c16f8bc56
formatted
2021-10-14 10:59:55 +02:00
JanLJL
5735291d27
Merge branch 'master' into a72
2021-10-14 10:37:05 +02:00
JanLJL
d418c16f4a
applied flake8 and black rules
2021-08-26 16:58:19 +02:00
JanLJL
f13a97e5b5
fixed bug in case of no uarch in CLI
2021-05-02 22:39:07 +02:00
JanLJL
3f31235f8a
added no timeout option
2021-04-19 10:57:51 +02:00
JanLJL
a82a0e24a3
bugfixed CLX as uarch flag
2021-04-19 00:34:21 +02:00
JanLJL
e6a54ee131
added CLX as synonym for CSX uarch
2021-04-19 00:05:53 +02:00
JanLJL
152360bad2
enhanced LCD analysis by making it parallel and added timeout flag
2021-04-19 00:04:03 +02:00
Julian
08440ed5e1
Validation ( #71 )
...
Validating of OSACA predictions for IVB, SKX, ZEN1, ZEN2, A64FX and TX2 with different kernels.
build_and_run.py contains the configuration used at RRZE's testcluster and UR's qpace4, Analysis.ipynb contains the analysis script and results. Raw data from measurements (122MB) will be attached to next OSACA release.
For now, find the raw data here: https://hawo.net/~sijuhamm/d/UPIhBOtz/validation-data.tar.gz
The analysis report can be viewed at https://nbviewer.jupyter.org/github/RRZE-HPC/OSACA/blob/validation/validation/Analysis.ipynb
Quite a few changes on OSACA included:
Feature: register change tracking via semantic understanding of operations
Feature: recording LCD latency along path and exposing this to frontend
Feature: support for memory reference aliases
Feature: store throughput scaling (similar to load throughput scaling)
Fix: model importer works with latest uops.info export
Fix: immediate type tracking on ARM now preserves type in internal representaion
Removed unused KerncraftAPI
2021-04-15 14:42:37 +02:00
Julian Hammer
b7625a4a25
making flake8 happy
2021-03-11 12:29:14 +01:00
Julian Hammer
6204c90934
migrate code style to Black
2021-03-11 12:02:45 +01:00
Julian Hammer
d7a687909e
passing parsing errors to the outside
2021-03-05 18:07:36 +01:00
Git out :V
12044e3ac4
Initial support for the Cortex-A72 (Raspberry Pi 4)
2020-12-16 18:49:16 +01:00
JanLJL
c8c077a834
enhanced length warning
2020-11-06 15:49:13 +01:00
JanLJL
207c53aaad
minor bugfix in HW model and added user warnings for more insight
2020-11-06 15:06:36 +01:00
JanLJL
b986d7eba0
added --lines option
2020-11-06 12:57:41 +01:00
JanLJL
9ba9bab107
try different ISA as fallback when parsing without --arch flag, use SKX as x86 default and enhanced ISA detection heuristic
2020-11-02 15:33:50 +01:00
JanLJL
1c889fa785
Merge branch 'master' of https://github.com/RRZE-HPC/OSACA
2020-10-29 13:00:09 +01:00
JanLJL
022598d94f
autodetect ISA and default uarch for ISA
2020-10-29 13:00:02 +01:00
Julian Hammer
9d2ea8603f
new caching structure with support for distribution
2020-10-28 16:29:55 +01:00
JanLJL
e8b78e4cc6
Merge branch 'master' into A64FX
2020-10-15 22:44:12 +02:00
Julian Hammer
7269156854
added --out
argument
2020-10-12 15:04:18 +02:00
JanLJL
6c72281d65
prepared for aarch64 8.2 support
2020-07-23 15:54:54 +02:00
JanLJL
93060eee43
Merge branch 'master' into A64FX
2020-07-13 14:41:49 +02:00
Cloud User
34e978d2ae
initial implementation of Neoverse N1 support
2020-06-30 20:28:57 +00:00