# MicroPython Test Suite
This directory contains tests for various functionality areas of MicroPython. To run all stable tests, run the `run-tests.py` script in this directory.
Tests of capabilities not supported on all platforms should be written to check for the capability being present. If it is not, the test should merely output 'SKIP' followed by the line terminator, and call `sys.exit()` to raise `SystemExit`, instead of attempting to test the missing capability. The testing framework (`run-tests.py` in this directory, `test_main.c` in the qemu-arm port) recognizes this as a skipped test.
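For example, a minimal skip guard looks like the following (checking for the `frozenset` built-in is purely illustrative; a real test guards whatever capability it actually needs):

```python
# Skip this test cleanly if the required capability is not present.
try:
    frozenset  # requires the frozenset built-in to be enabled in the build
except NameError:
    print("SKIP")
    raise SystemExit  # equivalent to calling sys.exit()
```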
There are a few features for which this mechanism cannot be used to condition a test. The `run-tests.py` script uses small scripts in the `feature_check` directory to check whether each such feature is present, and skips the relevant tests if not.
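For instance, a feature-check script can be as small as a single construct that only compiles when the corresponding feature is enabled. The sketch below is purely illustrative (the real scripts live in `feature_check/`):

```python
# Illustrative feature-check script: it produces no output itself, but it
# only compiles on builds with async/await support, which lets run-tests.py
# decide whether the related tests can run at all.
async def _probe():
    pass
```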
Tests are generally verified by running the test both in MicroPython and in CPython and comparing the outputs. If the outputs differ the test fails, and the outputs are saved in a `.out` and a `.exp` file respectively. For tests that cannot be run in CPython, for example because they use the `machine` module, a `.exp` file can be provided next to the test's `.py` file. A convenient way to generate that is to run the test, let it fail (because CPython cannot run it), and then copy the `.out` file (but not before checking it manually!).
When creating new tests, anything that relies on float support should go in the `float/` subdirectory. Anything that relies on `import x`, where `x` is not a built-in module, should go in the `import/` subdirectory.
## perf_bench

The `perf_bench` directory contains some performance benchmarks that can be used
to benchmark different MicroPython firmwares or host ports.

The runner utility is `run-perfbench.py`. Execute `./run-perfbench.py --help`
for a full list of command line options.
### Benchmarking a target

To run the tests on a firmware target via `pyboard.py`, run the command like
this:

```
./run-perfbench.py -p -d /dev/ttyACM0 168 100
```

- `-p` indicates running on a remote target via `pyboard.py`, not on the host.
- `-d PORTNAME` is the serial port; `/dev/ttyACM0` is the default if not
  provided.
- `168` is the value `N`, the approximate CPU frequency in MHz (in this case a
  Pyboard V1.1 runs at 168MHz). It's possible to choose other values as well:
  lower values like `10` will run the tests much quicker, higher values like
  `1000` will run them much longer.
- `100` is the value `M`, the approximate heap size in kilobytes (get this from
  `import micropython; micropython.mem_info()` or estimate it). It's possible to
  choose other values here too: lower values like `10` will run shorter/smaller
  tests, and higher values will run bigger tests. The maximum value of `M` is
  limited by the available heap, and the tests are written so the "recommended"
  value is approximately the upper limit. A sketch of using `mem_info()` to
  estimate `M` is shown after this list.
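As mentioned above, a quick way to estimate a suitable `M` is to inspect the heap on the target itself. A minimal sketch (the exact format of the `mem_info()` output varies between ports and versions):

```python
# Run on the target board, e.g. at the REPL, to estimate the heap size.
# The "GC: total: ..." line reports the heap size in bytes; dividing that
# by 1024 gives an approximate upper bound for the M argument.
import micropython

micropython.mem_info()
```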
### Benchmarking the host

To benchmark the host build (unix/Windows), run like this:

```
./run-perfbench.py 2000 10000
```
The output of perfbench is a list of tests and times/scores, like this:

```
N=2000 M=10000 n_average=8
perf_bench/bm_chaos.py: SKIP
perf_bench/bm_fannkuch.py: 94550.38 2.9145 84.68 2.8499
perf_bench/bm_fft.py: 79920.38 10.0771 129269.74 8.8205
perf_bench/bm_float.py: 43844.62 17.8229 353219.64 17.7693
perf_bench/bm_hexiom.py: 32959.12 15.0243 775.77 14.8893
perf_bench/bm_nqueens.py: 40855.00 10.7297 247776.15 11.3647
perf_bench/bm_pidigits.py: 64547.75 2.5609 7751.36 2.5996
perf_bench/core_import_mpy_multi.py: 15433.38 14.2733 33065.45 14.2368
perf_bench/core_import_mpy_single.py: 263.00 11.3910 3858.35 12.9021
perf_bench/core_qstr.py: 4929.12 1.8434 8117.71 1.7921
perf_bench/core_yield_from.py: 16274.25 6.2584 12334.13 5.8125
perf_bench/misc_aes.py: 57425.25 5.5226 17888.60 5.7482
perf_bench/misc_mandel.py: 40809.25 8.2007 158107.00 9.8864
perf_bench/misc_pystone.py: 39821.75 6.4145 100867.62 6.5043
perf_bench/misc_raytrace.py: 36293.75 6.8501 26906.93 6.8402
perf_bench/viper_call0.py: 15573.00 14.9931 19644.99 13.1550
perf_bench/viper_call1a.py: 16725.75 9.8205 18099.96 9.2752
perf_bench/viper_call1b.py: 20752.62 8.3372 14565.60 9.0663
perf_bench/viper_call1c.py: 20849.88 5.8783 14444.80 6.6295
perf_bench/viper_call2a.py: 16156.25 11.2956 18818.59 11.7959
perf_bench/viper_call2b.py: 22047.38 8.9484 13725.73 9.6800
```
The numbers across each line are times and scores for the test:
- Runtime average (microseconds, lower is better)
- Runtime standard deviation as a percentage
- Score average (units depend on the benchmark, higher is better)
- Score standard deviation as a percentage
### Comparing performance

Usually you want to know if something is faster or slower than a reference. To
do this, copy the output of each `run-perfbench.py` run to a text file. This can
be done in multiple ways, but one way on Linux/macOS is with the `tee` utility:

```
./run-perfbench.py -p 168 100 | tee pyb-run1.txt
```

Once you have two files with output from two different runs (maybe with
different code or configuration), compare the runtimes with
`./run-perfbench.py -t pyb-run1.txt pyb-run2.txt` or compare scores with
`./run-perfbench.py -s pyb-run1.txt pyb-run2.txt`:
```
> ./run-perfbench.py -s pyb-run1.txt pyb-run2.txt
diff of scores (higher is better)
N=168 M=100 pyb-run1.txt -> pyb-run2.txt diff diff% (error%)
bm_chaos.py 352.90 -> 352.63 : -0.27 = -0.077% (+/-0.00%)
bm_fannkuch.py 77.52 -> 77.45 : -0.07 = -0.090% (+/-0.01%)
bm_fft.py 2516.80 -> 2519.74 : +2.94 = +0.117% (+/-0.00%)
bm_float.py 5749.27 -> 5749.65 : +0.38 = +0.007% (+/-0.00%)
bm_hexiom.py 42.22 -> 42.30 : +0.08 = +0.189% (+/-0.00%)
bm_nqueens.py 4407.55 -> 4414.44 : +6.89 = +0.156% (+/-0.00%)
bm_pidigits.py 638.09 -> 632.14 : -5.95 = -0.932% (+/-0.25%)
core_import_mpy_multi.py 477.74 -> 477.57 : -0.17 = -0.036% (+/-0.00%)
core_import_mpy_single.py 58.74 -> 58.72 : -0.02 = -0.034% (+/-0.00%)
core_qstr.py 63.11 -> 63.11 : +0.00 = +0.000% (+/-0.01%)
core_yield_from.py 357.57 -> 357.57 : +0.00 = +0.000% (+/-0.00%)
misc_aes.py 397.27 -> 396.47 : -0.80 = -0.201% (+/-0.00%)
misc_mandel.py 3375.70 -> 3375.84 : +0.14 = +0.004% (+/-0.00%)
misc_pystone.py 2265.36 -> 2265.97 : +0.61 = +0.027% (+/-0.01%)
misc_raytrace.py 367.61 -> 368.15 : +0.54 = +0.147% (+/-0.01%)
viper_call0.py 605.92 -> 605.92 : +0.00 = +0.000% (+/-0.00%)
viper_call1a.py 576.78 -> 576.78 : +0.00 = +0.000% (+/-0.00%)
viper_call1b.py 452.45 -> 452.46 : +0.01 = +0.002% (+/-0.01%)
viper_call1c.py 457.39 -> 457.39 : +0.00 = +0.000% (+/-0.00%)
viper_call2a.py 561.37 -> 561.37 : +0.00 = +0.000% (+/-0.00%)
viper_call2b.py 389.49 -> 389.50 : +0.01 = +0.003% (+/-0.01%)
```
Note in particular the error percentages at the end of each line. If these are
high relative to the percentage difference then it indicates high variability in
the test runs, and the absolute difference value is unreliable. High error
percentages are particularly common on PC builds, where the host OS may
influence test run times. Increasing the `N` value may help average this out by
running each test for longer.
## internal_bench

The `internal_bench` directory contains a set of tests for benchmarking
different internal Python features. By default, tests are run on the (unix or
Windows) host, but the `--pyboard` option allows them to be run on an attached
board instead.

Tests are grouped by the first part of the file name, and the test runner
compares the results of the tests within each group.

The benchmarks measure the elapsed (wall-clock) time for each test, according to
MicroPython's own `time` module.

If run without any arguments, all test groups are run. Otherwise, it's possible
to manually specify which test cases to run.
Example:

```
$ ./run-internalbench.py internal_bench/bytebuf-*.py
internal_bench/bytebuf:
0.094s (+00.00%) internal_bench/bytebuf-1-inplace.py
0.471s (+399.24%) internal_bench/bytebuf-2-join_map_bytes.py
0.177s (+87.78%) internal_bench/bytebuf-3-bytarray_map.py
1 tests performed (3 individual testcases)
```
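Each test in a group is a small stand-alone script that exercises one approach. The following is a minimal sketch of such a script, assuming the `bench` helper module that lives in `internal_bench/` (its `run()` function calls the given function with an iteration count and prints the elapsed time); the file name is hypothetical:

```python
# Hypothetical internal_bench-style test: time one way of filling a buffer.
# It would be paired with other "bytefill-*" variants to form a group.
import bench


def test(num):
    ba = bytearray(1000)
    for _ in range(num // 1000):
        for j in range(len(ba)):
            ba[j] = j & 0xFF


bench.run(test)
```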
## Test key/certificates

The SSL/TLS tests in `multi_net` and `net_inet` use a self-signed key/cert pair
that is randomly generated and is to be used for testing/demonstration only.
You should always generate your own key/cert.

To generate a new self-signed RSA key/cert pair with openssl do:

```
$ openssl req -x509 -newkey rsa:2048 -keyout rsa_key.pem -out rsa_cert.pem -days 365 -nodes -subj '/CN=micropython.local/O=MicroPython/C=AU'
```

In this case the CN is micropython.local.

Convert them to DER format:

```
$ openssl pkey -in rsa_key.pem -out rsa_key.der -outform DER
$ openssl x509 -in rsa_cert.pem -out rsa_cert.der -outform DER
```

To test elliptic curve key/cert pairs, create a key and then a certificate using:

```
$ openssl ecparam -name prime256v1 -genkey -noout -out ec_key.der -outform DER
$ openssl req -new -x509 -key ec_key.der -out ec_cert.der -outform DER -days 365 -nodes -subj '/CN=micropython.local/O=MicroPython/C=AU'
```
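A test then typically loads these DER files as raw bytes before handing them to the TLS layer. A minimal sketch of the loading step (how the bytes are then passed to the `ssl` module differs between MicroPython versions, so that part is only indicated in a comment):

```python
# Read the generated DER-encoded key and certificate as raw bytes.
with open("rsa_key.der", "rb") as f:
    key = f.read()
with open("rsa_cert.der", "rb") as f:
    cert = f.read()

# The bytes are then passed to the TLS API of the running MicroPython
# version, e.g. ssl.wrap_socket(sock, key=key, cert=cert) on some builds.
```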