bpo-47009: Let PRECALL_NO_KW_LIST_APPEND do its own POP_TOP#32239

sweeneyde · 2022-04-01T18:37:02Z

Most code won't do y = L.append(x) or whatnot, so PRECALL_NO_KW_LIST_APPEND is almost always followed by POP_TOP. We can verify at specialization time.

This saves a Py_INCREF(Py_None), a SET_TOP(Py_None), and POP_TOP's Py_DECREF(POP()); DISPATCH();.

Some microbenchmarks:

frompyperfimportRunner, perf_counterdefbench_append(loops, length): src=list(map(float, range(length))) arr= [] t0=perf_counter() foriinrange(loops): arr.clear() forxinsrc: arr.append(x) returnperf_counter() -t0defbench_append_less_gc(loops, length): src=list(map(float, range(length))) out= [None] *loopst0=perf_counter() foriinrange(loops): arr= [] forxinsrc: arr.append(x) out[i] =arrreturnperf_counter() -t0runner=Runner() fornin [100, 1_000, 10_000, 100_000]: runner.bench_time_func(f"append {n}", bench_append, n, inner_loops=n) runner.bench_time_func(f"append-less-gc {n}", bench_append_less_gc, n, inner_loops=n)

From GCC, --enable-optimizations, --with-lto:

- append 100000: 14.9 ns +- 0.3 ns -> 13.3 ns +- 0.4 ns: 1.12x faster - append 10000: 15.1 ns +- 0.3 ns -> 13.6 ns +- 0.5 ns: 1.11x faster - append-less-gc 100000: 16.4 ns +- 0.5 ns -> 14.9 ns +- 0.4 ns: 1.10x faster - append 1000: 15.6 ns +- 0.3 ns -> 14.2 ns +- 0.3 ns: 1.09x faster - append 100: 18.9 ns +- 0.6 ns -> 17.3 ns +- 0.6 ns: 1.09x faster - append-less-gc 100: 27.4 ns +- 1.1 ns -> 25.2 ns +- 1.2 ns: 1.09x faster - append-less-gc 10000: 19.2 ns +- 0.3 ns -> 17.8 ns +- 0.2 ns: 1.08x faster - append-less-gc 1000: 22.0 ns +- 0.6 ns -> 20.8 ns +- 0.3 ns: 1.06x faster Geometric mean: 1.09x faster

https://bugs.python.org/issue47009

markshannon · 2022-04-05T10:18:03Z

Looks good. I'm a bit wary of specialized superinstructions, but this seems solid.
I can imagine cases where list.append() wouldn't be followed by a POP_TOP, but they are contrived and highly unlikely.

tiran · 2022-04-05T11:27:55Z

The assert is failing on s390x Fedora buildbot https://buildbot.python.org/all/#/builders/232/builds/524

_bootstrap_python: Python/ceval.c:5045: _PyEval_EvalFrameDefault: Assertion `next_instr[-1] == POP_TOP' failed. make: *** [Makefile:1204: Python/frozen_modules/io.h] Aborted (core dumped)

markshannon · 2022-04-05T11:38:32Z

Strange. The bytecode is exactly the same on all platforms.

Let PRECALL_NO_KW_LIST_APPEND do its own POP_TOP
344ee45

sweeneyde requested a review from markshannon as a code owner April 1, 2022 18:37

bedevere-bot added the awaiting core review label Apr 1, 2022

the-knights-who-say-ni added the CLA signed label Apr 1, 2022

sweeneyde added the skip news label Apr 1, 2022

Fix a copy/paste error
e0adb6a

sweeneyde force-pushed the listappend_pop branch from 43782cc to e0adb6aCompare April 1, 2022 20:51

Merge branch 'main' into listappend_pop
3361295

sweeneyde requested a review from brandtbucher April 5, 2022 07:19

markshannon merged commit 6c6e040 into python:mainApr 5, 2022

bedevere-bot removed the awaiting core review label Apr 5, 2022

sweeneyde deleted the listappend_pop branch April 5, 2022 22:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

bpo-47009: Let PRECALL_NO_KW_LIST_APPEND do its own POP_TOP#32239

bpo-47009: Let PRECALL_NO_KW_LIST_APPEND do its own POP_TOP #32239

Uh oh!

sweeneyde commented Apr 1, 2022•
edited
Loading

Uh oh!

markshannon commented Apr 5, 2022

Uh oh!

tiran commented Apr 5, 2022

Uh oh!

markshannon commented Apr 5, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

bpo-47009: Let PRECALL_NO_KW_LIST_APPEND do its own POP_TOP#32239

bpo-47009: Let PRECALL_NO_KW_LIST_APPEND do its own POP_TOP #32239

Uh oh!

Conversation

sweeneyde commented Apr 1, 2022• edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

markshannon commented Apr 5, 2022

Uh oh!

tiran commented Apr 5, 2022

Uh oh!

markshannon commented Apr 5, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

sweeneyde commented Apr 1, 2022•
edited
Loading