gh-124153: Introduce PyType_GetBaseByToken function (PoC) #121079

neonene · 2024-06-27T08:24:10Z

Reference implementation of the following C-API functions:

PyType_GetBaseByToken()
~~PyType_GetToken()~~

Discussion: https://discuss.python.org/t/55598

📚 Documentation preview 📚: https://cpython-previews--121079.org.readthedocs.build/

Issue: Add PyType_GetBaseByToken function with Py_tp_token slot #124153

neonene · 2024-07-03T12:19:34Z

The tp_bases can be used in a fallback implementation. I checked the overhead, adding a repeat function temporarily (100972d):

    from timeit import timeit

    setup = f"""if 1:
        import _testcapi
        A = _testcapi.create_type_with_token("_testcapi.A", 0)
        tokenA = _testcapi.get_tp_token(A)
        class B(A): pass
        class C(B): pass
        class D(C): pass
        class E(D): pass
        getbase = _testcapi.repeat_getbasebytoken
    """
    c_repeat = 10
    # py_repeat = timeit default (1000000)

    mro = timeit(s1 := f'getbase(C, tokenA, {c_repeat}, True)', setup)
    bases = timeit(s2 := f'getbase(C, tokenA, {c_repeat}, False)', setup)
    print(s1, mro)
    print(s2, bases, bases/mro)

Windows non-debug build (higher is slower):

| find A from | starts with | run once in C | repeat 10 in C |
|---|---|---|---|
| mro | class A | 1.00 | 1.00 |
| bases | class A | 1.00x | 1.00x |
| mro | class B | 1.00x | 1.05x |
| bases | class B | 1.01x | 1.14x |
| mro | class C | 1.01x | 1.10x |
| bases | class C | 1.04x | 1.40x |
| mro | class E | 1.02x | 1.16x |
| bases | class E | 1.08x | 1.50x |
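
For readers unfamiliar with the two lookup strategies compared here, the following is a rough pure-Python model, not the C code from this PR. The `TOKENS` registry and the helper names `find_by_mro` and `find_by_bases` are hypothetical stand-ins for the per-type token slot and the two search paths; the real function works on `tp_mro` and `tp_bases` in C.

```python
# Pure-Python model of two ways to find the base class that carries a token.
# TOKENS is a hypothetical stand-in for the per-type token slot; it is not a
# CPython API.
TOKENS = {}


def find_by_mro(cls, token):
    """Scan the linearized MRO and return the first class tagged with token."""
    for base in cls.__mro__:
        if base in TOKENS and TOKENS[base] is token:
            return base
    return None


def find_by_bases(cls, token):
    """Depth-first walk over __bases__, usable when no MRO is available."""
    if cls in TOKENS and TOKENS[cls] is token:
        return cls
    for base in cls.__bases__:
        found = find_by_bases(base, token)
        if found is not None:
            return found
    return None


class A: pass
class B(A): pass
class C(B): pass
class D(C): pass
class E(D): pass


token_a = object()  # any unique object can serve as a token in this model
TOKENS[A] = token_a

assert find_by_mro(E, token_a) is A
assert find_by_bases(E, token_a) is A
assert find_by_mro(E, object()) is None
```

In this model the MRO scan is a single flat loop, while the bases walk recurses once per inheritance level. That may be part of why the `bases` rows grow faster with depth, though the measured C overhead can have other causes.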

encukou

The code looks good. Do you want to clear the draft bit, and file an issue?

If there are more performance tweaks to make, they can go in a follow-up PR.

encukou · 2024-09-16T13:47:53Z

Modules/_ctypes/ctypes.h

+        PyErr_Format(PyExc_TypeError, "expected a ctypes type, got '%N'", type);
+        return NULL;
+    }
+    exercise_get_base_by_token(PyCType_Type);


Could you remove the exercise from this PR?
Hopefully the training will get better as PyType_GetBaseByToken is used more; if not, we can adjust it in a future PR.

neonene · 2024-09-17

I'll post a new PR. It may be better to have a dedicated branch for the case where the `result` argument is NULL.

neonene added 17 commits June 27, 2024 17:13

add document
eb0d23d

introduce Py_tp_token and ht_token
273bac1

fix PyType_GetSlot()
d041167

add PyType_GetBaseByToken() PyType_GetToken()
d41cead

add test
1456baa

add ctypes example
14b4103

use tp_mro directly (like PyType_IsSubtype) for free-threading err
7af3378

silence warning
0b8bd07

edit testcase
f917ad9

remove an assertion
e08d58f

add missing cast
22c20d9

cast again
7fa0d81

edit testcase
5621704

remove redundant testcase
492a7d5

edit testcase
f71fa78

add test for no-result mode 1/2
83760d7

add test for no-result mode 2/2
245d9d4

erlend-aasland added the topic-C-API label Jun 28, 2024

neonene added 3 commits June 29, 2024 07:13

edit test
a0858a7

add functions to check perf
4fb9cf8

remove previous experiment commit
eb29cf8

This comment was marked as outdated.

neonene added 5 commits July 3, 2024 18:34

abandon the proposal of PyType_GetToken()
ccd5ede

optimize GetBaseByToken like GetModuleByDef
7d3f8b6

add a repeat test (temporary use)
100972d

Merge branch 'main' into bytoken
c58ef1c

fix a build error
7e89a98

neonene changed the title ~~Introduce PyType_GetBaseByToken function and friends~~ Introduce PyType_GetBaseByToken function Jul 3, 2024

remove the repeat test
1de4cba

neonene added 20 commits August 30, 2024 00:21

_decimal: restore PyType_GetModuleByDef() in bin-ops
52616f1
This keeps the slowdown to at most 2% on the `telco` benchmark (PGO). Unlike `PyDecContextObject`, extending the `PyDecObject` struct seems to affect only binary ops and seems to be a waste of memory.

Merge branch 'main' into bytoken
5e094d0

fix build error
ac03429

optimize PyType_GetBaseByToken()
ac82d36

_decimal: improve performance
2a24934
Up to 2% faster than upstream on the `telco` benchmark (PGO/non-PGO). Based on the GetBaseByToken() optimization in ac82d36.

Merge branch 'main' into bytoken
ce68195

rename
fa936d7

clarify the optimizations
a9121fc

typo
7f50633

Add a private function and use it
ea6e9d3
Keeps the performance unchanged even if the private function is not inlined (i.e. not trained well on PGO).

Merge branch 'main' into bytoken
60779f8

revert private function (borrowed ref ver.)
588d5ee
PyType_GetBaseByToken() fails to inline the wrapped private function, whose overhead appears to be non-negligible.

Merge branch 'main' into bytoken
de19b5a

edit header file
b730abb

bad PyType_GetBaseByToken() example
77f143a
This cleanup can cause a slowdown of 10% on the `telco` benchmark for some reason.

recover performance when a reference is needed
b114bc8

Merge branch 'main' into bytoken
08bfb87

edit PyType_GetBaseByToken()
f08da25

recover performance take2
e3c1182
This version sets `*result` to NULL at the end to reduce the overhead of a double memory access when returning true. Under verification.

Merge branch 'main' into bytoken
d95e422

encukou reviewed Sep 16, 2024
View reviewed changes

neonene changed the title ~~Introduce PyType_GetBaseByToken function~~ gh-124153: Introduce PyType_GetBaseByToken function (PoC) Sep 17, 2024

bedevere-app bot mentioned this pull request Sep 17, 2024
Add PyType_GetBaseByToken function with Py_tp_token slot #124153
Closed
6 tasks

neonene and others added 5 commits September 17, 2024 12:45

do not exercise in ctypes
974fce3

edit this version of PyType_GetBaseByToken()
61bb346

ditto
cd750ca

📜🤖 Added by blurb_it.
ca3043f

typo
5ccf0b8

neonene closed this Sep 20, 2024

neonene deleted the bytoken branch September 20, 2024 03:41
