
@donbarbos (Contributor) commented on Feb 9, 2025

Another attempt to improve import time of stdlib modules.
Importing dataclasses takes a long time and affects many other modules, so making dataclasses import faster has a broad impact.

I use lazy importing for the 4 largest imports (re, copy, inspect, annotationlib); each of them is also referenced only once or twice.
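
For context, the change follows the standard lazy-import pattern: drop the module-level import and perform it inside the helper that needs it. A minimal sketch of the idea (illustrative only, not the PR's actual diff; the helper name and regex below are assumptions):

_MODULE_IDENTIFIER_RE = None

def _module_identifier_re():
    # Deferred import: `import dataclasses` no longer pays for importing and
    # compiling `re`; the cost moves to the first time this helper is called.
    global _MODULE_IDENTIFIER_RE
    if _MODULE_IDENTIFIER_RE is None:
        import re
        _MODULE_IDENTIFIER_RE = re.compile(r"^(?:\s*(\w+)\s*\.)?\s*(\w+)")
    return _MODULE_IDENTIFIER_RE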

CPython configure flags:

./configure --enable-optimizations --with-lto --enable-loadable-sqlite-extensions

Benchmarks:

Running: pipx install tuna && ./python -X importtime -c 'import dataclasses' 2> import.log && tuna import.log

Total import time: 0.022s -> 0.008s (~2.75x faster)

[Screenshots: tuna import-time breakdown, main branch vs PR branch]

dataclasses import time: 0.015s -> 0.001s (~15x faster)

[Screenshots: tuna breakdown of the dataclasses subtree, main branch vs PR branch]

hyperfine: 24.5 ms -> 10.2 ms (~2.4x faster)

Main branch:

$ hyperfine --warmup 11 --runs 3000 "./python -c 'import dataclasses'"
Benchmark 1: ./python -c 'import dataclasses'
  Time (mean ± σ):     24.5 ms ±  1.2 ms    [User: 21.2 ms, System: 3.3 ms]
  Range (min … max):   22.9 ms … 38.1 ms    3000 runs

PR branch:

$ hyperfine --warmup 11 --runs 3000 "./python -c 'import dataclasses'"
Benchmark 1: ./python -c 'import dataclasses'
  Time (mean ± σ):     10.2 ms ±  0.4 ms    [User: 8.3 ms, System: 1.8 ms]
  Range (min … max):    9.8 ms … 19.9 ms    3000 runs

@donbarbos changed the title from "gh-11876: Improve import time of dataclasses" to "gh-118761: Improve import time of dataclasses" on Feb 10, 2025
@donbarbos (Contributor, Author) commented:

On Trade-offs

Here are the new call chains for the deferred imports (a sketch of the copy deferral on the asdict() path is shown after the lists):

  1. import re;re.compile in _is_type()
  2. _is_type() in _process_class() and _get_field()
  3. _get_field() in _process_class()
  4. _process_class() in dataclass.wrap()

  1. import annotationlib in _process_class()
  2. _process_class() in dataclass.wrap()

  1. import inspect in _process_class() and _add_slots()
  2. _add_slots() in _process_class()
  3. _process_class() in dataclass.wrap()

  1. import copy in _asdict_inner() and _astuple_inner()
  2. _asdict_inner() in asdict()
  3. _astuple_inner() in astuple()
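
A hedged sketch of the copy deferral on the asdict()/astuple() path (assumed shape only, not the PR's exact diff; the public is_dataclass() stands in here for the module's internal check):

import dataclasses

def _asdict_inner(obj, dict_factory):
    if dataclasses.is_dataclass(obj) and not isinstance(obj, type):
        # recurse over the fields as before (unchanged by the PR)
        ...
    else:
        # Deferred import: copy.deepcopy is only needed for values the function
        # does not recurse into, so the cost is paid on the first asdict() or
        # astuple() call rather than at `import dataclasses` time.
        import copy
        return copy.deepcopy(obj)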

I also wrote benchmarks that exercise the affected public entry points (the dataclass decorator, plus the asdict() and astuple() functions):

bench_dataclass.py starts like this:

import time
import dataclasses
import os

code = """
@dataclasses.dataclass
class Address:
    city: str
    zip_code: str

@dataclasses.dataclass
class Person:
    name: str
    age: int
    addresses: list[Address] = dataclasses.field(default_factory=list)
    metadata: dict[str, str] = dataclasses.field(default_factory=dict)
"""

result_times = []
for _ in range(100):
    os.system("sync && echo 3 | sudo tee /proc/sys/vm/drop_caches > /dev/null")
    start_time = time.time()
    exec(code, {"dataclasses": dataclasses})
    end_time = time.time()
    result_times.append(end_time - start_time)

bench_asdict.py starts like this:

import time
import dataclasses
import os

@dataclasses.dataclass
class Address:
    city: str
    zip_code: str

@dataclasses.dataclass
class Person:
    name: str
    age: int
    addresses: list[Address] = dataclasses.field(default_factory=list)
    metadata: dict[str, str] = dataclasses.field(default_factory=dict)

person = Person(
    name="John Doe",
    age=30,
    addresses=[Address("New York", "10001"), Address("Los Angeles", "90001")],
    metadata={"key1": "value1", "key2": "value2"},
)

code = """dataclasses.asdict(person)"""

result_times = []
for _ in range(100):
    os.system("sync && echo 3 | sudo tee /proc/sys/vm/drop_caches > /dev/null")
    start_time = time.time()
    exec(code, {"dataclasses": dataclasses, "person": person})
    end_time = time.time()
    result_times.append(end_time - start_time)

Both scripts report statistics at the end like this:

import statistics

first = result_times[0]
mean = statistics.mean(result_times)
median = statistics.median(result_times)
stdev = statistics.stdev(result_times)
variance = statistics.variance(result_times)
print(f"First time: {first * 1_000_000:.2f}μs")
print(f"Mean: {mean * 1_000_000:.2f}μs")
print(f"Median: {median * 1_000_000:.2f}μs")
print(f"Standard deviation: {stdev * 1_000_000:.2f}μs")
print(f"Variance: {variance * 1_000_000:.2f}μs")

Results on main branch:

$ ./python -B bench_dataclass.py
First time: 1586.20μs
Mean: 1267.27μs
Median: 1243.83μs
Standard deviation: 106.15μs
Variance: 0.01μs

$ ./python -B bench_asdict.py
First time: 99.66μs
Mean: 112.33μs
Median: 109.91μs
Standard deviation: 12.26μs
Variance: 0.00μs

Results on PR branch:

$ ./python -B bench_dataclass.py
First time: 20320.18μs  # ~20 ms (0.02 s); this result is consistent for the first call
Mean: 1454.47μs
Median: 1243.11μs
Standard deviation: 1911.43μs
Variance: 3.65μs

$ ./python -B bench_asdict.py
First time: 100.37μs
Mean: 112.21μs
Median: 108.96μs
Standard deviation: 13.01μs
Variance: 0.00μs

In summary, about 20 ms is added to the first use of @dataclass in a program (when the deferred imports actually run); the median per-call time is essentially unchanged.

@eli-schwartz (Contributor) left a comment:

> I use lazy importing for the 4 largest imports (re, copy, inspect, annotationlib); each of them is also referenced only once or twice.

I don't fully understand what you mean by "rarely". It looks like much of this PR is delaying imports that are used unconditionally by any code that actually uses a dataclass. The timings measure importing the module itself, but since the primary use of this module is as a decorator, consumers will exercise it at their own import time, unlike modules that you import at the top of a file and then only use inside an if block.

So I don't think you can consider the import time of dataclasses in isolation from how it's actually used. There's no benefit to delaying imports that are used unconditionally when decorating a class (and doing so incurs the use-time cost of re-running the import machinery, which isn't major but does exist).
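
To make that concrete, a minimal illustration (hypothetical consumer module, not taken from the PR):

# consumer.py: the decorator runs while this module is being imported, so any
# imports that dataclasses deferred into its decorator machinery still execute
# at import time; the cost just moves from `import dataclasses` to the first
# module that defines a dataclass.
import dataclasses

@dataclasses.dataclass
class Point:
    x: int
    y: int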

@donbarbos (Contributor, Author) replied:

@eli-schwartz yes, but most of the PRs for this issue are also adding lazy imports.

@eli-schwartz (Contributor) replied:

I'm not sure what point you're trying to make.

Most of the PRs for this issue are adding lazy imports. Lazy imports are a useful tool for making Python programs faster, following the principle of "only pay for what you use" -- and stdlib modules often don't know what an application will in fact use.

The issue here is that, as far as I can tell, you're adding lazy imports for things that the consumer will always use, which means there won't be a benefit to making them lazy...
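
For contrast, a sketch of the case where a lazy import genuinely pays off (hypothetical example, not from this PR): the dependency sits behind a code path that many programs never take.

def dump_debug_report(obj, path):
    # Deferred import: json is only imported if a debug report is actually
    # requested, so most runs never pay for it ("only pay for what you use").
    import json
    with open(path, "w", encoding="utf-8") as f:
        json.dump(obj, f, indent=2, default=str)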

@gpshead (Member) commented:

I'm closing this one based on Eli's analysis.

@gpshead closed this on Feb 17, 2025
