
Conversation

@methane (Member) commented Sep 1, 2024

Windows:

The current buffer size has been 8 KiB since multiprocessing was introduced.
That seems small for modern Python workloads.

e711caf#diff-2c54a007d7fe1d9ac5ca008fe2d054394c39a4f521eea2cb580101a284d7b7ecR28

macOS/BSD:

They use a 64 KiB buffer for pipes. The current 16-page (256 KiB) buffer causes a ~10% slowdown compared to 64 KiB on an M1 Mac.

Linux:

I don't have a Linux machine with 16k/64k pages. But when I change the pipe buffer size via fcntl, a 256 KiB buffer doesn't provide a notable performance benefit.

64 KiB seems like a good default buffer size.
If it is not suitable, users can try another size by changing `multiprocessing.connection.BUFSIZE`.
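As a rough sketch of that tuning knob: `BUFSIZE` is an internal constant of `multiprocessing.connection`, not a documented public API, so overriding it is an at-your-own-risk experiment rather than a supported configuration.

```python
import multiprocessing.connection as mpc

# BUFSIZE is an internal constant used when multiprocessing copies data
# to and from pipe connections. It is not a documented public API, so
# treat rebinding it as an at-your-own-risk tuning knob; override it
# before creating any Pipe/Connection objects.
mpc.BUFSIZE = 64 * 1024  # try a 64 KiB buffer
```

Because it is module-level state, the override affects every connection created afterwards in that process.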

@methane (Member, Author) commented:

Previous discussion:
#121315 (comment)

quick bench code:
https://gist.github.com/methane/a6cb799704a11f2bb2f64b16c0b830cc

64k vs 256k quick bench:

```
$ time python hello.py  # 64k
real 0m25.999s  user 0m9.753s  sys 0m29.687s
real 0m25.964s  user 0m9.553s  sys 0m29.854s
$ time python hello.py  # 256k
real 0m25.927s  user 0m9.421s  sys 0m30.108s
real 0m25.933s  user 0m9.330s  sys 0m29.958s
```
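The linked gist contains the actual benchmark; as a minimal sketch of the same kind of measurement, the following pushes a fixed amount of data through an `os.pipe()` from a writer thread and times it (all names here are illustrative, not from the gist):

```python
import os
import threading
import time

def write_all(fd, data):
    # os.write() may write fewer bytes than requested for writes larger
    # than the pipe buffer, so loop until everything is written.
    view = memoryview(data)
    while view:
        n = os.write(fd, view)
        view = view[n:]

def pipe_throughput(total=64 * 1024 * 1024, chunk=64 * 1024):
    """Push `total` bytes through an os.pipe() and time it."""
    r, w = os.pipe()
    payload = b"x" * chunk

    def writer():
        for _ in range(total // chunk):
            write_all(w, payload)
        os.close(w)  # signal EOF to the reader

    t = threading.Thread(target=writer)
    t0 = time.perf_counter()
    t.start()
    received = 0
    while True:
        data = os.read(r, chunk)
        if not data:  # EOF: writer closed its end
            break
        received += len(data)
    t.join()
    os.close(r)
    return received, time.perf_counter() - t0
```

Varying `chunk` (and, on Linux, the pipe buffer size via fcntl) lets you compare configurations the way the 64k vs 256k runs above do.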

How to change the pipe buffer size:

```diff
diff --git a/Lib/multiprocessing/connection.py b/Lib/multiprocessing/connection.py
index ede5596e4fc..9c1bfb93728 100644
--- a/Lib/multiprocessing/connection.py
+++ b/Lib/multiprocessing/connection.py
@@ -544,7 +544,10 @@ def Pipe(duplex=True):
         c1 = Connection(s1.detach())
         c2 = Connection(s2.detach())
     else:
+        import fcntl
         fd1, fd2 = os.pipe()
+        fcntl.fcntl(fd1, fcntl.F_SETPIPE_SZ, 256*1024)
+        fcntl.fcntl(fd2, fcntl.F_SETPIPE_SZ, 256*1024)
         c1 = Connection(fd1, writable=False)
         c2 = Connection(fd2, readable=False)
```

I confirmed that this setting works by checking the `n = len(chunk)` size.

So I don't think a Linux-only pipe optimization is worthwhile.
A static 64 KiB size seems good enough.

@picnixz (Member) left a comment:

A really small nitpick, but otherwise LGTM.

@vstinner (Member) left a comment:

LGTM

@methane (Member, Author) commented:

I tested this PR on Windows, but there was no speedup. That is because on Windows all of the remaining data can be read at once via this function:

`def _get_more_data(self, ov, maxsize):`



4 participants: @methane, @gpshead, @vstinner, @picnixz