Skip to content

Conversation

@tomasr8
Copy link
Member

@tomasr8tomasr8 commented Nov 11, 2023

Still a work in progress but I'd be happy for feedback ;)

I'm not entirely sure what to do about HASHLIB_GIL_MINSIZE. IIUC the original code avoids releasing the GIL and getting a separate lock when the update buffer is short enough. I'm guessing that in the nogil version, we'll have to lock the mutex regardless of the length (i.e. essentially getting rid of the macro)? Is that correct?

As a side note, I'm failing to build this PR with the GIL enabled. Running ./configure --with-pydebug and make -s -j2, I'm getting this error:

Infileincludedfrom ./Modules/hashlib.h:7, from ./Modules/md5module.c:29: ./Include/internal/pycore_lock.h: InfunctionPyMutex_LockFast’: ./Include/internal/pycore_lock.h:60:12: error: implicitdeclarationoffunction_Py_atomic_compare_exchange_uint8’; didyoumean__atomic_compare_exchange_n’? [-Werror=implicit-function-declaration] 60 | return_Py_atomic_compare_exchange_uint8(lock_bits, &expected, _Py_LOCKED);

Even if I add #include "pyatomic.h" to pycore_lock.h I get the same error.. any idea why that is?

Fixes#111916

Copy link
Contributor

@colesburycolesbury left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The build errors you're seeing are due to md5module.c being compiled with the limited API, which means internal-only functions like PyMutex_Lock are not available. You should remove these lines from md5module.c:

#ifndef _MSC_VER #include "pyconfig.h" // Py_NOGIL #endif #ifndef Py_NOGIL // Need limited C API version 3.12 for Py_MOD_PER_INTERPRETER_GIL_SUPPORTED #define Py_LIMITED_API 0x030c0000 #endif 

And instead add:

#ifndef Py_BUILD_CORE_BUILTIN # define Py_BUILD_CORE_MODULE 1 #endif 

@colesbury
Copy link
Contributor

(I edited my review comment to note that you also should add Py_BUILD_CORE_MODULE to md5module.c)

@tomasr8tomasr8 marked this pull request as ready for review November 14, 2023 21:13
@tomasr8tomasr8 requested a review from tiran as a code ownerNovember 14, 2023 21:13
@colesbury
Copy link
Contributor

It looks like some of the generated parts of Modules/_blake2/blake2s_impl.c are out of date. You'll need to re-run Argument Clinic. I think python3 Tools/clinic/clinic.py --make --exclude Lib/test/clinic.test.c --srcdir . should do the trick.

@tomasr8
Copy link
MemberAuthor

Thanks for the thorough review! I reran AC as you suggested, so hopefully it should be good this time

Copy link
Contributor

@colesburycolesbury left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks, @tomasr8! This LGTM.

@gpshead - would you be able to review this PR? (I saw your name on the git log for these files.)

Copy link
Member

@gpsheadgpshead left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

one minor style nit, overall looks great. thanks!

@gpshead
Copy link
Member

the original code avoids releasing the GIL and getting a separate lock when the update buffer is short enough. I'm guessing that in the nogil version, we'll have to lock the mutex regardless of the length

Yep, the old HASHLIB_GIL_MINSIZE logic doesn't make sense in a free threaded build - always locking as this PR does works there. This PR could probably elide the concept from the code entirely during a free-threaded build, but that'd just makes the maintenance harder with additional ifdefs. For little value. It'll be an always-taken branch because use_mutex = true in free-threaded builds so the size will never get checked. Good enough - it won't have a measurable performance impact.

@gpsheadgpshead enabled auto-merge (squash) November 15, 2023 23:37
@gpsheadgpshead merged commit a646560 into python:mainNov 15, 2023
@tomasr8tomasr8 deleted the hashlib branch November 16, 2023 07:05
aisk pushed a commit to aisk/cpython that referenced this pull request Feb 11, 2024
… GIL (python#111981) Always use an individual lock on hash objects when in free-threaded builds. Fixespython#111916
Glyphack pushed a commit to Glyphack/cpython that referenced this pull request Sep 2, 2024
… GIL (python#111981) Always use an individual lock on hash objects when in free-threaded builds. Fixespython#111916
Sign up for freeto join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Make hashlib related modules thread-safe without the GIL

3 participants

@tomasr8@colesbury@gpshead