this post was submitted on 06 Aug 2023
23 points (100.0% liked)

Python

3222 readers
1 users here now

News and discussions about the programming language Python


founded 5 years ago
MODERATORS
 

Meta is dedicating 3 engineers to get the nogil patches into cpython. There are some other companies stepping up as well. This is huge this is the closest we have ever been to solving the issue of the GIL.

all 13 comments
sorted by: hot top controversial new old
[–] teraflopsweat@lemm.ee 7 points 1 year ago (4 children)

As a filthy casual, could anyone give me a link or brief summary as to why the GIL should/shouldn’t go away?

[–] mvirts@lemmy.world 7 points 1 year ago (1 children)

Basically there is no parallel execution of python code within a single process as long as the global interpreter lock exists. It prevents more than one python thread from running. However, many of the major libraries used in python call out to libraries that can and do release the gil to run native code in parallel.

[–] kale@lemmy.zip 5 points 1 year ago* (last edited 1 year ago) (1 children)

The GIL only executes one thread at a time. A python program can be multithreaded, but only only thread runs in CPython at a time. If one thread does a system call (like copying a file), then when the python thread is sleeping, the system call can still run in the OS, so there are situations where multithreading can speed up Python programs, even running one thread at a time.

You can run multiple instances of CPython, which is called multiprocessing, and each instance will run one python thread at a time. With different memory space, so all process communication has to be handled manually (afaik, by definition, threads share the same memory space, processes do not).

Any library calls not written in Python don't run in the interpreter, so most common critical things aren't limited too badly. For example, I install a NumPy and SciPy library which are compiled against Intel's MKL library. Any NumPy operations execute in MKL, not the Python interpreter, so are almost as fast as writing the program in C and compiling against MKL myself. And I can write Python and NumPy code about 10x faster than C/MKL. And if I'm on a computer that doesn't have MKL, I can install a different NumPy library and it will execute just fine without changing the code.

There's a book called "high performance Python" that helped me figure out a lot of this.

Edit: thought I was posting on the grandparents post instead of the parent post. Sorry.

[–] mvirts@lemmy.world 1 points 1 year ago

No worries!

[–] verdare@beehaw.org 3 points 1 year ago

This proposal was linked in the page above, and I think the first few sentences do a pretty good job of summarizing the problem.

[–] anonono@lemmy.world 3 points 1 year ago

a simple explanation but not 100% correct is that even if your code is made to run in parallel using threads, it will never use more than 1 core in your computer.

getting rid of the GIL will let it use all the cores in the processor.

the multiprocessing module "solved" this problem by forking processes instead of threads, but it's not ideal for a lot of workloads.

[–] Dogeek@sh.itjust.works 2 points 1 year ago

The GIL is a thread lock. It prevents threads from accessing the same memory space, eliminating race conditions and more importantly, keeping the reference counters correct so that the python garbage collector can correctly free memory (avoiding leaks)

[–] verdare@beehaw.org 5 points 1 year ago

For the amount of multiprocess code I’ve written in Python, I’m embarrassed that this is the first I’m hearing about the noGIL build. Really excited to see this feature become the default, though it looks like it will take a while. GIL has definitely been a pain in the ass for fully utilizing multi-core systems.

[–] doot@social.bug.expert 4 points 1 year ago

can't wait to try it out

[–] mvirts@lemmy.world 2 points 1 year ago