GH-98831: Remove super-instruction definitions, use macro instructions instead (#100124)
Conversation
gvanrossum commented Dec 8, 2022 • edited
brandtbucher commented Dec 8, 2022
Not gonna lie, the PR title scared me.
(force-pushed 6706827 to eadca51)
gvanrossum commented Dec 9, 2022
Clearly I should use more provocative PR titles more often. :-)
#define COMPARE_OP_FLOAT 1003
#define COMPARE_OP_INT 1004
#define COMPARE_OP_STR 1005
#define JUMP_IF 1006
Should these just be zeroes, too? Or even -1, just to emphasize that they aren't real?
Yeah, they should all be zeros. Since the value is never used I'm not sure what the point of -1 would be.
TARGET(LOAD_FAST__LOAD_FAST) {
    PyObject *_tmp_1;
    PyObject *_tmp_2;
    {
Random idea... any chance we could annotate the output of macros to make it easier to see what the different parts correspond to? So, like, this line would be:
{  // LOAD_FAST
And we would have `{  // JOIN` and another `{  // LOAD_FAST` below? Not sure how hard this is.
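The annotation idea above could be sketched like this (all names here — `Component`, `emit_macro` — are invented for illustration and are not part of the real cases generator):

```python
# Sketch: a code generator that labels each macro component's block
# with a comment naming the component it came from.
from dataclasses import dataclass


@dataclass
class Component:
    name: str        # e.g. "LOAD_FAST" or "JOIN"
    body: list[str]  # lines of generated C code for this component


def emit_macro(components: list[Component]) -> str:
    """Emit the macro expansion, tagging each component's braces."""
    out = []
    for comp in components:
        out.append(f"{{  // {comp.name}")
        out.extend(f"    {line}" for line in comp.body)
        out.append("}")
    return "\n".join(out)


expansion = emit_macro([
    Component("LOAD_FAST", ["value = GETLOCAL(oparg);"]),
    Component("JOIN", ["oparg = word;"]),
    Component("LOAD_FAST", ["value = GETLOCAL(oparg);"]),
])
print(expansion)
```

With this, a reader of the generated interpreter can see at a glance where each component of `LOAD_FAST__LOAD_FAST` begins.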
// BEGIN BYTECODES //
op(JOIN, (word/1 --)) {
Hmmm. It feels weird to treat the next instruction as a cache entry.
Why don't all macro components generate code to reload the incremented opcodes/opargs unconditionally? That way we wouldn't need JOIN at all. Or are we worried the compiler can't optimize it out when it's unused (or that garbage values could be too confusing)? Or does it mess up something else?
Perhaps there's a clean way of expressing in the DSL whether an op corresponds to a real instruction or not? Then we can just reload the opcode/oparg if that's true, and get rid of JOIN.
Hmmm. It feels weird to treat the next instruction as a cache entry.
It's only weird if you think of the instruction stream as alternating instructions and cache entries. In the discussion about registers it was already proposed to use a "cache" entry to encode the operation (ADD, MUL, etc.) for BINARY_OP, so it's really just a stream of variable-length instructions. But yeah, this op is definitely full of internal hackery.
Why don't all macro components generate code to reload the incremented opcodes/opargs unconditionally? That way we wouldn't need JOIN at all. Or are we worried the compiler can't optimize it out when it's unused (or that garbage values could be too confusing)? Or does it mess up something else?
I worry more about where the next oparg would be loaded from. The code generator makes assumptions about where next_instr points (for macros it keeps pointing just past the original instruction until the end; for supers it gets bumped for each component).
Perhaps there's a clean way of expressing in the DSL whether an op corresponds to a real instruction or not? Then we can just reload the opcode/oparg if that's true, and get rid of JOIN.
That's not a bad idea -- because we actually have that way: inst vs op. The main downside would then be that there would no longer be a clue in the macro whether it defines a super or not.
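The "stream of variable-length instructions" view discussed above can be illustrated with a toy model (this is illustrative Python, not CPython's actual interpreter): the combined instruction reads the *following* instruction word for its second oparg, exactly as if that word were a one-word cache entry.

```python
# Toy model of LOAD_FAST__LOAD_FAST as LOAD_FAST + JOIN + LOAD_FAST.
# Each stream entry is one "instruction word": an (opcode, oparg) pair.
def run_load_fast_load_fast(stream, pc, frame_locals, stack):
    _opcode, oparg = stream[pc]              # the macro instruction itself
    stack.append(frame_locals[oparg])        # first LOAD_FAST component
    # JOIN: consume the next instruction word as if it were a cache
    # entry, taking its oparg for the second component.
    _next_opcode, next_oparg = stream[pc + 1]
    stack.append(frame_locals[next_oparg])   # second LOAD_FAST component
    return pc + 2                            # next_instr ends up past both words


stream = [("LOAD_FAST__LOAD_FAST", 0), ("LOAD_FAST", 1)]
stack = []
pc = run_load_fast_load_fast(stream, 0, ["a", "b"], stack)
print(stack, pc)
```

The second stream entry is still a well-formed instruction, so an interpreter that deoptimizes the combined form can fall through to executing it normally.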
if (tkn := self.expect(lx.IDENTIFIER)) and tkn.text == "super":
    if self.expect(lx.LPAREN):
        if tkn := self.expect(lx.IDENTIFIER):
            if self.expect(lx.RPAREN):
                if self.expect(lx.EQUALS):
                    if ops := self.ops():
Kill it with fire! ;)
gvanrossum left a comment
I'm feeling a lot of hesitation about this PR. Maybe we should just leave things as they were and focus on other work (e.g. finish converting more instructions to the explicit stack/cache effects form, and adding arrays).
gvanrossum commented Dec 12, 2022
Off-line we decided not to do this. Super-instructions may eventually disappear (when we have a register VM).
Replace all super-instructions with macros, using a special JOIN op to extract the next oparg. JOIN has a cache effect of one word that is equivalent to bumping next_instr.
Rip out all code for parsing and generating super-instructions.
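Based on that summary, the DSL-level change this PR proposed would have looked roughly like the following (a sketch reconstructed from the snippets quoted in the review, not the exact diff):

```c
// Old form: a dedicated super-instruction declaration.
super(LOAD_FAST__LOAD_FAST) = LOAD_FAST + LOAD_FAST;

// Proposed form: an ordinary macro. JOIN's one-word cache effect
// consumes the next instruction word (equivalent to bumping
// next_instr) and makes its oparg available to the second component.
op(JOIN, (word/1 --)) {
    oparg = word;
}
macro(LOAD_FAST__LOAD_FAST) = LOAD_FAST + JOIN + LOAD_FAST;
```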