Changing CPython’s Grammar¶
Abstract¶
There’s more to changing Python’s grammar than editing
Grammar/python.gram. Here’s a checklist.
备注
These instructions are for Python 3.9 and beyond. Earlier versions use a different parser technology. You probably shouldn’t try to change the grammar of earlier Python versions, but if you really want to, use GitHub to track down the earlier version of this file in the devguide.
For more information on how to use the new parser, check the section on how to use CPython’s parser.
Checklist¶
Note: sometimes things mysteriously don’t work. Before giving up, try make clean.
Grammar/python.gram: The grammar, with actions that build AST nodes. After changing it, runmake regen-pegen(orbuild.bat --regenon Windows), to regenerateParser/parser.c. (This runs Python’s parser generator,Tools/peg_generator).Grammar/Tokensis a place for adding new token types. After changing it, runmake regen-tokento regenerateInclude/token.h,Parser/token.c,Lib/token.pyandDoc/library/token-list.inc. If you change bothpython.gramandTokens, runmake regen-tokenbeforemake regen-pegen. On Windows,build.bat --regenwill regenerate both at the same time.Parser/Python.asdlmay need changes to match the grammar. Then runmake regen-astto regenerateInclude/Python-ast.handPython/Python-ast.c.Parser/tokenizer.ccontains the tokenization code. This is where you would add a new type of comment or string literal, for example.Python/ast.cwill need changes to validate AST objects involved with the grammar change.Python/ast_unparse.cwill need changes to unparse AST objects involved with the grammar change (“unparsing” is used to turn annotations into strings per PEP 563).The Design of CPython’s Compiler has its own page.
_Unparserin theLib/ast.pyfile may need changes to accommodate any modifications in the AST nodes.Doc/library/ast.rstmay need to be updated to reflect changes to AST nodes.Add some usage of your new syntax to
test_grammar.py.Certain changes may require tweaks to the library module
pyclbr.Lib/tokenize.pyneeds changes to match changes to the tokenizer.Documentation must be written! Specifically, one or more of the pages in
Doc/reference/will need to be updated.