Complete rewrite of the PEP, discarding the previous long explanation about

augmented assignment. Kept as short as possible, but might still be considered wordy :-)
2000-08-07 12:40:00 +00:00 · 2000-08-07 12:40:00 +00:00 · 18701cf307
parent 93d20f2bf1
commit 18701cf307
1 changed files with 181 additions and 118 deletions
--- a/pep-0203.txt
+++ b/pep-0203.txt
@ -3,10 +3,9 @@ Title: Augmented Assignments
 Version: $Revision$
 Owner: thomas@xs4all.net (Thomas Wouters)
 Python-Version: 2.0
-Status: Incomplete
+Status: Draft


-
 Introduction

    This PEP describes the `augmented assignment' proposal for Python
@ -19,133 +18,197 @@ Introduction
    definitive historical record.


-
-The Origin of Augmented Assignment
+Proposed semantics

-    Augmented assignment refers to binary operators that combine two
-    existing operators: the assignment operator, and one of the binary
-    operators. Its origins lie in other programming languages, most
-    notably `C', where it was defined for performance reasons. They
-    are meant to replace the repetetive syntax of, for instance,
-    adding the number '1' to a variable:
+    The proposed patch that adds augmented assignment to Python
+    introduces the following new operators:
    
-      x = x + 1;
-      
-    with an expression that is shorter, less error-prone and easier to
-    optimize (by the compiler):
+       += -= *= /= %= **= <<= >>= &= ^= |=
    
-      x += 1;
-      
-    The same goes for all other binary operands, resulting in the
-    following augmented assignment operator list, based on Python's
-    current binary operator list:
-
-      +=, -=, /=, *=, %=, **=, >>=, <<=, &=, |=, ^=
+    They implement the same operator as their normal binary form, with
+    the exception that the operation is done `in-place' whenever
+    possible.
    
-    See the documentation of each operator on what they do.
-
-     
-
-Augmented Assignment in Python
-
-    The traditional reasons for augmented assignment, readability and
-    optimization, are not as obvious in Python, for several reasons.
+    They truly behave as augmented assignment, in that they perform
+    all of the normal load and store operations, in addition to the
+    binary operation they are intended to do. So, given the expression:
    
-     - Numbers are immutable, they cannot be changed. In other
-       programming languages, a variable holds a value, and altering
-       the variable changes the value it holds. In Python, variables
-       hold `references' to values, and altering an immutable value
-       means changing the variable, not what it points to.
-
-     - Assignment is a different operation in Python. In most
-       languages, variables are containers, and assignment copies a
-       value into that container. In Python, assignment binds a value
-       to a name, it does not copy the value into a new storage space.
-       
-     - The augmented assignment operators map fairly directly into the
-       underlying hardware. Python does not deal directly with the
-       hardware it runs on, so this `natural inclusion' does not make
-       sense.
-
-     - The augmented assigment syntax is subtly different in more
-       complex expressions. What to do, for instance, in a case such
-       as this:
-       
-       seq[i:calc(seq, i)] *= r
-       
-       It is unclear whether 'seq' gets indexed once or twice, and
-       whether 'calc' gets called once or twice.
-
-
-
-Normal operators
-
-    There are, however, good reasons to include augented assignment. 
-    One of these has to do with Python's way of handling operators. In
-    Python, a user defined class can implement one or more of the
-    binary operators by supplying a 'magic' method name. For instance,
-    for a class to support '<instance> + <object>', the '__add__'
-    method should be defined. This method should return a new object,
-    which is the result of the expression.
+       x += y
    
-    For the case of '<object> + <instance>', where 'object' does not
-    have an '__add__' method, the class can define a '__radd__'
-    method, which then should behave exactly as '__add__'. Indeed,
-    '__radd__' is often a different name for the same method.
-    
-    For C extention types, a similar technique is available, through
-    the PyNumberMethods and PySequenceMethods members of the PyType
-    structure.
+    The object `x' is loaded, then added with 1, and the resulting
+    object is stored back in the original place. The precise action
+    performed on the two arguments depends on the type of `x', and
+    possibly of `y'.

-    However, the problem with this approach is that the '__add__'
-    method cannot know in what context it is called. It cannot tell
-    whether it should create a new object, or whether it is allowed to
-    modify itself. (As would be the case in 'x = x + 1') As a result,
-    the '__add__' method, and all other such 'magic' methods, should
-    always return a new object. For large objects, this can be very
-    inefficient.
-    
-    This inefficiency is often solved by adding a method that does the
-    appropriate modification 'in-place'. List objects, for instance,
-    have the 'extend' method that behaves exactly as the '+' operator,
-    except the operation is done on the list itself, instead of on a
-    copy.
+    The idea behind augmented assignment in Python is that it isn't
+    just an easier way to write the common practice of storing the
+    result of a binary operation in its left-hand operand, but also a
+    way for the left-hand operand in question to know that it should
+    operate 'on itself', rather than creating a modified copy of
+    itself.

-    The augmented assignment syntax can support this behaviour
-    explicitly. When the magic method for 'in-place' operation are
-    missing, it can fall back to the normal methods for that
-    operation, maintaining full backward compatibility even when
-    mixing the new syntax with old objects.
+    To make this possible, a number of new `hooks' are added to Python
+    classes and C extention types, which are called when the object in
+    question is used as the left hand side of an augmented assignment
+    operation. If the class or type does not implement the `in-place'
+    hooks, the normal hooks for the particular binary operation are
+    used.
+    
+    So, given an instance object `x', the expression
+    
+        x += y
+    
+    tries to call x.__add_ab__(y), which is the 'in-place' variant of
+    __add__. If __add_ab__ is not present, x.__add__(y) is
+    attempted, and finally y.__radd__(x) if __add__ is missing too. 
+    There is no `right-hand-side' variant of __add_ab__, because that
+    would require for `y' to know how to in-place modify `x', which is
+    an unsafe assumption. The __add_ab__ hook should behave exactly
+    like __add__, returning the result of the operation (which could
+    be `self') which is to be stored in the variable `x'.
+ 
+    For C extention types, the `hooks' are members of the
+    PyNumberMethods and PySequenceMethods structures, and are called
+    in exactly the same manner as the existing non-inplace operations,
+    including argument coercion. C methods should also take care to
+    return a new reference to the result object, whether it's the same
+    object or a new one. So if the original object is returned, it
+    should be INCREF()'d appropriately.

-    The other benifit of augmented assignment is readability. After
-    the general concept of augmented assignment is grasped, all the
-    augmented assigment operators instantly become obvious. There is
-    no need for non-obvious and non-standard method names to implement
-    efficient, in-place operations, and there is no need to check the
-    type of an object before operating on it: the augmented assignment
-    will work for all types that implement that basic operation, not
-    merely those that implement the augmented variant.
-    
-    And the last problem with augmented assignment, what to do with
-    indexes and function calls in the expression, can be solved in a
-    very Pythonic manner: if it looks like it's only called once, it
-    *is* only called once. Taking this expression:
-    
-    seq[func(x)] += x
-    
-    The function 'func' is called once, and 'seq' is indexed twice:
-    once to retrieve the value (__getitem__), and once to store it
-    (__setitem__). So the expression can be rewritten as:
-    
-    tmp = func(x)
-    seq[tmp] = seq[tmp] + x
-    
-    The augmented assignment form of this expression is much more
-    readable.
-    

-
+New methods

+    The proposed implementation adds the following 11 possible `hooks'
+    which Python classes can implement to overload the augmented
+    assignment operations:
+    
+        __add_ab__
+        __sub_ab__
+        __mul_ab__
+        __div_ab__
+        __mod_ab__
+        __pow_ab__
+        __lshift_ab__
+        __rshift_ab__
+        __and_ab__
+        __xor_ab__
+        __or_ab__
+    
+    The `__add_ab__' name is one proposed by Guido[1], and stands for `and
+    becomes'. Other proposed names include '__iadd__', `__add_in__'
+    `__inplace_add__'
+
+    For C extention types, the following struct members are added:
+    
+    To PyNumberMethods:
+        binaryfunc nb_inplace_add;
+        binaryfunc nb_inplace_subtract;
+        binaryfunc nb_inplace_multiply;
+        binaryfunc nb_inplace_divide;
+        binaryfunc nb_inplace_remainder;
+        binaryfunc nb_inplace_power;
+        binaryfunc nb_inplace_lshift;
+        binaryfunc nb_inplace_rshift;
+        binaryfunc nb_inplace_and;
+        binaryfunc nb_inplace_xor;
+        binaryfunc nb_inplace_or;
+
+    To PySequenceMethods:
+        binaryfunc sq_inplace_concat;
+        intargfunc sq_inplace_repeat;
+
+    In order to keep binary compatibility, the tp_flags TypeObject
+    member is used to determine whether the TypeObject in question has
+    allocated room for these slots. Until a clean break in binary
+    compatibility is made (which may or may not happen before 2.0)
+    code that wants to use one of the new struct members must first
+    check that they are available with the 'PyType_HasFeature()' macro:
+    
+    if (PyType_HasFeature(x->ob_type, Py_TPFLAGS_HAVE_INPLACE_OPS) &&
+        x->ob_type->tp_as_number && x->ob_type->tp_as_number->nb_inplace_add) {
+            /* ... */
+
+    This check must be made even before testing the method slots for
+    NULL values! The macro only tests whether the slots are available,
+    not whether they are filled with methods or not.
+
+
+Implementation
+
+    The current implementation of augmented assignment[2] adds, in
+    addition to the methods and slots alread covered, 13 new bytecodes
+    and 13 new API functions.
+    
+    The API functions are simply in-place versions of the current
+    binary-operation API functions:
+    
+        PyNumber_InPlaceAdd(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceSubtract(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceMultiply(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceDivide(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceRemainder(PyObject *o1, PyObject *o2);
+        PyNumber_InPlacePower(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceLshift(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceRshift(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceAnd(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceXor(PyObject *o1, PyObject *o2);
+        PyNumber_InPlaceOr(PyObject *o1, PyObject *o2);
+        PySequence_InPlaceConcat(PyObject *o1, PyObject *o2);
+        PySequence_InPlaceRepeat(PyObject *o, int count);
+
+    They call either the Python class hooks (if either of the objects
+    is a Python class instance) or the C type's number or sequence
+    methods.
+
+    The new bytecodes are:
+        INPLACE_ADD
+        INPLACE_SUBTRACT
+        INPLACE_MULTIPLY
+        INPLACE_DIVIDE
+        INPLACE_REMAINDER
+        INPLACE_POWER
+        INPLACE_LEFTSHIFT
+        INPLACE_RIGHTSHIFT
+        INPLACE_AND
+        INPLACE_XOR
+        INPLACE_OR
+        ROT_FOUR
+        DUP_TOPX
+    
+    The INPLACE_* bytecodes mirror the BINARY_* bytecodes, except that
+    they are implemented as calls to the 'InPlace' API functions. The
+    other two bytecodes are 'utility' bytecodes: ROT_FOUR behaves like
+    ROT_THREE except that the four topmost stack items are rotated.
+    
+    DUP_TOPX is a bytecode that takes a single argument, which should
+    be an integer between 1 and 5 (inclusive) which is the number of
+    items to duplicate in one block. Given a stack like this (where
+    the left side of the list is the 'top' of the stack):
+
+        [a, b, c, d, e, f, g]
+    
+    "DUP_TOPX 3" would duplicate the top 3 items, resulting in this
+    stack:
+    
+        [a, b, c, d, e, f, g, e, f, g]
+
+    DUP_TOPX with an argument of 1 is the same as DUP_TOP. The limit
+    of 5 is purely an implementation limit. The implementation of
+    augmented assignment requires only DUP_TOPX with an argument of 2
+    and 3, and could do without this new opcode at the cost of a fair
+    number of DUP_TOP and ROT_*.
+
+
+Copyright
+
+    This document has been placed in the public domain.
+
+
+References
+
+    [1] http://www.python.org/pipermail/python-list/2000-June/059556.html
+    [2]
+http://sourceforge.net/patch?func=detailpatch&patch_id=100699&group_id=5470