Merge.
This commit is contained in:
commit
297ca2977e
299
pep-0008.txt
299
pep-0008.txt
|
@ -3,12 +3,13 @@ Title: Style Guide for Python Code
|
||||||
Version: $Revision$
|
Version: $Revision$
|
||||||
Last-Modified: $Date$
|
Last-Modified: $Date$
|
||||||
Author: Guido van Rossum <guido@python.org>,
|
Author: Guido van Rossum <guido@python.org>,
|
||||||
Barry Warsaw <barry@python.org>
|
Barry Warsaw <barry@python.org>,
|
||||||
|
Nick Coghlan <ncoghlan@gmail.com>
|
||||||
Status: Active
|
Status: Active
|
||||||
Type: Process
|
Type: Process
|
||||||
Content-Type: text/x-rst
|
Content-Type: text/x-rst
|
||||||
Created: 05-Jul-2001
|
Created: 05-Jul-2001
|
||||||
Post-History: 05-Jul-2001
|
Post-History: 05-Jul-2001, 01-Aug-2013
|
||||||
|
|
||||||
|
|
||||||
Introduction
|
Introduction
|
||||||
|
@ -23,6 +24,13 @@ This document and PEP 257 (Docstring Conventions) were adapted from
|
||||||
Guido's original Python Style Guide essay, with some additions from
|
Guido's original Python Style Guide essay, with some additions from
|
||||||
Barry's style guide [2]_.
|
Barry's style guide [2]_.
|
||||||
|
|
||||||
|
This style guide evolves over time as additional conventions are
|
||||||
|
identified and past conventions are rendered obsolete by changes in
|
||||||
|
the language itself.
|
||||||
|
|
||||||
|
Many projects have their own coding style guidelines. In the event of any
|
||||||
|
conflicts, such project-specific guides take precedence for that project.
|
||||||
|
|
||||||
|
|
||||||
A Foolish Consistency is the Hobgoblin of Little Minds
|
A Foolish Consistency is the Hobgoblin of Little Minds
|
||||||
======================================================
|
======================================================
|
||||||
|
@ -41,15 +49,24 @@ style guide just doesn't apply. When in doubt, use your best
|
||||||
judgment. Look at other examples and decide what looks best. And
|
judgment. Look at other examples and decide what looks best. And
|
||||||
don't hesitate to ask!
|
don't hesitate to ask!
|
||||||
|
|
||||||
Two good reasons to break a particular rule:
|
In particular: do not break backwards compatibility just to comply with
|
||||||
|
this PEP!
|
||||||
|
|
||||||
1. When applying the rule would make the code less readable, even for
|
Some other good reasons to ignore a particular guideline:
|
||||||
someone who is used to reading code that follows the rules.
|
|
||||||
|
1. When applying the guideline would make the code less readable, even
|
||||||
|
for someone who is used to reading code that follows this PEP.
|
||||||
|
|
||||||
2. To be consistent with surrounding code that also breaks it (maybe
|
2. To be consistent with surrounding code that also breaks it (maybe
|
||||||
for historic reasons) -- although this is also an opportunity to
|
for historic reasons) -- although this is also an opportunity to
|
||||||
clean up someone else's mess (in true XP style).
|
clean up someone else's mess (in true XP style).
|
||||||
|
|
||||||
|
3. Because the code in question predates the introduction of the
|
||||||
|
guideline and there is no other reason to be modifying that code.
|
||||||
|
|
||||||
|
4. When the code needs to remain compatible with older versions of
|
||||||
|
Python that don't support the feature recommended by the style guide.
|
||||||
|
|
||||||
|
|
||||||
Code lay-out
|
Code lay-out
|
||||||
============
|
============
|
||||||
|
@ -59,9 +76,6 @@ Indentation
|
||||||
|
|
||||||
Use 4 spaces per indentation level.
|
Use 4 spaces per indentation level.
|
||||||
|
|
||||||
For really old code that you don't want to mess up, you can continue
|
|
||||||
to use 8-space tabs.
|
|
||||||
|
|
||||||
Continuation lines should align wrapped elements either vertically
|
Continuation lines should align wrapped elements either vertically
|
||||||
using Python's implicit line joining inside parentheses, brackets and
|
using Python's implicit line joining inside parentheses, brackets and
|
||||||
braces, or using a hanging indent. When using a hanging indent the
|
braces, or using a hanging indent. When using a hanging indent the
|
||||||
|
@ -101,7 +115,8 @@ Optional::
|
||||||
var_three, var_four)
|
var_three, var_four)
|
||||||
|
|
||||||
The closing brace/bracket/parenthesis on multi-line constructs may
|
The closing brace/bracket/parenthesis on multi-line constructs may
|
||||||
either line up under the last item of the list, as in::
|
either line up under the first non-whitespace character of the last
|
||||||
|
line of list, as in::
|
||||||
|
|
||||||
my_list = [
|
my_list = [
|
||||||
1, 2, 3,
|
1, 2, 3,
|
||||||
|
@ -128,47 +143,78 @@ starts the multi-line construct, as in::
|
||||||
Tabs or Spaces?
|
Tabs or Spaces?
|
||||||
---------------
|
---------------
|
||||||
|
|
||||||
Never mix tabs and spaces.
|
Spaces are the preferred indentation method.
|
||||||
|
|
||||||
The most popular way of indenting Python is with spaces only. The
|
Tabs should be used solely to remain consistent with code that is
|
||||||
second-most popular way is with tabs only. Code indented with a
|
already indented with tabs.
|
||||||
mixture of tabs and spaces should be converted to using spaces
|
|
||||||
exclusively. When invoking the Python command line interpreter with
|
Python 3 disallows mixing the use of tabs and spaces for indentation.
|
||||||
|
|
||||||
|
Python 2 code indented with a mixture of tabs and spaces should be
|
||||||
|
converted to using spaces exclusively.
|
||||||
|
|
||||||
|
When invoking the Python 2 command line interpreter with
|
||||||
the ``-t`` option, it issues warnings about code that illegally mixes
|
the ``-t`` option, it issues warnings about code that illegally mixes
|
||||||
tabs and spaces. When using ``-tt`` these warnings become errors.
|
tabs and spaces. When using ``-tt`` these warnings become errors.
|
||||||
These options are highly recommended!
|
These options are highly recommended!
|
||||||
|
|
||||||
For new projects, spaces-only are strongly recommended over tabs.
|
|
||||||
Most editors have features that make this easy to do.
|
|
||||||
|
|
||||||
Maximum Line Length
|
Maximum Line Length
|
||||||
-------------------
|
-------------------
|
||||||
|
|
||||||
Limit all lines to a maximum of 79 characters.
|
Limit all lines to a maximum of 79 characters.
|
||||||
|
|
||||||
There are still many devices around that are limited to 80 character
|
For flowing long blocks of text with fewer structural restrictions
|
||||||
lines; plus, limiting windows to 80 characters makes it possible to
|
(docstrings or comments), the line length should be limited to 72
|
||||||
have several windows side-by-side. The default wrapping on such
|
characters.
|
||||||
devices disrupts the visual structure of the code, making it more
|
|
||||||
difficult to understand. Therefore, please limit all lines to a
|
Limiting the required editor window width makes it possible to have
|
||||||
maximum of 79 characters. For flowing long blocks of text (docstrings
|
several files open side-by-side, and works well when using code
|
||||||
or comments), limiting the length to 72 characters is recommended.
|
review tools that present the two versions in adjacent columns.
|
||||||
|
|
||||||
|
The default wrapping in most tools disrupts the visual structure of the
|
||||||
|
code, making it more difficult to understand. The limits are chosen to
|
||||||
|
avoid wrapping in editors with the window width set to 80, even
|
||||||
|
if the tool places a marker glyph in the final column when wrapping
|
||||||
|
lines. Some web based tools may not offer dynamic line wrapping at all.
|
||||||
|
|
||||||
|
Some teams strongly prefer a longer line length. For code maintained
|
||||||
|
exclusively or primarily by a team that can reach agreement on this
|
||||||
|
issue, it is okay to increase the nominal line length from 80 to
|
||||||
|
100 characters (effectively increasing the maximum length to 99
|
||||||
|
characters), provided that comments and docstrings are still wrapped
|
||||||
|
at 72 characters.
|
||||||
|
|
||||||
|
The Python standard library is conservative and requires limiting
|
||||||
|
lines to 79 characters (and docstrings/comments to 72).
|
||||||
|
|
||||||
The preferred way of wrapping long lines is by using Python's implied
|
The preferred way of wrapping long lines is by using Python's implied
|
||||||
line continuation inside parentheses, brackets and braces. Long lines
|
line continuation inside parentheses, brackets and braces. Long lines
|
||||||
can be broken over multiple lines by wrapping expressions in
|
can be broken over multiple lines by wrapping expressions in
|
||||||
parentheses. These should be used in preference to using a backslash
|
parentheses. These should be used in preference to using a backslash
|
||||||
for line continuation. Make sure to indent the continued line
|
for line continuation.
|
||||||
appropriately. The preferred place to break around a binary operator
|
|
||||||
is *after* the operator, not before it. Some examples::
|
Backslashes may still be appropriate at times. For example, long,
|
||||||
|
multiple ``with``-statements cannot use implicit continuation, so
|
||||||
|
backslashes are acceptable::
|
||||||
|
|
||||||
|
with open('/path/to/some/file/you/want/to/read') as file_1, \
|
||||||
|
open('/path/to/some/file/being/written', 'w') as file_2:
|
||||||
|
file_2.write(file_1.read())
|
||||||
|
|
||||||
|
Another such case is with ``assert`` statements.
|
||||||
|
|
||||||
|
Make sure to indent the continued line appropriately. The preferred
|
||||||
|
place to break around a binary operator is *after* the operator, not
|
||||||
|
before it. Some examples::
|
||||||
|
|
||||||
class Rectangle(Blob):
|
class Rectangle(Blob):
|
||||||
|
|
||||||
def __init__(self, width, height,
|
def __init__(self, width, height,
|
||||||
color='black', emphasis=None, highlight=0):
|
color='black', emphasis=None, highlight=0):
|
||||||
if (width == 0 and height == 0 and
|
if (width == 0 and height == 0 and
|
||||||
color == 'red' and emphasis == 'strong' or
|
color == 'red' and emphasis == 'strong' or
|
||||||
highlight > 100):
|
highlight > 100):
|
||||||
raise ValueError("sorry, you lose")
|
raise ValueError("sorry, you lose")
|
||||||
if width == 0 and height == 0 and (color == 'red' or
|
if width == 0 and height == 0 and (color == 'red' or
|
||||||
emphasis is None):
|
emphasis is None):
|
||||||
|
@ -198,18 +244,21 @@ you may use them to separate pages of related sections of your file.
|
||||||
Note, some editors and web-based code viewers may not recognize
|
Note, some editors and web-based code viewers may not recognize
|
||||||
control-L as a form feed and will show another glyph in its place.
|
control-L as a form feed and will show another glyph in its place.
|
||||||
|
|
||||||
Encodings (PEP 263)
|
|
||||||
-------------------
|
|
||||||
|
|
||||||
Code in the core Python distribution should always use the ASCII or
|
Source File Encoding
|
||||||
Latin-1 encoding (a.k.a. ISO-8859-1). For Python 3.0 and beyond,
|
--------------------
|
||||||
UTF-8 is preferred over Latin-1, see PEP 3120.
|
|
||||||
|
|
||||||
Files using ASCII should not have a coding cookie. Latin-1 (or UTF-8)
|
Code in the core Python distribution should always use UTF-8 (or ASCII
|
||||||
should only be used when a comment or docstring needs to mention an
|
in Python 2).
|
||||||
author name that requires Latin-1; otherwise, using ``\x``, ``\u`` or
|
|
||||||
``\U`` escapes is the preferred way to include non-ASCII data in
|
Files using ASCII (in Python 2) or UTF-8 (in Python 3) should not have
|
||||||
string literals.
|
an encoding declaration.
|
||||||
|
|
||||||
|
In the standard library, non-default encodings should be used only for
|
||||||
|
test purposes or when a comment or docstring needs to mention an author
|
||||||
|
name that contains non-ASCII characters; otherwise, using ``\x``,
|
||||||
|
``\u``, ``\U``, or ``\N`` escapes is the preferred way to include
|
||||||
|
non-ASCII data in string literals.
|
||||||
|
|
||||||
For Python 3.0 and beyond, the following policy is prescribed for the
|
For Python 3.0 and beyond, the following policy is prescribed for the
|
||||||
standard library (see PEP 3131): All identifiers in the Python
|
standard library (see PEP 3131): All identifiers in the Python
|
||||||
|
@ -253,11 +302,27 @@ Imports
|
||||||
|
|
||||||
Put any relevant ``__all__`` specification after the imports.
|
Put any relevant ``__all__`` specification after the imports.
|
||||||
|
|
||||||
- Relative imports for intra-package imports are highly discouraged.
|
- Absolute imports are recommended, as they are usually more readable
|
||||||
Always use the absolute package path for all imports. Even now that
|
and tend to be better behaved (or at least give better error
|
||||||
PEP 328 is fully implemented in Python 2.5, its style of explicit
|
messages) if the import system is incorrectly configured (such as
|
||||||
relative imports is actively discouraged; absolute imports are more
|
when a directory inside a package ends up on ``sys.path``)::
|
||||||
portable and usually more readable.
|
|
||||||
|
import mypkg.sibling
|
||||||
|
from mypkg import sibling
|
||||||
|
from mypkg.sibling import example
|
||||||
|
|
||||||
|
However, explicit relative imports are an acceptable alternative to
|
||||||
|
absolute imports, especially when dealing with complex package layouts
|
||||||
|
where using absolute imports would be unnecessarily verbose::
|
||||||
|
|
||||||
|
from . import sibling
|
||||||
|
from .sibling import example
|
||||||
|
|
||||||
|
Standard library code should avoid complex package layouts and always
|
||||||
|
use absolute imports.
|
||||||
|
|
||||||
|
Implicit relative imports should *never* be used and have been removed
|
||||||
|
in Python 3.
|
||||||
|
|
||||||
- When importing a class from a class-containing module, it's usually
|
- When importing a class from a class-containing module, it's usually
|
||||||
okay to spell this::
|
okay to spell this::
|
||||||
|
@ -272,6 +337,18 @@ Imports
|
||||||
|
|
||||||
and use "myclass.MyClass" and "foo.bar.yourclass.YourClass".
|
and use "myclass.MyClass" and "foo.bar.yourclass.YourClass".
|
||||||
|
|
||||||
|
- Wildcard imports (``from <module> import *``) should be avoided, as
|
||||||
|
they make it unclear which names are present in the namespace,
|
||||||
|
confusing both readers and many automated tools. There is one
|
||||||
|
defensible use case for a wildcard import, which is to republish an
|
||||||
|
internal interface as part of a public API (for example, overwriting
|
||||||
|
a pure Python implementation of an interface with the definitions
|
||||||
|
from an optional accelerator module and exactly which definitions
|
||||||
|
will be overwritten isn't known in advance).
|
||||||
|
|
||||||
|
When republishing names this way, the guidelines below regarding
|
||||||
|
public and internal interfaces still apply.
|
||||||
|
|
||||||
|
|
||||||
Whitespace in Expressions and Statements
|
Whitespace in Expressions and Statements
|
||||||
========================================
|
========================================
|
||||||
|
@ -330,7 +407,7 @@ Other Recommendations
|
||||||
|
|
||||||
- If operators with different priorities are used, consider adding
|
- If operators with different priorities are used, consider adding
|
||||||
whitespace around the operators with the lowest priority(ies). Use
|
whitespace around the operators with the lowest priority(ies). Use
|
||||||
your own judgement; however, never use more than one space, and
|
your own judgment; however, never use more than one space, and
|
||||||
always have the same amount of whitespace on both sides of a binary
|
always have the same amount of whitespace on both sides of a binary
|
||||||
operator.
|
operator.
|
||||||
|
|
||||||
|
@ -747,6 +824,36 @@ With this in mind, here are the Pythonic guidelines:
|
||||||
advanced callers.
|
advanced callers.
|
||||||
|
|
||||||
|
|
||||||
|
Public and internal interfaces
|
||||||
|
------------------------------
|
||||||
|
|
||||||
|
Any backwards compatibility guarantees apply only to public interfaces.
|
||||||
|
Accordingly, it is important that users be able to clearly distinguish
|
||||||
|
between public and internal interfaces.
|
||||||
|
|
||||||
|
Documented interfaces are considered public, unless the documentation
|
||||||
|
explicitly declares them to be provisional or internal interfaces exempt
|
||||||
|
from the usual backwards compatibility guarantees. All undocumented
|
||||||
|
interfaces should be assumed to be internal.
|
||||||
|
|
||||||
|
To better support introspection, modules should explicitly declare the
|
||||||
|
names in their public API using the ``__all__`` attribute. Setting
|
||||||
|
``__all__`` to an empty list indicates that the module has no public API.
|
||||||
|
|
||||||
|
Even with ``__all__`` set appropriately, internal interfaces (packages,
|
||||||
|
modules, classes, functions, attributes or other names) should still be
|
||||||
|
prefixed with a single leading underscore.
|
||||||
|
|
||||||
|
An interface is also considered internal if any containing namespace
|
||||||
|
(package, module or class) is considered internal.
|
||||||
|
|
||||||
|
Imported names should always be considered an implementation detail.
|
||||||
|
Other modules must not rely on indirect access to such imported names
|
||||||
|
unless they are an explicitly documented part of the containing module's
|
||||||
|
API, such as ``os.path`` or a package's ``__init__`` module that exposes
|
||||||
|
functionality from submodules.
|
||||||
|
|
||||||
|
|
||||||
Programming Recommendations
|
Programming Recommendations
|
||||||
===========================
|
===========================
|
||||||
|
|
||||||
|
@ -756,10 +863,12 @@ Programming Recommendations
|
||||||
|
|
||||||
For example, do not rely on CPython's efficient implementation of
|
For example, do not rely on CPython's efficient implementation of
|
||||||
in-place string concatenation for statements in the form ``a += b``
|
in-place string concatenation for statements in the form ``a += b``
|
||||||
or ``a = a + b``. Those statements run more slowly in Jython. In
|
or ``a = a + b``. This optimization is fragile even in CPython (it
|
||||||
performance sensitive parts of the library, the ``''.join()`` form
|
only works for some types) and isn't present at all in implementations
|
||||||
should be used instead. This will ensure that concatenation occurs
|
that don't use refcounting. In performance sensitive parts of the
|
||||||
in linear time across various implementations.
|
library, the ``''.join()`` form should be used instead. This will
|
||||||
|
ensure that concatenation occurs in linear time across various
|
||||||
|
implementations.
|
||||||
|
|
||||||
- Comparisons to singletons like None should always be done with
|
- Comparisons to singletons like None should always be done with
|
||||||
``is`` or ``is not``, never the equality operators.
|
``is`` or ``is not``, never the equality operators.
|
||||||
|
@ -786,29 +895,59 @@ Programming Recommendations
|
||||||
operator. However, it is best to implement all six operations so
|
operator. However, it is best to implement all six operations so
|
||||||
that confusion doesn't arise in other contexts.
|
that confusion doesn't arise in other contexts.
|
||||||
|
|
||||||
- Use class-based exceptions.
|
- Always use a def statement instead of an assignment statement that binds
|
||||||
|
a lambda expression directly to a name.
|
||||||
|
|
||||||
String exceptions in new code are forbidden, and this language
|
Yes::
|
||||||
feature has been removed in Python 2.6.
|
|
||||||
|
|
||||||
Modules or packages should define their own domain-specific base
|
def f(x): return 2*x
|
||||||
exception class, which should be subclassed from the built-in
|
|
||||||
Exception class. Always include a class docstring. E.g.::
|
|
||||||
|
|
||||||
class MessageError(Exception):
|
No::
|
||||||
"""Base class for errors in the email package."""
|
|
||||||
|
f = lambda x: 2*x
|
||||||
|
|
||||||
|
The first form means that the name of the resulting function object is
|
||||||
|
specifically 'f' instead of the generic '<lambda>'. This is more
|
||||||
|
useful for tracebacks and string representations in general. The use
|
||||||
|
of the assignment statement eliminates the sole benefit a lambda
|
||||||
|
expression can offer over an explicit def statement (i.e. that it can
|
||||||
|
be embedded inside a larger expression)
|
||||||
|
|
||||||
|
- Derive exceptions from ``Exception`` rather than ``BaseException``.
|
||||||
|
Direct inheritance from ``BaseException`` is reserved for exceptions
|
||||||
|
where catching them is almost always the wrong thing to do.
|
||||||
|
|
||||||
|
Design exception hierarchies based on the distinctions that code
|
||||||
|
*catching* the exceptions is likely to need, rather than the locations
|
||||||
|
where the exceptions are raised. Aim to answer the question
|
||||||
|
"What went wrong?" programmatically, rather than only stating that
|
||||||
|
"A problem occurred" (see PEP 3151 for an example of this lesson being
|
||||||
|
learned for the builtin exception hierarchy)
|
||||||
|
|
||||||
Class naming conventions apply here, although you should add the
|
Class naming conventions apply here, although you should add the
|
||||||
suffix "Error" to your exception classes, if the exception is an
|
suffix "Error" to your exception classes if the exception is an
|
||||||
error. Non-error exceptions need no special suffix.
|
error. Non-error exceptions that are used for non-local flow control
|
||||||
|
or other forms of signaling need no special suffix.
|
||||||
|
|
||||||
- When raising an exception, use ``raise ValueError('message')``
|
- Use exception chaining appropriately. In Python 3, "raise X from Y"
|
||||||
|
should be used to indicate explicit replacement without losing the
|
||||||
|
original traceback.
|
||||||
|
|
||||||
|
When deliberately replacing an inner exception (using "raise X" in
|
||||||
|
Python 2 or "raise X from None" in Python 3.3+), ensure that relevant
|
||||||
|
details are transferred to the new exception (such as preserving the
|
||||||
|
attribute name when converting KeyError to AttributeError, or
|
||||||
|
embedding the text of the original exception in the new exception
|
||||||
|
message).
|
||||||
|
|
||||||
|
- When raising an exception in Python 2, use ``raise ValueError('message')``
|
||||||
instead of the older form ``raise ValueError, 'message'``.
|
instead of the older form ``raise ValueError, 'message'``.
|
||||||
|
|
||||||
The paren-using form is preferred because when the exception
|
The latter form is not legal Python 3 syntax.
|
||||||
arguments are long or include string formatting, you don't need to
|
|
||||||
use line continuation characters thanks to the containing
|
The paren-using form also means that when the exception arguments are
|
||||||
parentheses. The older form is not legal syntax in Python 3.
|
long or include string formatting, you don't need to use line
|
||||||
|
continuation characters thanks to the containing parentheses.
|
||||||
|
|
||||||
- When catching exceptions, mention specific exceptions whenever
|
- When catching exceptions, mention specific exceptions whenever
|
||||||
possible instead of using a bare ``except:`` clause.
|
possible instead of using a bare ``except:`` clause.
|
||||||
|
@ -838,6 +977,21 @@ Programming Recommendations
|
||||||
exception propagate upwards with ``raise``. ``try...finally``
|
exception propagate upwards with ``raise``. ``try...finally``
|
||||||
can be a better way to handle this case.
|
can be a better way to handle this case.
|
||||||
|
|
||||||
|
- When binding caught exceptions to a name, prefer the explicit name
|
||||||
|
binding syntax added in Python 2.6::
|
||||||
|
|
||||||
|
try:
|
||||||
|
process_data()
|
||||||
|
except Exception as exc:
|
||||||
|
raise DataProcessingFailedError(str(exc))
|
||||||
|
|
||||||
|
This is the only syntax supported in Python 3, and avoids the
|
||||||
|
ambiguity problems associated with the older comma-based syntax.
|
||||||
|
|
||||||
|
- When catching operating system errors, prefer the explicit exception
|
||||||
|
hierarchy introduced in Python 3.3 over introspection of ``errno``
|
||||||
|
values.
|
||||||
|
|
||||||
- Additionally, for all try/except clauses, limit the ``try`` clause
|
- Additionally, for all try/except clauses, limit the ``try`` clause
|
||||||
to the absolute minimum amount of code necessary. Again, this
|
to the absolute minimum amount of code necessary. Again, this
|
||||||
avoids masking bugs.
|
avoids masking bugs.
|
||||||
|
@ -860,6 +1014,10 @@ Programming Recommendations
|
||||||
# Will also catch KeyError raised by handle_value()
|
# Will also catch KeyError raised by handle_value()
|
||||||
return key_not_found(key)
|
return key_not_found(key)
|
||||||
|
|
||||||
|
- When a resource is local to a particular section of code, use a
|
||||||
|
``with`` statement to ensure it is cleaned up promptly and reliably
|
||||||
|
after use. A try/finally statement is also acceptable.
|
||||||
|
|
||||||
- Context managers should be invoked through separate functions or methods
|
- Context managers should be invoked through separate functions or methods
|
||||||
whenever they do something other than acquire and release resources.
|
whenever they do something other than acquire and release resources.
|
||||||
For example:
|
For example:
|
||||||
|
@ -894,9 +1052,6 @@ Programming Recommendations
|
||||||
Yes: if foo.startswith('bar'):
|
Yes: if foo.startswith('bar'):
|
||||||
No: if foo[:3] == 'bar':
|
No: if foo[:3] == 'bar':
|
||||||
|
|
||||||
The exception is if your code must work with Python 1.5.2 (but let's
|
|
||||||
hope not!).
|
|
||||||
|
|
||||||
- Object type comparisons should always use isinstance() instead of
|
- Object type comparisons should always use isinstance() instead of
|
||||||
comparing types directly. ::
|
comparing types directly. ::
|
||||||
|
|
||||||
|
@ -905,11 +1060,15 @@ Programming Recommendations
|
||||||
No: if type(obj) is type(1):
|
No: if type(obj) is type(1):
|
||||||
|
|
||||||
When checking if an object is a string, keep in mind that it might
|
When checking if an object is a string, keep in mind that it might
|
||||||
be a unicode string too! In Python 2.3, str and unicode have a
|
be a unicode string too! In Python 2, str and unicode have a
|
||||||
common base class, basestring, so you can do::
|
common base class, basestring, so you can do::
|
||||||
|
|
||||||
if isinstance(obj, basestring):
|
if isinstance(obj, basestring):
|
||||||
|
|
||||||
|
Note that in Python 3, ``unicode`` and ``basestring`` no longer exist
|
||||||
|
(there is only ``str``) and a bytes object is no longer a kind of
|
||||||
|
string (it is a sequence of integers instead)
|
||||||
|
|
||||||
- For sequences, (strings, lists, tuples), use the fact that empty
|
- For sequences, (strings, lists, tuples), use the fact that empty
|
||||||
sequences are false. ::
|
sequences are false. ::
|
||||||
|
|
||||||
|
@ -934,6 +1093,10 @@ Programming Recommendations
|
||||||
annotation style. Instead, the annotations are left for users to
|
annotation style. Instead, the annotations are left for users to
|
||||||
discover and experiment with useful annotation styles.
|
discover and experiment with useful annotation styles.
|
||||||
|
|
||||||
|
It is recommended that third party experiments with annotations use an
|
||||||
|
associated decorator to indicate how the annotation should be
|
||||||
|
interpreted.
|
||||||
|
|
||||||
Early core developer attempts to use function annotations revealed
|
Early core developer attempts to use function annotations revealed
|
||||||
inconsistent, ad-hoc annotation styles. For example:
|
inconsistent, ad-hoc annotation styles. For example:
|
||||||
|
|
||||||
|
@ -991,6 +1154,8 @@ References
|
||||||
|
|
||||||
.. [3] http://www.wikipedia.com/wiki/CamelCase
|
.. [3] http://www.wikipedia.com/wiki/CamelCase
|
||||||
|
|
||||||
|
.. [4] PEP 8 modernisation, July 2013
|
||||||
|
http://bugs.python.org/issue18472
|
||||||
|
|
||||||
Copyright
|
Copyright
|
||||||
=========
|
=========
|
||||||
|
|
23
pep-0315.txt
23
pep-0315.txt
|
@ -4,7 +4,7 @@ Version: $Revision$
|
||||||
Last-Modified: $Date$
|
Last-Modified: $Date$
|
||||||
Author: Raymond Hettinger <python@rcn.com>
|
Author: Raymond Hettinger <python@rcn.com>
|
||||||
W Isaac Carroll <icarroll@pobox.com>
|
W Isaac Carroll <icarroll@pobox.com>
|
||||||
Status: Deferred
|
Status: Rejected
|
||||||
Type: Standards Track
|
Type: Standards Track
|
||||||
Content-Type: text/plain
|
Content-Type: text/plain
|
||||||
Created: 25-Apr-2003
|
Created: 25-Apr-2003
|
||||||
|
@ -21,19 +21,32 @@ Abstract
|
||||||
|
|
||||||
Notice
|
Notice
|
||||||
|
|
||||||
Deferred; see
|
Rejected; see
|
||||||
|
http://mail.python.org/pipermail/python-ideas/2013-June/021610.html
|
||||||
|
|
||||||
|
This PEP has been deferred since 2006; see
|
||||||
http://mail.python.org/pipermail/python-dev/2006-February/060718.html
|
http://mail.python.org/pipermail/python-dev/2006-February/060718.html
|
||||||
|
|
||||||
Subsequent efforts to revive the PEP in April 2009 did not
|
Subsequent efforts to revive the PEP in April 2009 did not
|
||||||
meet with success because no syntax emerged that could
|
meet with success because no syntax emerged that could
|
||||||
compete with a while-True and an inner if-break.
|
compete with the following form:
|
||||||
|
|
||||||
A syntax was found for a basic do-while loop but it found
|
while True:
|
||||||
had little support because the condition was at the top:
|
<setup code>
|
||||||
|
if not <condition>:
|
||||||
|
break
|
||||||
|
<loop body>
|
||||||
|
|
||||||
|
A syntax alternative to the one proposed in the PEP was found for
|
||||||
|
a basic do-while loop but it gained little support because the
|
||||||
|
condition was at the top:
|
||||||
|
|
||||||
do ... while <cond>:
|
do ... while <cond>:
|
||||||
<loop body>
|
<loop body>
|
||||||
|
|
||||||
|
Users of the language are advised to use the while-True form with
|
||||||
|
an inner if-break when a do-while loop would have been appropriate.
|
||||||
|
|
||||||
|
|
||||||
Motivation
|
Motivation
|
||||||
|
|
||||||
|
|
55
pep-0394.txt
55
pep-0394.txt
|
@ -19,10 +19,17 @@ This PEP provides a convention to ensure that Python scripts can continue to
|
||||||
be portable across ``*nix`` systems, regardless of the default version of the
|
be portable across ``*nix`` systems, regardless of the default version of the
|
||||||
Python interpreter (i.e. the version invoked by the ``python`` command).
|
Python interpreter (i.e. the version invoked by the ``python`` command).
|
||||||
|
|
||||||
* ``python2`` will refer to some version of Python 2.x
|
* ``python2`` will refer to some version of Python 2.x.
|
||||||
* ``python3`` will refer to some version of Python 3.x
|
* ``python3`` will refer to some version of Python 3.x.
|
||||||
* ``python`` *should* refer to the same target as ``python2`` but *may*
|
* for the time being, all distributions *should* ensure that ``python``
|
||||||
refer to ``python3`` on some bleeding edge distributions
|
refers to the same target as ``python2``.
|
||||||
|
* however, end users should be aware that ``python`` refers to ``python3``
|
||||||
|
on at least Arch Linux (that change is what prompted the creation of this
|
||||||
|
PEP), so ``python`` should be used in the shebang line only for scripts
|
||||||
|
that are source compatible with both Python 2 and 3.
|
||||||
|
* in preparation for an eventual change in the default version of Python,
|
||||||
|
Python 2 only scripts should either be updated to be source compatible
|
||||||
|
with Python 3 or else to use ``python2`` in the shebang line.
|
||||||
|
|
||||||
|
|
||||||
Recommendation
|
Recommendation
|
||||||
|
@ -103,15 +110,29 @@ aspects of migrating to Python 3 as the default version of Python for a
|
||||||
system. They will hopefully be helpful to any distributions considering
|
system. They will hopefully be helpful to any distributions considering
|
||||||
making such a change.
|
making such a change.
|
||||||
|
|
||||||
* Distributions that only include ``python3`` in their base install (i.e.
|
* The main barrier to a distribution switching the ``python`` command from
|
||||||
they do not provide ``python2`` by default) along with those that are
|
``python2`` to ``python3`` isn't breakage within the distribution, but
|
||||||
aggressively trying to reach that point (and are willing to break third
|
instead breakage of private third party scripts developed by sysadmins
|
||||||
party scripts while attempting to get there) are already beginning to alias
|
and other users. Updating the ``python`` command to invoke ``python3``
|
||||||
the ``python`` command to ``python3``
|
by default indicates that a distribution is willing to break such scripts
|
||||||
* More conservative distributions that are less willing to tolerate breakage
|
with errors that are potentially quite confusing for users that aren't
|
||||||
of third party scripts continue to alias it to ``python2``. Until the
|
yet familiar with the backwards incompatible changes in Python 3. For
|
||||||
conventions described in this PEP are more widely adopted, having ``python``
|
example, while the change of ``print`` from a statement to a builtin
|
||||||
invoke ``python2`` will remain the recommended option.
|
function is relatively simple for automated converters to handle, the
|
||||||
|
SyntaxError from attempting to use the Python 2 notation in Python 3 is
|
||||||
|
thoroughly confusing if you aren't already aware of the change::
|
||||||
|
|
||||||
|
$ python3 -c 'print "Hello, world!"'
|
||||||
|
File "<string>", line 1
|
||||||
|
print "Hello, world!"
|
||||||
|
^
|
||||||
|
SyntaxError: invalid syntax
|
||||||
|
|
||||||
|
* Avoiding breakage of such third party scripts is the key reason this
|
||||||
|
PEP recommends that ``python`` continue to refer to ``python2`` for the
|
||||||
|
time being. Until the conventions described in this PEP are more widely
|
||||||
|
adopted, having ``python`` invoke ``python2`` will remain the recommended
|
||||||
|
option.
|
||||||
* The ``pythonX.X`` (e.g. ``python2.6``) commands exist on some systems, on
|
* The ``pythonX.X`` (e.g. ``python2.6``) commands exist on some systems, on
|
||||||
which they invoke specific minor versions of the Python interpreter. It
|
which they invoke specific minor versions of the Python interpreter. It
|
||||||
can be useful for distribution-specific packages to take advantage of these
|
can be useful for distribution-specific packages to take advantage of these
|
||||||
|
@ -148,10 +169,13 @@ making such a change.
|
||||||
``python`` command is only executed in an interactive manner as a user
|
``python`` command is only executed in an interactive manner as a user
|
||||||
convenience, or to run scripts that are source compatible with both Python
|
convenience, or to run scripts that are source compatible with both Python
|
||||||
2 and Python 3.
|
2 and Python 3.
|
||||||
|
* one symbolic date being considered for a possible change to the official
|
||||||
|
recommendation in this PEP is the planned switch of Python 2.7 from full
|
||||||
|
maintenance to security update only status in 2015 (see PEP 373).
|
||||||
|
|
||||||
|
|
||||||
Backwards Compatibility
|
Backwards Compatibility
|
||||||
=========================
|
=======================
|
||||||
|
|
||||||
A potential problem can arise if a script adhering to the
|
A potential problem can arise if a script adhering to the
|
||||||
``python2``/``python3`` convention is executed on a system not supporting
|
``python2``/``python3`` convention is executed on a system not supporting
|
||||||
|
@ -217,7 +241,8 @@ Exclusion of MS Windows
|
||||||
This PEP deliberately excludes any proposals relating to Microsoft Windows, as
|
This PEP deliberately excludes any proposals relating to Microsoft Windows, as
|
||||||
devising an equivalent solution for Windows was deemed too complex to handle
|
devising an equivalent solution for Windows was deemed too complex to handle
|
||||||
here. PEP 397 and the related discussion on the python-dev mailing list
|
here. PEP 397 and the related discussion on the python-dev mailing list
|
||||||
address this issue.
|
address this issue (like this PEP, the PEP 397 launcher invokes Python 2 by
|
||||||
|
default if versions of both Python 2 and 3 are installed on the system).
|
||||||
|
|
||||||
|
|
||||||
References
|
References
|
||||||
|
|
|
@ -627,7 +627,7 @@ release, one possible layout for such an approach might look like::
|
||||||
<news entries>
|
<news entries>
|
||||||
# Add maint.1, compat.1 etc as releases are made
|
# Add maint.1, compat.1 etc as releases are made
|
||||||
|
|
||||||
Putting the version information in the directory heirarchy isn't strictly
|
Putting the version information in the directory hierarchy isn't strictly
|
||||||
necessary (since the NEWS file generator could figure out from the version
|
necessary (since the NEWS file generator could figure out from the version
|
||||||
history), but does make it easier for *humans* to keep the different versions
|
history), but does make it easier for *humans* to keep the different versions
|
||||||
in order.
|
in order.
|
||||||
|
|
2384
pep-0426.txt
2384
pep-0426.txt
File diff suppressed because it is too large
Load Diff
|
@ -0,0 +1,329 @@
|
||||||
|
{
|
||||||
|
"id": "http://www.python.org/dev/peps/pep-0426/",
|
||||||
|
"$schema": "http://json-schema.org/draft-04/schema#",
|
||||||
|
"title": "Metadata for Python Software Packages 2.0",
|
||||||
|
"type": "object",
|
||||||
|
"properties": {
|
||||||
|
"metadata_version": {
|
||||||
|
"description": "Version of the file format",
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^(\\d+(\\.\\d+)*)$"
|
||||||
|
},
|
||||||
|
"generator": {
|
||||||
|
"description": "Name and version of the program that produced this file.",
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^[0-9A-Za-z]([0-9A-Za-z_.-]*[0-9A-Za-z])( \\(.*\\))?$"
|
||||||
|
},
|
||||||
|
"name": {
|
||||||
|
"description": "The name of the distribution.",
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/distribution_name"
|
||||||
|
},
|
||||||
|
"version": {
|
||||||
|
"description": "The distribution's public version identifier",
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^(\\d+(\\.\\d+)*)((a|b|c|rc)(\\d+))?(\\.(post)(\\d+))?(\\.(dev)(\\d+))?$"
|
||||||
|
},
|
||||||
|
"source_label": {
|
||||||
|
"description": "A constrained identifying text string",
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^[0-9a-z_.-+]+$"
|
||||||
|
},
|
||||||
|
"source_url": {
|
||||||
|
"description": "A string containing a full URL where the source for this specific version of the distribution can be downloaded.",
|
||||||
|
"type": "string",
|
||||||
|
"format": "uri"
|
||||||
|
},
|
||||||
|
"summary": {
|
||||||
|
"description": "A one-line summary of what the distribution does.",
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"document_names": {
|
||||||
|
"description": "Names of supporting metadata documents",
|
||||||
|
"type": "object",
|
||||||
|
"properties": {
|
||||||
|
"description": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/document_name"
|
||||||
|
},
|
||||||
|
"changelog": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/document_name"
|
||||||
|
},
|
||||||
|
"license": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/document_name"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"additionalProperties": false
|
||||||
|
},
|
||||||
|
"keywords": {
|
||||||
|
"description": "A list of additional keywords to be used to assist searching for the distribution in a larger catalog.",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"license": {
|
||||||
|
"description": "A string indicating the license covering the distribution.",
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"classifiers": {
|
||||||
|
"description": "A list of strings, with each giving a single classification value for the distribution.",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"contacts": {
|
||||||
|
"description": "A list of contributor entries giving the recommended contact points for getting more information about the project.",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "object",
|
||||||
|
"$ref": "#/definitions/contact"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"contributors": {
|
||||||
|
"description": "A list of contributor entries for other contributors not already listed as current project points of contact.",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "object",
|
||||||
|
"$ref": "#/definitions/contact"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"project_urls": {
|
||||||
|
"description": "A mapping of arbitrary text labels to additional URLs relevant to the project.",
|
||||||
|
"type": "object"
|
||||||
|
},
|
||||||
|
"extras": {
|
||||||
|
"description": "A list of optional sets of dependencies that may be used to define conditional dependencies in \"may_require\" and similar fields.",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/extra_name"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"meta_requires": {
|
||||||
|
"description": "A list of subdistributions made available through this metadistribution.",
|
||||||
|
"type": "array",
|
||||||
|
"$ref": "#/definitions/dependencies"
|
||||||
|
},
|
||||||
|
"run_requires": {
|
||||||
|
"description": "A list of other distributions needed to run this distribution.",
|
||||||
|
"type": "array",
|
||||||
|
"$ref": "#/definitions/dependencies"
|
||||||
|
},
|
||||||
|
"test_requires": {
|
||||||
|
"description": "A list of other distributions needed when this distribution is tested.",
|
||||||
|
"type": "array",
|
||||||
|
"$ref": "#/definitions/dependencies"
|
||||||
|
},
|
||||||
|
"build_requires": {
|
||||||
|
"description": "A list of other distributions needed when this distribution is built.",
|
||||||
|
"type": "array",
|
||||||
|
"$ref": "#/definitions/dependencies"
|
||||||
|
},
|
||||||
|
"dev_requires": {
|
||||||
|
"description": "A list of other distributions needed when this distribution is developed.",
|
||||||
|
"type": "array",
|
||||||
|
"$ref": "#/definitions/dependencies"
|
||||||
|
},
|
||||||
|
"provides": {
|
||||||
|
"description": "A list of strings naming additional dependency requirements that are satisfied by installing this distribution. These strings must be of the form Name or Name (Version)",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/provides_declaration"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"modules": {
|
||||||
|
"description": "A list of modules and/or packages available for import after installing this distribution.",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/qualified_name"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"namespaces": {
|
||||||
|
"description": "A list of namespace packages this distribution contributes to",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/qualified_name"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"commands": {
|
||||||
|
"description": "Command line interfaces provided by this distribution",
|
||||||
|
"type": "object",
|
||||||
|
"$ref": "#/definitions/commands"
|
||||||
|
},
|
||||||
|
"exports": {
|
||||||
|
"description": "Other exported interfaces provided by this distribution",
|
||||||
|
"type": "object",
|
||||||
|
"$ref": "#/definitions/exports"
|
||||||
|
},
|
||||||
|
"obsoleted_by": {
|
||||||
|
"description": "A string that indicates that this project is no longer being developed. The named project provides a substitute or replacement.",
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/requirement"
|
||||||
|
},
|
||||||
|
"supports_environments": {
|
||||||
|
"description": "A list of strings specifying the environments that the distribution explicitly supports.",
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/environment_marker"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"install_hooks": {
|
||||||
|
"description": "The install_hooks field is used to define various operations that may be invoked on a distribution in a platform independent manner.",
|
||||||
|
"type": "object",
|
||||||
|
"properties": {
|
||||||
|
"postinstall": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/export_specifier"
|
||||||
|
},
|
||||||
|
"preuninstall": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/export_specifier"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"extensions": {
|
||||||
|
"description": "Extensions to the metadata may be present in a mapping under the 'extensions' key.",
|
||||||
|
"type": "object"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
|
||||||
|
"required": ["metadata_version", "name", "version", "summary"],
|
||||||
|
"additionalProperties": false,
|
||||||
|
|
||||||
|
"definitions": {
|
||||||
|
"contact": {
|
||||||
|
"type": "object",
|
||||||
|
"properties": {
|
||||||
|
"name": {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"email": {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"url": {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"role": {
|
||||||
|
"type": "string"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"required": ["name"],
|
||||||
|
"additionalProperties": false
|
||||||
|
},
|
||||||
|
"dependencies": {
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "object",
|
||||||
|
"$ref": "#/definitions/dependency"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"dependency": {
|
||||||
|
"type": "object",
|
||||||
|
"properties": {
|
||||||
|
"extra": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/valid_name"
|
||||||
|
},
|
||||||
|
"environment": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/environment_marker"
|
||||||
|
},
|
||||||
|
"requires": {
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/requirement"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"required": ["requires"],
|
||||||
|
"additionalProperties": false
|
||||||
|
},
|
||||||
|
"commands": {
|
||||||
|
"type": "object",
|
||||||
|
"properties": {
|
||||||
|
"wrap_console": {
|
||||||
|
"type": "object",
|
||||||
|
"$ref": "#/definitions/command_map"
|
||||||
|
},
|
||||||
|
"wrap_gui": {
|
||||||
|
"type": "object",
|
||||||
|
"$ref": "#/definitions/command_map"
|
||||||
|
},
|
||||||
|
"prebuilt": {
|
||||||
|
"type": "array",
|
||||||
|
"items": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/relative_path"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"additionalProperties": false
|
||||||
|
},
|
||||||
|
"exports": {
|
||||||
|
"type": "object",
|
||||||
|
"patternProperties": {
|
||||||
|
"^[A-Za-z]([0-9A-Za-z_])*([.][A-Za-z]([0-9A-Za-z_])*)*$": {
|
||||||
|
"type": "object",
|
||||||
|
"patternProperties": {
|
||||||
|
".": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/export_specifier"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"additionalProperties": false
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"additionalProperties": false
|
||||||
|
},
|
||||||
|
"command_map": {
|
||||||
|
"type": "object",
|
||||||
|
"patternProperties": {
|
||||||
|
"^[0-9A-Za-z]([0-9A-Za-z_.-]*[0-9A-Za-z])?$": {
|
||||||
|
"type": "string",
|
||||||
|
"$ref": "#/definitions/export_specifier"
|
||||||
|
}
|
||||||
|
},
|
||||||
|
"additionalProperties": false
|
||||||
|
},
|
||||||
|
"distribution_name": {
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^[0-9A-Za-z]([0-9A-Za-z_.-]*[0-9A-Za-z])?$"
|
||||||
|
},
|
||||||
|
"requirement": {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"provides_declaration": {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"environment_marker": {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"document_name": {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"extra_name" : {
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^[0-9A-Za-z]([0-9A-Za-z_.-]*[0-9A-Za-z])?$"
|
||||||
|
},
|
||||||
|
"relative_path" : {
|
||||||
|
"type": "string"
|
||||||
|
},
|
||||||
|
"export_specifier": {
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^([A-Za-z_][A-Za-z_0-9]*([.][A-Za-z_][A-Za-z_0-9]*)*)(:[A-Za-z_][A-Za-z_0-9]*([.][A-Za-z_][A-Za-z_0-9]*)*)?(\\[[0-9A-Za-z]([0-9A-Za-z_.-]*[0-9A-Za-z])?\\])?$"
|
||||||
|
},
|
||||||
|
"qualified_name" : {
|
||||||
|
"type": "string",
|
||||||
|
"pattern": "^[A-Za-z_][A-Za-z_0-9]*([.][A-Za-z_][A-Za-z_0-9]*)*$"
|
||||||
|
}
|
||||||
|
}
|
||||||
|
}
|
29
pep-0432.txt
29
pep-0432.txt
|
@ -3,11 +3,11 @@ Title: Simplifying the CPython startup sequence
|
||||||
Version: $Revision$
|
Version: $Revision$
|
||||||
Last-Modified: $Date$
|
Last-Modified: $Date$
|
||||||
Author: Nick Coghlan <ncoghlan@gmail.com>
|
Author: Nick Coghlan <ncoghlan@gmail.com>
|
||||||
Status: Draft
|
Status: Deferred
|
||||||
Type: Standards Track
|
Type: Standards Track
|
||||||
Content-Type: text/x-rst
|
Content-Type: text/x-rst
|
||||||
Created: 28-Dec-2012
|
Created: 28-Dec-2012
|
||||||
Python-Version: 3.4
|
Python-Version: 3.5
|
||||||
Post-History: 28-Dec-2012, 2-Jan-2013
|
Post-History: 28-Dec-2012, 2-Jan-2013
|
||||||
|
|
||||||
|
|
||||||
|
@ -25,6 +25,31 @@ resolution for most of these should become clearer as the reference
|
||||||
implementation is developed.
|
implementation is developed.
|
||||||
|
|
||||||
|
|
||||||
|
PEP Deferral
|
||||||
|
============
|
||||||
|
|
||||||
|
Python 3.4 is nearing its first alpha, and already includes a couple of
|
||||||
|
significant low level changes in PEP 445 (memory allocator customisation)
|
||||||
|
and PEP 442 (safe object finalization). As a result of the latter PEP,
|
||||||
|
the shutdown procedure of CPython has also been changed to be more heavily
|
||||||
|
reliant on the cyclic garbage collector, significantly reducing the
|
||||||
|
number of modules that will experience the "module globals set to None"
|
||||||
|
behaviour that is used to deliberate break cycles and attempt to releases
|
||||||
|
more external resources cleanly.
|
||||||
|
|
||||||
|
Furthermore, I am heavily involved in the current round of updates to the
|
||||||
|
Python packaging ecosystem (as both the lead author of PEP 426 and
|
||||||
|
BDFL-delegate for several other PEPs), leaving little to spare to work on
|
||||||
|
this proposal. The other developers I would trust to lead this effort are
|
||||||
|
also working on other things.
|
||||||
|
|
||||||
|
So, due to those practical resource constraints, the proximity of Python
|
||||||
|
3.4 deadlines, and recognition that making too many significant changes to
|
||||||
|
the low level CPython infrastructure in one release is likely to be unwise,
|
||||||
|
further work on this PEP has been deferred to the Python 3.5 development
|
||||||
|
cycle.
|
||||||
|
|
||||||
|
|
||||||
Proposal
|
Proposal
|
||||||
========
|
========
|
||||||
|
|
||||||
|
|
|
@ -5,7 +5,7 @@ Last-Modified: $Date$
|
||||||
Author: Barry Warsaw <barry@python.org>,
|
Author: Barry Warsaw <barry@python.org>,
|
||||||
Eli Bendersky <eliben@gmail.com>,
|
Eli Bendersky <eliben@gmail.com>,
|
||||||
Ethan Furman <ethan@stoneleaf.us>
|
Ethan Furman <ethan@stoneleaf.us>
|
||||||
Status: Accepted
|
Status: Final
|
||||||
Type: Standards Track
|
Type: Standards Track
|
||||||
Content-Type: text/x-rst
|
Content-Type: text/x-rst
|
||||||
Created: 2013-02-23
|
Created: 2013-02-23
|
||||||
|
@ -467,6 +467,10 @@ assignment to ``Animal`` is equivalent to::
|
||||||
... cat = 3
|
... cat = 3
|
||||||
... dog = 4
|
... dog = 4
|
||||||
|
|
||||||
|
The reason for defaulting to ``1`` as the starting number and not ``0`` is
|
||||||
|
that ``0`` is ``False`` in a boolean sense, but enum members all evaluate
|
||||||
|
to ``True``.
|
||||||
|
|
||||||
|
|
||||||
Proposed variations
|
Proposed variations
|
||||||
===================
|
===================
|
||||||
|
|
166
pep-0439.txt
166
pep-0439.txt
|
@ -45,6 +45,12 @@ Python installation the barrier to installing additional software is
|
||||||
considerably reduced. It is hoped that this will therefore increase
|
considerably reduced. It is hoped that this will therefore increase
|
||||||
the likelihood that Python projects will reuse third party software.
|
the likelihood that Python projects will reuse third party software.
|
||||||
|
|
||||||
|
The Python community also has an issue of complexity around the current
|
||||||
|
bootstrap procedure for pip and setuptools. They all have
|
||||||
|
their own bootstrap download file with slightly different usages and
|
||||||
|
even refer to each other in some cases. Having a single bootstrap which
|
||||||
|
is common amongst them all, with a simple usage, would be far preferable.
|
||||||
|
|
||||||
It is also hoped that this is reduces the number of proposals to
|
It is also hoped that this is reduces the number of proposals to
|
||||||
include more and more software in the Python standard library, and
|
include more and more software in the Python standard library, and
|
||||||
therefore that more popular Python software is more easily upgradeable
|
therefore that more popular Python software is more easily upgradeable
|
||||||
|
@ -54,23 +60,32 @@ beyond requiring Python installation upgrades.
|
||||||
Proposal
|
Proposal
|
||||||
========
|
========
|
||||||
|
|
||||||
This proposal affects three components of packaging: `the pip bootstrap`_,
|
The bootstrap will install the pip implementation, setuptools by downloading
|
||||||
`setuptools`_ and, thanks to easier package installation, `modifications to
|
their installation files from PyPI.
|
||||||
publishing packages`_.
|
|
||||||
|
This proposal affects two components of packaging: `the pip bootstrap`_ and,
|
||||||
|
thanks to easier package installation, `modifications to publishing
|
||||||
|
packages`_.
|
||||||
|
|
||||||
|
The core of this proposal is that the user experience of using pip should not
|
||||||
|
require the user to install pip.
|
||||||
|
|
||||||
|
|
||||||
The pip bootstrap
|
The pip bootstrap
|
||||||
-----------------
|
-----------------
|
||||||
|
|
||||||
The Python installation includes an executable called "pip3" (see PEP 394 for
|
The Python installation includes an executable called "pip3" (see PEP 394 for
|
||||||
naming rationale etc.) that attempts to import pip machinery. If it can
|
naming rationale etc.) that attempts to import pip machinery. If it can then
|
||||||
then the pip command proceeds as normal. If it cannot it will bootstrap pip by
|
the pip command proceeds as normal. If it cannot it will bootstrap pip by
|
||||||
downloading the pip implementation wheel file. Once installed, the pip command
|
downloading the pip implementation and setuptools wheel files. Hereafter the
|
||||||
proceeds as normal.
|
installation of the "pip implementation" will imply installation of setuptools
|
||||||
|
and virtualenv. Once installed, the pip command proceeds as normal. Once the
|
||||||
|
bootstrap process is complete the "pip3" command is no longer the bootstrap
|
||||||
|
but rather the full pip command.
|
||||||
|
|
||||||
A boostrap is used in the place of a the full pip code so that we
|
A boostrap is used in the place of a the full pip code so that we don't have
|
||||||
don't have to bundle pip and also the install tool is upgradeable
|
to bundle pip and also pip is upgradeable outside of the regular Python
|
||||||
outside of the regular Python upgrade timeframe and processes.
|
upgrade timeframe and processes.
|
||||||
|
|
||||||
To avoid issues with sudo we will have the bootstrap default to
|
To avoid issues with sudo we will have the bootstrap default to
|
||||||
installing the pip implementation to the per-user site-packages
|
installing the pip implementation to the per-user site-packages
|
||||||
|
@ -88,82 +103,58 @@ The bootstrap process will proceed as follows:
|
||||||
2. The user will invoke a pip command, typically "pip3 install
|
2. The user will invoke a pip command, typically "pip3 install
|
||||||
<package>", for example "pip3 install Django".
|
<package>", for example "pip3 install Django".
|
||||||
3. The boostrap script will attempt to import the pip implementation.
|
3. The boostrap script will attempt to import the pip implementation.
|
||||||
If this succeeds, the pip command is processed normally.
|
If this succeeds, the pip command is processed normally. Stop.
|
||||||
4. On failing to import the pip implementation the bootstrap notifies
|
4. On failing to import the pip implementation the bootstrap notifies
|
||||||
the user that it is "upgrading pip" and contacts PyPI to obtain the
|
the user that it needs to "install pip". It will ask the user whether it
|
||||||
latest download wheel file (see PEP 427.)
|
should install pip as a system-wide site-packages or as a user-only
|
||||||
5. Upon downloading the file it is installed using the distlib
|
package. This choice will also be present as a command-line option to pip
|
||||||
installation machinery for wheel packages. Upon completing the
|
so non-interactive use is possible.
|
||||||
installation the user is notified that "pip3 has been upgraded."
|
5. The bootstrap will and contact PyPI to obtain the latest download wheel
|
||||||
TODO how is it verified?
|
file (see PEP 427.)
|
||||||
6. The pip tool may now import the pip implementation and continues to
|
6. Upon downloading the file it is installed using "python setup.py install".
|
||||||
|
7. The pip tool may now import the pip implementation and continues to
|
||||||
process the requested user command normally.
|
process the requested user command normally.
|
||||||
|
|
||||||
Users may be running in an environment which cannot access the public
|
Users may be running in an environment which cannot access the public
|
||||||
Internet and are relying solely on a local package repository. They
|
Internet and are relying solely on a local package repository. They
|
||||||
would use the "-i" (Base URL of Python Package Index) argument to the
|
would use the "-i" (Base URL of Python Package Index) argument to the
|
||||||
"pip3 install" command. This use case will be handled by:
|
"pip3 install" command. This simply overrides the default index URL pointing
|
||||||
|
to PyPI.
|
||||||
|
|
||||||
1. Recognising the command-line arguments that specify alternative or
|
Some users may have no Internet access suitable for fetching the pip
|
||||||
additional locations to discover packages and attempting to
|
implementation file. These users can manually download and install the
|
||||||
download the package from those locations.
|
setuptools and pip tar files. Adding specific support for this use-case is
|
||||||
2. If the package is not found there then we attempt to donwload it
|
unnecessary.
|
||||||
using the standard "https://pypi.python.org/pypi/simple/pip" index.
|
|
||||||
3. If that also fails, for any reason, we indicate to the user the
|
|
||||||
operation we were attempting, the reason for failure (if we know
|
|
||||||
it) and display further instructions for downloading and installing
|
|
||||||
the file manually.
|
|
||||||
|
|
||||||
Manual installation of the pip implementation will be supported
|
The download of the pip implementation install file will be performed
|
||||||
through the manual download of the wheel file and "pip3 install
|
securely. The transport from pypi.python.org will be done over HTTPS with the
|
||||||
<downloaded wheel file>".
|
CA certificate check performed. This facility will be present in Python 3.4+
|
||||||
|
using Operating System certificates (see PEP XXXX).
|
||||||
This installation will not perform standard pip installation steps of
|
|
||||||
saving the file to a cache directory or updating any local database of
|
|
||||||
installed files.
|
|
||||||
|
|
||||||
The download of the pip implementation install file should be performed
|
|
||||||
securely. The transport from pypi.python.org will be done over HTTPS but the CA
|
|
||||||
certificate check will most likely not be performed, and therefore the download
|
|
||||||
would still be vulnerable to active MITM attacks. To mitigate this
|
|
||||||
risk we will use the embedded signature support in the wheel format to validate
|
|
||||||
the downloaded file.
|
|
||||||
|
|
||||||
Beyond those arguments controlling index location and download
|
Beyond those arguments controlling index location and download
|
||||||
options, the "pip3" boostrap command may support further standard pip
|
options, the "pip3" boostrap command may support further standard pip
|
||||||
options for verbosity, quietness and logging.
|
options for verbosity, quietness and logging.
|
||||||
|
|
||||||
|
The "pip3" command will support two new command-line options that are used
|
||||||
|
in the boostrapping, and otherwise ignored. They control where the pip
|
||||||
|
implementation is installed:
|
||||||
|
|
||||||
|
--bootstrap
|
||||||
|
Install to the user's packages directory. The name of this option is chosen
|
||||||
|
to promote it as the preferred installation option.
|
||||||
|
|
||||||
|
--bootstrap-to-system
|
||||||
|
Install to the system site-packages directory.
|
||||||
|
|
||||||
|
These command-line options will also need to be implemented, but otherwise
|
||||||
|
ignored, in the pip implementation.
|
||||||
|
|
||||||
|
Consideration should be given to defaulting pip to install packages to the
|
||||||
|
user's packages directory if pip is installed in that location.
|
||||||
|
|
||||||
The "--no-install" option to the "pip3" command will not affect the
|
The "--no-install" option to the "pip3" command will not affect the
|
||||||
bootstrapping process.
|
bootstrapping process.
|
||||||
|
|
||||||
setuptools
|
|
||||||
----------
|
|
||||||
|
|
||||||
The deprecation of requiring setuptools for installation is an existing goal of
|
|
||||||
the packaging comminity (TODO ref needed). Currently pip depends upon setuptools
|
|
||||||
functionality, and it is installed by the current pip boostrap. This PEP does
|
|
||||||
not propose installing setuptools during the new bootstrap.
|
|
||||||
|
|
||||||
It is intended that before Python 3.4 is shipped the functionlity required by
|
|
||||||
pip will be present in Python's standard library as the distlib module, and that
|
|
||||||
pip would be modified to use that functionality when present. TODO PEP reference
|
|
||||||
for distlib
|
|
||||||
|
|
||||||
Many existing "setup.py" files require setuptools to be installed (because one
|
|
||||||
of the first things they do is import setuptools). It is intended that pip's
|
|
||||||
behaviour will be either:
|
|
||||||
|
|
||||||
1. If setuptools is not present it can only install from wheel files and
|
|
||||||
sdists with 2.0+ metadata, or
|
|
||||||
2. If setuptools is present it can also install from sdists with legacy
|
|
||||||
metadata and eggs
|
|
||||||
|
|
||||||
By default, installing setuptools when necessary should be automatic so that
|
|
||||||
users are not inconvenienced, but advanced users should be able to ask that it
|
|
||||||
instead be treated as an error if no wheel is available to satisfy an
|
|
||||||
installation request or dependency (so they don't inadvertently install
|
|
||||||
setuptools on their production systems if they don't want to).
|
|
||||||
|
|
||||||
|
|
||||||
Modifications to publishing packages
|
Modifications to publishing packages
|
||||||
------------------------------------
|
------------------------------------
|
||||||
|
@ -189,22 +180,36 @@ Implementation
|
||||||
==============
|
==============
|
||||||
|
|
||||||
The changes to pip required by this PEP are being tracked in that project's
|
The changes to pip required by this PEP are being tracked in that project's
|
||||||
issue tracker [2]_
|
issue tracker [2]_. Most notably, the addition of --bootstrap and --bootstrap-
|
||||||
|
to-system to the pip command-line.
|
||||||
|
|
||||||
|
It would be preferable that the pip and setuptools projects distribute a wheel
|
||||||
|
format download.
|
||||||
|
|
||||||
|
The required code for this implementation is the "pip3" command described
|
||||||
|
above. The additional pypublish can be developed outside of the scope of this
|
||||||
|
PEP's work.
|
||||||
|
|
||||||
|
Finally, it would be desirable that "pip3" be ported to Python 2.6+ to allow
|
||||||
|
the single command to replace existing pip, setuptools and virtualenv (which
|
||||||
|
would be added to the bootstrap) bootstrap scripts. Having that bootstrap
|
||||||
|
included in a future Python 2.7 release would also be highly desirable.
|
||||||
|
|
||||||
|
|
||||||
Risks
|
Risks
|
||||||
=====
|
=====
|
||||||
|
|
||||||
The Fedora variant of Linux has had a separate program called "pip" (a
|
|
||||||
Perl package installer) available for install for some time. The
|
|
||||||
current Python "pip" program is installed as "pip-python". It is
|
|
||||||
hoped that the Fedora community will resolve this issue by renaming
|
|
||||||
the Perl installer.
|
|
||||||
|
|
||||||
The key that is used to sign the pip implementation download might be
|
The key that is used to sign the pip implementation download might be
|
||||||
compromised and this PEP currently proposes no mechanism for key
|
compromised and this PEP currently proposes no mechanism for key
|
||||||
revocation.
|
revocation.
|
||||||
|
|
||||||
|
There is a Perl package installer also named "pip". It is quite rare and not
|
||||||
|
commonly used. The Fedora variant of Linux has historically named Python's
|
||||||
|
"pip" as "python-pip" and Perl's "pip" as "perl-pip". This policy has been
|
||||||
|
altered[3] so that future and upgraded Fedora installations will use the name
|
||||||
|
"pip" for Python's "pip". Existing (non-upgraded) installations will still
|
||||||
|
have the old name for the Python "pip", though the potential for confusion is
|
||||||
|
now much reduced.
|
||||||
|
|
||||||
|
|
||||||
References
|
References
|
||||||
|
@ -216,6 +221,9 @@ References
|
||||||
.. [2] pip issue tracking work needed for this PEP
|
.. [2] pip issue tracking work needed for this PEP
|
||||||
https://github.com/pypa/pip/issues/863
|
https://github.com/pypa/pip/issues/863
|
||||||
|
|
||||||
|
.. [3] Fedora's python-pip package does not provide /usr/bin/pip
|
||||||
|
https://bugzilla.redhat.com/show_bug.cgi?id=958377
|
||||||
|
|
||||||
|
|
||||||
Acknowledgments
|
Acknowledgments
|
||||||
===============
|
===============
|
||||||
|
@ -223,7 +231,9 @@ Acknowledgments
|
||||||
Nick Coghlan for his thoughts on the proposal and dealing with the Red
|
Nick Coghlan for his thoughts on the proposal and dealing with the Red
|
||||||
Hat issue.
|
Hat issue.
|
||||||
|
|
||||||
Jannis Leidel and Carl Meyer for their thoughts.
|
Jannis Leidel and Carl Meyer for their thoughts. Marcus Smith for feedback.
|
||||||
|
|
||||||
|
Marcela Mašláňová for resolving the Fedora issue.
|
||||||
|
|
||||||
|
|
||||||
Copyright
|
Copyright
|
||||||
|
|
359
pep-0440.txt
359
pep-0440.txt
|
@ -9,7 +9,7 @@ Status: Draft
|
||||||
Type: Standards Track
|
Type: Standards Track
|
||||||
Content-Type: text/x-rst
|
Content-Type: text/x-rst
|
||||||
Created: 18 Mar 2013
|
Created: 18 Mar 2013
|
||||||
Post-History: 30 Mar 2013, 27-May-2013
|
Post-History: 30 Mar 2013, 27 May 2013, 20 Jun 2013
|
||||||
Replaces: 386
|
Replaces: 386
|
||||||
|
|
||||||
|
|
||||||
|
@ -27,7 +27,7 @@ standardised approach to versioning, as described in PEP 345 and PEP 386.
|
||||||
This PEP was broken out of the metadata 2.0 specification in PEP 426.
|
This PEP was broken out of the metadata 2.0 specification in PEP 426.
|
||||||
|
|
||||||
Unlike PEP 426, the notes that remain in this document are intended as
|
Unlike PEP 426, the notes that remain in this document are intended as
|
||||||
part of the final specification.
|
part of the final specification (except for this one).
|
||||||
|
|
||||||
|
|
||||||
Definitions
|
Definitions
|
||||||
|
@ -40,7 +40,7 @@ document are to be interpreted as described in RFC 2119.
|
||||||
The following terms are to be interpreted as described in PEP 426:
|
The following terms are to be interpreted as described in PEP 426:
|
||||||
|
|
||||||
* "Distributions"
|
* "Distributions"
|
||||||
* "Versions"
|
* "Releases"
|
||||||
* "Build tools"
|
* "Build tools"
|
||||||
* "Index servers"
|
* "Index servers"
|
||||||
* "Publication tools"
|
* "Publication tools"
|
||||||
|
@ -52,9 +52,13 @@ The following terms are to be interpreted as described in PEP 426:
|
||||||
Version scheme
|
Version scheme
|
||||||
==============
|
==============
|
||||||
|
|
||||||
Distribution versions are identified by both a public version identifier,
|
Distributions are identified by a public version identifier which
|
||||||
which supports all defined version comparison operations, and a build
|
supports all defined version comparison operations
|
||||||
label, which supports only strict equality comparisons.
|
|
||||||
|
Distributions may also define a source label, which is not used by
|
||||||
|
automated tools. Source labels are useful when a project internal
|
||||||
|
versioning scheme requires translation to create a compliant public
|
||||||
|
version identifier.
|
||||||
|
|
||||||
The version scheme is used both to describe the distribution version
|
The version scheme is used both to describe the distribution version
|
||||||
provided by a particular distribution archive, as well as to place
|
provided by a particular distribution archive, as well as to place
|
||||||
|
@ -84,7 +88,7 @@ Public version identifiers are separated into up to four segments:
|
||||||
* Post-release segment: ``.postN``
|
* Post-release segment: ``.postN``
|
||||||
* Development release segment: ``.devN``
|
* Development release segment: ``.devN``
|
||||||
|
|
||||||
Any given version will be a "release", "pre-release", "post-release" or
|
Any given release will be a "final release", "pre-release", "post-release" or
|
||||||
"developmental release" as defined in the following sections.
|
"developmental release" as defined in the following sections.
|
||||||
|
|
||||||
.. note::
|
.. note::
|
||||||
|
@ -99,34 +103,43 @@ Any given version will be a "release", "pre-release", "post-release" or
|
||||||
sections.
|
sections.
|
||||||
|
|
||||||
|
|
||||||
Build labels
|
Source labels
|
||||||
------------
|
-------------
|
||||||
|
|
||||||
Build labels are text strings with minimal defined semantics.
|
Source labels are text strings with minimal defined semantics.
|
||||||
|
|
||||||
To ensure build labels can be readily incorporated as part of file names
|
To ensure source labels can be readily incorporated as part of file names
|
||||||
and URLs, they MUST be comprised of only ASCII alphanumerics, plus signs,
|
and URLs, and to avoid formatting inconsistences in hexadecimal hash
|
||||||
periods and hyphens.
|
representations they MUST be limited to the following set of permitted
|
||||||
|
characters:
|
||||||
|
|
||||||
In addition, build labels MUST be unique within a given distribution.
|
* Lowercase ASCII letters (``[a-z]``)
|
||||||
|
* ASCII digits (``[0-9]``)
|
||||||
|
* underscores (``_``)
|
||||||
|
* hyphens (``-``)
|
||||||
|
* periods (``.``)
|
||||||
|
* plus signs (``+``)
|
||||||
|
|
||||||
As with distribution names, all comparisons of build labels MUST be case
|
Source labels MUST start and end with an ASCII letter or digit.
|
||||||
insensitive.
|
|
||||||
|
Source labels MUST be unique within each project and MUST NOT match any
|
||||||
|
defined version for the project.
|
||||||
|
|
||||||
|
|
||||||
Releases
|
Final releases
|
||||||
--------
|
--------------
|
||||||
|
|
||||||
A version identifier that consists solely of a release segment is termed
|
A version identifier that consists solely of a release segment is
|
||||||
a "release".
|
termed a "final release".
|
||||||
|
|
||||||
The release segment consists of one or more non-negative integer values,
|
The release segment consists of one or more non-negative integer
|
||||||
separated by dots::
|
values, separated by dots::
|
||||||
|
|
||||||
N[.N]+
|
N[.N]+
|
||||||
|
|
||||||
Releases within a project will typically be numbered in a consistently
|
Final releases within a project MUST be numbered in a consistently
|
||||||
increasing fashion.
|
increasing fashion, otherwise automated tools will not be able to upgrade
|
||||||
|
them correctly.
|
||||||
|
|
||||||
Comparison and ordering of release segments considers the numeric value
|
Comparison and ordering of release segments considers the numeric value
|
||||||
of each component of the release segment in turn. When comparing release
|
of each component of the release segment in turn. When comparing release
|
||||||
|
@ -157,8 +170,8 @@ For example::
|
||||||
2.0
|
2.0
|
||||||
2.0.1
|
2.0.1
|
||||||
|
|
||||||
A release series is any set of release numbers that start with a common
|
A release series is any set of final release numbers that start with a
|
||||||
prefix. For example, ``3.3.1``, ``3.3.5`` and ``3.3.9.45`` are all
|
common prefix. For example, ``3.3.1``, ``3.3.5`` and ``3.3.9.45`` are all
|
||||||
part of the ``3.3`` release series.
|
part of the ``3.3`` release series.
|
||||||
|
|
||||||
.. note::
|
.. note::
|
||||||
|
@ -206,8 +219,8 @@ of both ``c`` and ``rc`` releases for a common release segment.
|
||||||
Post-releases
|
Post-releases
|
||||||
-------------
|
-------------
|
||||||
|
|
||||||
Some projects use post-releases to address minor errors in a release that
|
Some projects use post-releases to address minor errors in a final release
|
||||||
do not affect the distributed software (for example, correcting an error
|
that do not affect the distributed software (for example, correcting an error
|
||||||
in the release notes).
|
in the release notes).
|
||||||
|
|
||||||
If used as part of a project's development cycle, these post-releases are
|
If used as part of a project's development cycle, these post-releases are
|
||||||
|
@ -371,7 +384,7 @@ are permitted and MUST be ordered as shown::
|
||||||
.devN, aN, bN, cN, rcN, <no suffix>, .postN
|
.devN, aN, bN, cN, rcN, <no suffix>, .postN
|
||||||
|
|
||||||
Note that `rc` will always sort after `c` (regardless of the numeric
|
Note that `rc` will always sort after `c` (regardless of the numeric
|
||||||
component) although they are semantically equivalent. Tools are free to
|
component) although they are semantically equivalent. Tools MAY
|
||||||
reject this case as ambiguous and remain in compliance with the PEP.
|
reject this case as ambiguous and remain in compliance with the PEP.
|
||||||
|
|
||||||
Within an alpha (``1.0a1``), beta (``1.0b1``), or release candidate
|
Within an alpha (``1.0a1``), beta (``1.0b1``), or release candidate
|
||||||
|
@ -444,7 +457,7 @@ Compatibility with other version schemes
|
||||||
|
|
||||||
Some projects may choose to use a version scheme which requires
|
Some projects may choose to use a version scheme which requires
|
||||||
translation in order to comply with the public version scheme defined in
|
translation in order to comply with the public version scheme defined in
|
||||||
this PEP. In such cases, the build label can be used to
|
this PEP. In such cases, the source label can be used to
|
||||||
record the project specific version as an arbitrary label, while the
|
record the project specific version as an arbitrary label, while the
|
||||||
translated public version is published in the version field.
|
translated public version is published in the version field.
|
||||||
|
|
||||||
|
@ -488,7 +501,7 @@ identifier. As hashes cannot be ordered reliably such versions are not
|
||||||
permitted in the public version field.
|
permitted in the public version field.
|
||||||
|
|
||||||
As with semantic versioning, the public ``.devN`` suffix may be used to
|
As with semantic versioning, the public ``.devN`` suffix may be used to
|
||||||
uniquely identify such releases for publication, while the build label is
|
uniquely identify such releases for publication, while the source label is
|
||||||
used to record the original DVCS based version label.
|
used to record the original DVCS based version label.
|
||||||
|
|
||||||
|
|
||||||
|
@ -496,7 +509,7 @@ Date based versions
|
||||||
~~~~~~~~~~~~~~~~~~~
|
~~~~~~~~~~~~~~~~~~~
|
||||||
|
|
||||||
As with other incompatible version schemes, date based versions can be
|
As with other incompatible version schemes, date based versions can be
|
||||||
stored in the build label field. Translating them to a compliant
|
stored in the source label field. Translating them to a compliant
|
||||||
public version is straightforward: use a leading ``"0."`` prefix in the
|
public version is straightforward: use a leading ``"0."`` prefix in the
|
||||||
public version label, with the date based version number as the remaining
|
public version label, with the date based version number as the remaining
|
||||||
components in the release segment.
|
components in the release segment.
|
||||||
|
@ -506,6 +519,22 @@ numbering based on API compatibility, as well as triggering more appropriate
|
||||||
version comparison semantics.
|
version comparison semantics.
|
||||||
|
|
||||||
|
|
||||||
|
Olson database versioning
|
||||||
|
~~~~~~~~~~~~~~~~~~~~~~~~~
|
||||||
|
|
||||||
|
The ``pytz`` project inherits its versioning scheme from the corresponding
|
||||||
|
Olson timezone database versioning scheme: the year followed by a lowercase
|
||||||
|
character indicating the version of the database within that year.
|
||||||
|
|
||||||
|
This can be translated to a compliant 3-part version identifier as
|
||||||
|
``0.<year>.<serial>``, where the serial starts at zero (for the '<year>a'
|
||||||
|
release) and is incremented with each subsequent database update within the
|
||||||
|
year.
|
||||||
|
|
||||||
|
As with other translated version identifiers, the corresponding Olson
|
||||||
|
database version would be recorded in the source label field.
|
||||||
|
|
||||||
|
|
||||||
Version specifiers
|
Version specifiers
|
||||||
==================
|
==================
|
||||||
|
|
||||||
|
@ -521,7 +550,6 @@ clause:
|
||||||
* ``~=``: `Compatible release`_ clause
|
* ``~=``: `Compatible release`_ clause
|
||||||
* ``==``: `Version matching`_ clause
|
* ``==``: `Version matching`_ clause
|
||||||
* ``!=``: `Version exclusion`_ clause
|
* ``!=``: `Version exclusion`_ clause
|
||||||
* ``is``: `Build reference`_ clause
|
|
||||||
* ``<=``, ``>=``: `Inclusive ordered comparison`_ clause
|
* ``<=``, ``>=``: `Inclusive ordered comparison`_ clause
|
||||||
* ``<``, ``>``: `Exclusive ordered comparison`_ clause
|
* ``<``, ``>``: `Exclusive ordered comparison`_ clause
|
||||||
|
|
||||||
|
@ -605,6 +633,11 @@ version. The *only* substitution performed is the zero padding of the
|
||||||
release segment to ensure the release segments are compared with the same
|
release segment to ensure the release segments are compared with the same
|
||||||
length.
|
length.
|
||||||
|
|
||||||
|
Whether or not strict version matching is appropriate depends on the specific
|
||||||
|
use case for the version specifier. Automated tools SHOULD at least issue
|
||||||
|
warnings and MAY reject them entirely when strict version matches are used
|
||||||
|
inappropriately.
|
||||||
|
|
||||||
Prefix matching may be requested instead of strict comparison, by appending
|
Prefix matching may be requested instead of strict comparison, by appending
|
||||||
a trailing ``.*`` to the version identifier in the version matching clause.
|
a trailing ``.*`` to the version identifier in the version matching clause.
|
||||||
This means that additional trailing segments will be ignored when
|
This means that additional trailing segments will be ignored when
|
||||||
|
@ -626,10 +659,6 @@ comparison operator is intended primarily for use when defining
|
||||||
dependencies for repeatable *deployments of applications* while using
|
dependencies for repeatable *deployments of applications* while using
|
||||||
a shared distribution index.
|
a shared distribution index.
|
||||||
|
|
||||||
Publication tools and index servers SHOULD at least emit a warning when
|
|
||||||
dependencies are pinned in this fashion and MAY refuse to allow publication
|
|
||||||
of such overly specific dependencies.
|
|
||||||
|
|
||||||
|
|
||||||
Version exclusion
|
Version exclusion
|
||||||
-----------------
|
-----------------
|
||||||
|
@ -649,74 +678,6 @@ match or not as shown::
|
||||||
!= 1.1.* # Same prefix, so 1.1.post1 does not match clause
|
!= 1.1.* # Same prefix, so 1.1.post1 does not match clause
|
||||||
|
|
||||||
|
|
||||||
Build reference
|
|
||||||
---------------
|
|
||||||
|
|
||||||
A build reference includes the build reference operator ``is`` and
|
|
||||||
a build label or a build URL.
|
|
||||||
|
|
||||||
Publication tools and public index servers SHOULD NOT permit build
|
|
||||||
references in dependency specifications.
|
|
||||||
|
|
||||||
Installation tools SHOULD support the use of build references to identify
|
|
||||||
dependencies.
|
|
||||||
|
|
||||||
Build label matching works solely on strict equality comparisons: the
|
|
||||||
candidate build label must be exactly the same as the build label in the
|
|
||||||
version clause for the clause to match the candidate distribution.
|
|
||||||
|
|
||||||
For example, a build reference could be used to depend on a ``hashdist``
|
|
||||||
generated build of ``zlib`` with the ``hashdist`` hash used as a build
|
|
||||||
label::
|
|
||||||
|
|
||||||
zlib (is d4jwf2sb2g6glprsdqfdpcracwpzujwq)
|
|
||||||
|
|
||||||
A build URL is distinguished from a build label by the presence of
|
|
||||||
``:`` and ``/`` characters in the build reference. As these characters
|
|
||||||
are not permitted in build labels, they indicate that the reference uses
|
|
||||||
a build URL.
|
|
||||||
|
|
||||||
Some appropriate targets for a build URL are a binary archive, a
|
|
||||||
source tarball, an sdist archive or a direct reference to a tag or
|
|
||||||
specific commit in an online version control system. The exact URLs and
|
|
||||||
targets supported will be installation tool specific.
|
|
||||||
|
|
||||||
For example, a local prebuilt wheel file may be referenced directly::
|
|
||||||
|
|
||||||
exampledist (is file:///localbuilds/exampledist-1.0-py33-none-any.whl)
|
|
||||||
|
|
||||||
All build URL references SHOULD either specify a local file URL, a secure
|
|
||||||
transport mechanism (such as ``https``) or else include an expected hash
|
|
||||||
value in the URL for verification purposes. If an insecure network
|
|
||||||
transport is specified without any hash information (or with hash
|
|
||||||
information that the tool doesn't understand), automated tools SHOULD
|
|
||||||
at least emit a warning and MAY refuse to rely on the URL.
|
|
||||||
|
|
||||||
It is RECOMMENDED that only hashes which are unconditionally provided by
|
|
||||||
the latest version of the standard library's ``hashlib`` module be used
|
|
||||||
for source archive hashes. At time of writing, that list consists of
|
|
||||||
``'md5'``, ``'sha1'``, ``'sha224'``, ``'sha256'``, ``'sha384'``, and
|
|
||||||
``'sha512'``.
|
|
||||||
|
|
||||||
For binary or source archive references, an expected hash value may be
|
|
||||||
specified by including a ``<hash-algorithm>=<expected-hash>`` as part of
|
|
||||||
the URL fragment.
|
|
||||||
|
|
||||||
For version control references, the ``VCS+protocol`` scheme SHOULD be
|
|
||||||
used to identify both the version control system and the secure transport.
|
|
||||||
|
|
||||||
To support version control systems that do not support including commit or
|
|
||||||
tag references directly in the URL, that information may be appended to the
|
|
||||||
end of the URL using the ``@<tag>`` notation.
|
|
||||||
|
|
||||||
The use of ``is`` when defining dependencies for published distributions
|
|
||||||
is strongly discouraged as it greatly complicates the deployment of
|
|
||||||
security fixes. The build label matching operator is intended primarily
|
|
||||||
for use when defining dependencies for repeatable *deployments of
|
|
||||||
applications* while using a shared distribution index, as well as to
|
|
||||||
reference dependencies which are not published through an index server.
|
|
||||||
|
|
||||||
|
|
||||||
Inclusive ordered comparison
|
Inclusive ordered comparison
|
||||||
----------------------------
|
----------------------------
|
||||||
|
|
||||||
|
@ -755,62 +716,108 @@ Handling of pre-releases
|
||||||
------------------------
|
------------------------
|
||||||
|
|
||||||
Pre-releases of any kind, including developmental releases, are implicitly
|
Pre-releases of any kind, including developmental releases, are implicitly
|
||||||
excluded from all version specifiers, *unless* a pre-release or developmental
|
excluded from all version specifiers, *unless* they are already present
|
||||||
release is explicitly mentioned in one of the clauses. For example, these
|
on the system, explicitly requested by the user, or if the only available
|
||||||
specifiers implicitly exclude all pre-releases and development
|
version that satisfies the version specifier is a pre-release.
|
||||||
releases of later versions::
|
|
||||||
|
|
||||||
2.2
|
|
||||||
>= 1.0
|
|
||||||
|
|
||||||
While these specifiers would include at least some of them::
|
|
||||||
|
|
||||||
2.2.dev0
|
|
||||||
2.2, != 2.3b2
|
|
||||||
>= 1.0a1
|
|
||||||
>= 1.0c1
|
|
||||||
>= 1.0, != 1.0b2
|
|
||||||
>= 1.0, < 2.0.dev123
|
|
||||||
|
|
||||||
By default, dependency resolution tools SHOULD:
|
By default, dependency resolution tools SHOULD:
|
||||||
|
|
||||||
* accept already installed pre-releases for all version specifiers
|
* accept already installed pre-releases for all version specifiers
|
||||||
* accept remotely available pre-releases for version specifiers which
|
* accept remotely available pre-releases for version specifiers where
|
||||||
include at least one version clauses that references a pre-release
|
there is no final or post release that satisfies the version specifier
|
||||||
* exclude all other pre-releases from consideration
|
* exclude all other pre-releases from consideration
|
||||||
|
|
||||||
|
Dependency resolution tools MAY issue a warning if a pre-release is needed
|
||||||
|
to satisfy a version specifier.
|
||||||
|
|
||||||
Dependency resolution tools SHOULD also allow users to request the
|
Dependency resolution tools SHOULD also allow users to request the
|
||||||
following alternative behaviours:
|
following alternative behaviours:
|
||||||
|
|
||||||
* accepting pre-releases for all version specifiers
|
* accepting pre-releases for all version specifiers
|
||||||
* excluding pre-releases for all version specifiers (reporting an error or
|
* excluding pre-releases for all version specifiers (reporting an error or
|
||||||
warning if a pre-release is already installed locally)
|
warning if a pre-release is already installed locally, or if a
|
||||||
|
pre-release is the only way to satisfy a particular specifier)
|
||||||
|
|
||||||
Dependency resolution tools MAY also allow the above behaviour to be
|
Dependency resolution tools MAY also allow the above behaviour to be
|
||||||
controlled on a per-distribution basis.
|
controlled on a per-distribution basis.
|
||||||
|
|
||||||
Post-releases and purely numeric releases receive no special treatment in
|
Post-releases and final releases receive no special treatment in version
|
||||||
version specifiers - they are always included unless explicitly excluded.
|
specifiers - they are always included unless explicitly excluded.
|
||||||
|
|
||||||
|
|
||||||
Examples
|
Examples
|
||||||
--------
|
--------
|
||||||
|
|
||||||
* ``3.1``: version 3.1 or later, but not
|
* ``3.1``: version 3.1 or later, but not version 4.0 or later.
|
||||||
version 4.0 or later. Excludes pre-releases and developmental releases.
|
* ``3.1.2``: version 3.1.2 or later, but not version 3.2.0 or later.
|
||||||
* ``3.1.2``: version 3.1.2 or later, but not
|
* ``3.1a1``: version 3.1a1 or later, but not version 4.0 or later.
|
||||||
version 3.2.0 or later. Excludes pre-releases and developmental releases.
|
|
||||||
* ``3.1a1``: version 3.1a1 or later, but not
|
|
||||||
version 4.0 or later. Allows pre-releases like 3.2a4 and developmental
|
|
||||||
releases like 3.2.dev1.
|
|
||||||
* ``== 3.1``: specifically version 3.1 (or 3.1.0), excludes all pre-releases,
|
* ``== 3.1``: specifically version 3.1 (or 3.1.0), excludes all pre-releases,
|
||||||
post releases, developmental releases and any 3.1.x maintenance releases.
|
post releases, developmental releases and any 3.1.x maintenance releases.
|
||||||
* ``== 3.1.*``: any version that starts with 3.1, excluding pre-releases and
|
* ``== 3.1.*``: any version that starts with 3.1. Equivalent to the
|
||||||
developmental releases. Equivalent to the ``3.1.0`` compatible release
|
``3.1.0`` compatible release clause.
|
||||||
clause.
|
|
||||||
* ``3.1.0, != 3.1.3``: version 3.1.0 or later, but not version 3.1.3 and
|
* ``3.1.0, != 3.1.3``: version 3.1.0 or later, but not version 3.1.3 and
|
||||||
not version 3.2.0 or later. Excludes pre-releases and developmental
|
not version 3.2.0 or later.
|
||||||
releases.
|
|
||||||
|
|
||||||
|
Direct references
|
||||||
|
=================
|
||||||
|
|
||||||
|
Some automated tools may permit the use of a direct reference as an
|
||||||
|
alternative to a normal version specifier. A direct reference consists of
|
||||||
|
the word ``from`` and an explicit URL.
|
||||||
|
|
||||||
|
Whether or not direct references are appropriate depends on the specific
|
||||||
|
use case for the version specifier. Automated tools SHOULD at least issue
|
||||||
|
warnings and MAY reject them entirely when direct references are used
|
||||||
|
inappropriately.
|
||||||
|
|
||||||
|
Public index servers SHOULD NOT allow the use of direct references in
|
||||||
|
uploaded distributions. Direct references are intended as a tool for
|
||||||
|
software integrators rather than publishers.
|
||||||
|
|
||||||
|
Depending on the use case, some appropriate targets for a direct URL
|
||||||
|
reference may be a valid ``source_url`` entry (see PEP 426), an sdist, or
|
||||||
|
a wheel binary archive. The exact URLs and targets supported will be tool
|
||||||
|
dependent.
|
||||||
|
|
||||||
|
For example, a local source archive may be referenced directly::
|
||||||
|
|
||||||
|
pip (from file:///localbuilds/pip-1.3.1.zip)
|
||||||
|
|
||||||
|
Alternatively, a prebuilt archive may also be referenced::
|
||||||
|
|
||||||
|
pip (from file:///localbuilds/pip-1.3.1-py33-none-any.whl)
|
||||||
|
|
||||||
|
All direct references that do not refer to a local file URL SHOULD
|
||||||
|
specify a secure transport mechanism (such as ``https``), include an
|
||||||
|
expected hash value in the URL for verification purposes, or both. If an
|
||||||
|
insecure transport is specified without any hash information, with hash
|
||||||
|
information that the tool doesn't understand, or with a selected hash
|
||||||
|
algorithm that the tool considers too weak to trust, automated tools
|
||||||
|
SHOULD at least emit a warning and MAY refuse to rely on the URL.
|
||||||
|
|
||||||
|
It is RECOMMENDED that only hashes which are unconditionally provided by
|
||||||
|
the latest version of the standard library's ``hashlib`` module be used
|
||||||
|
for source archive hashes. At time of writing, that list consists of
|
||||||
|
``'md5'``, ``'sha1'``, ``'sha224'``, ``'sha256'``, ``'sha384'``, and
|
||||||
|
``'sha512'``.
|
||||||
|
|
||||||
|
For source archive and wheel references, an expected hash value may be
|
||||||
|
specified by including a ``<hash-algorithm>=<expected-hash>`` entry as
|
||||||
|
part of the URL fragment.
|
||||||
|
|
||||||
|
Version control references, the ``VCS+protocol`` scheme SHOULD be
|
||||||
|
used to identify both the version control system and the secure transport.
|
||||||
|
|
||||||
|
To support version control systems that do not support including commit or
|
||||||
|
tag references directly in the URL, that information may be appended to the
|
||||||
|
end of the URL using the ``@<tag>`` notation.
|
||||||
|
|
||||||
|
Remote URL examples::
|
||||||
|
|
||||||
|
pip (from https://github.com/pypa/pip/archive/1.3.1.zip)
|
||||||
|
pip (from http://github.com/pypa/pip/archive/1.3.1.zip#sha1=da9234ee9982d4bbb3c72346a6de940a148ea686)
|
||||||
|
pip (from git+https://github.com/pypa/pip.git@1.3.1)
|
||||||
|
|
||||||
|
|
||||||
Updating the versioning specification
|
Updating the versioning specification
|
||||||
|
@ -823,56 +830,45 @@ Actually changing the version comparison semantics still requires a new
|
||||||
versioning scheme and metadata version defined in new PEPs.
|
versioning scheme and metadata version defined in new PEPs.
|
||||||
|
|
||||||
|
|
||||||
Open issues
|
|
||||||
===========
|
|
||||||
|
|
||||||
* The new ``is`` operator seems like a reasonable way to cleanly allow
|
|
||||||
installation tools to bring in non-published dependencies, while heavily
|
|
||||||
discouraging the practice for published libraries. It also makes
|
|
||||||
build labels more useful by allowing them to be used to pin dependencies
|
|
||||||
in the integration use case.
|
|
||||||
|
|
||||||
However, it's an early draft of the idea, so feedback is definitely
|
|
||||||
welcome.
|
|
||||||
|
|
||||||
|
|
||||||
Summary of differences from \PEP 386
|
Summary of differences from \PEP 386
|
||||||
====================================
|
====================================
|
||||||
|
|
||||||
* Moved the description of version specifiers into the versioning PEP
|
* Moved the description of version specifiers into the versioning PEP
|
||||||
|
|
||||||
* added the "build label" concept to better handle projects that wish to
|
* Added the "source label" concept to better handle projects that wish to
|
||||||
use a non-compliant versioning scheme internally, especially those based
|
use a non-compliant versioning scheme internally, especially those based
|
||||||
on DVCS hashes
|
on DVCS hashes
|
||||||
|
|
||||||
* added the "compatible release" clause
|
* Added the "direct reference" concept as a standard notation for direct
|
||||||
|
references to resources (rather than each tool needing to invents its own)
|
||||||
|
|
||||||
* added the "build reference" clause
|
* Added the "compatible release" clause
|
||||||
|
|
||||||
* added the trailing wildcard syntax for prefix based version matching
|
* Added the trailing wildcard syntax for prefix based version matching
|
||||||
and exclusion
|
and exclusion
|
||||||
|
|
||||||
* changed the top level sort position of the ``.devN`` suffix
|
* Changed the top level sort position of the ``.devN`` suffix
|
||||||
|
|
||||||
* allowed single value version numbers
|
* Allowed single value version numbers
|
||||||
|
|
||||||
* explicit exclusion of leading or trailing whitespace
|
* Explicit exclusion of leading or trailing whitespace
|
||||||
|
|
||||||
* explicit criterion for the exclusion of date based versions
|
* Explicit criterion for the exclusion of date based versions
|
||||||
|
|
||||||
* implicitly exclude pre-releases unless explicitly requested
|
* Implicitly exclude pre-releases unless they're already present or
|
||||||
|
needed to satisfy a dependency
|
||||||
|
|
||||||
* treat post releases the same way as unqualified releases
|
* Treat post releases the same way as unqualified releases
|
||||||
|
|
||||||
* Discuss ordering and dependencies across metadata versions
|
* Discuss ordering and dependencies across metadata versions
|
||||||
|
|
||||||
The rationale for major changes is given in the following sections.
|
The rationale for major changes is given in the following sections.
|
||||||
|
|
||||||
|
|
||||||
Adding build labels
|
Adding source labels
|
||||||
-------------------
|
--------------------
|
||||||
|
|
||||||
The new build label support is intended to make it clearer that the
|
The new source label support is intended to make it clearer that the
|
||||||
constraints on public version identifiers are there primarily to aid in
|
constraints on public version identifiers are there primarily to aid in
|
||||||
the creation of reliable automated dependency analysis tools. Projects
|
the creation of reliable automated dependency analysis tools. Projects
|
||||||
are free to use whatever versioning scheme they like internally, so long
|
are free to use whatever versioning scheme they like internally, so long
|
||||||
|
@ -1011,11 +1007,12 @@ The previous interpretation also excluded post-releases from some version
|
||||||
specifiers for no adequately justified reason.
|
specifiers for no adequately justified reason.
|
||||||
|
|
||||||
The updated interpretation is intended to make it difficult to accidentally
|
The updated interpretation is intended to make it difficult to accidentally
|
||||||
accept a pre-release version as satisfying a dependency, while allowing
|
accept a pre-release version as satisfying a dependency, while still
|
||||||
pre-release versions to be explicitly requested when needed.
|
allowing pre-release versions to be retrieved automatically when that's the
|
||||||
|
only way to satisfy a dependency.
|
||||||
|
|
||||||
The "some forward compatibility assumed" default version constraint is
|
The "some forward compatibility assumed" default version constraint is
|
||||||
taken directly from the Ruby community's "pessimistic version constraint"
|
derived from the Ruby community's "pessimistic version constraint"
|
||||||
operator [2]_ to allow projects to take a cautious approach to forward
|
operator [2]_ to allow projects to take a cautious approach to forward
|
||||||
compatibility promises, while still easily setting a minimum required
|
compatibility promises, while still easily setting a minimum required
|
||||||
version for their dependencies. It is made the default behaviour rather
|
version for their dependencies. It is made the default behaviour rather
|
||||||
|
@ -1038,16 +1035,26 @@ improved tools for dynamic path manipulation.
|
||||||
|
|
||||||
The trailing wildcard syntax to request prefix based version matching was
|
The trailing wildcard syntax to request prefix based version matching was
|
||||||
added to make it possible to sensibly define both compatible release clauses
|
added to make it possible to sensibly define both compatible release clauses
|
||||||
and the desired pre-release handling semantics for ``<`` and ``>`` ordered
|
and the desired pre- and post-release handling semantics for ``<`` and ``>``
|
||||||
comparison clauses.
|
ordered comparison clauses.
|
||||||
|
|
||||||
Build references are added for two purposes. In conjunction with build
|
|
||||||
labels, they allow hash based references, such as those employed by
|
Adding direct references
|
||||||
`hashdist <http://hashdist.readthedocs.org/en/latest/build_spec.html>`__,
|
------------------------
|
||||||
or generated from version control. In conjunction with build URLs, they
|
|
||||||
allow the new metadata standard to natively support an existing feature of
|
Direct references are added as an "escape clause" to handle messy real
|
||||||
``pip``, which allows arbitrary URLs like
|
world situations that don't map neatly to the standard distribution model.
|
||||||
``file:///localbuilds/exampledist-1.0-py33-none-any.whl``.
|
This includes dependencies on unpublished software for internal use, as well
|
||||||
|
as handling the more complex compatibility issues that may arise when
|
||||||
|
wrapping third party libraries as C extensions (this is of especial concern
|
||||||
|
to the scientific community).
|
||||||
|
|
||||||
|
Index servers are deliberately given a lot of freedom to disallow direct
|
||||||
|
references, since they're intended primarily as a tool for integrators
|
||||||
|
rather than publishers. PyPI in particular is currently going through the
|
||||||
|
process of *eliminating* dependencies on external references, as unreliable
|
||||||
|
external services have the effect of slowing down installation operations,
|
||||||
|
as well as reducing PyPI's own apparent reliability.
|
||||||
|
|
||||||
|
|
||||||
References
|
References
|
||||||
|
|
|
@ -4,13 +4,13 @@ Version: $Revision$
|
||||||
Last-Modified: $Date$
|
Last-Modified: $Date$
|
||||||
Author: Antoine Pitrou <solipsis@pitrou.net>
|
Author: Antoine Pitrou <solipsis@pitrou.net>
|
||||||
BDFL-Delegate: Benjamin Peterson <benjamin@python.org>
|
BDFL-Delegate: Benjamin Peterson <benjamin@python.org>
|
||||||
Status: Draft
|
Status: Final
|
||||||
Type: Standards Track
|
Type: Standards Track
|
||||||
Content-Type: text/x-rst
|
Content-Type: text/x-rst
|
||||||
Created: 2013-05-18
|
Created: 2013-05-18
|
||||||
Python-Version: 3.4
|
Python-Version: 3.4
|
||||||
Post-History: 2013-05-18
|
Post-History: 2013-05-18
|
||||||
Resolution: TBD
|
Resolution: http://mail.python.org/pipermail/python-dev/2013-June/126746.html
|
||||||
|
|
||||||
|
|
||||||
Abstract
|
Abstract
|
||||||
|
@ -201,8 +201,7 @@ Predictability
|
||||||
--------------
|
--------------
|
||||||
|
|
||||||
Following this scheme, an object's finalizer is always called exactly
|
Following this scheme, an object's finalizer is always called exactly
|
||||||
once. The only exception is if an object is resurrected: the finalizer
|
once, even if it was resurrected afterwards.
|
||||||
will be called again when the object becomes unreachable again.
|
|
||||||
|
|
||||||
For CI objects, the order in which finalizers are called (step 2 above)
|
For CI objects, the order in which finalizers are called (step 2 above)
|
||||||
is undefined.
|
is undefined.
|
||||||
|
|
112
pep-0443.txt
112
pep-0443.txt
|
@ -4,11 +4,11 @@ Version: $Revision$
|
||||||
Last-Modified: $Date$
|
Last-Modified: $Date$
|
||||||
Author: Łukasz Langa <lukasz@langa.pl>
|
Author: Łukasz Langa <lukasz@langa.pl>
|
||||||
Discussions-To: Python-Dev <python-dev@python.org>
|
Discussions-To: Python-Dev <python-dev@python.org>
|
||||||
Status: Draft
|
Status: Final
|
||||||
Type: Standards Track
|
Type: Standards Track
|
||||||
Content-Type: text/x-rst
|
Content-Type: text/x-rst
|
||||||
Created: 22-May-2013
|
Created: 22-May-2013
|
||||||
Post-History: 22-May-2013, 25-May-2013
|
Post-History: 22-May-2013, 25-May-2013, 31-May-2013
|
||||||
Replaces: 245, 246, 3124
|
Replaces: 245, 246, 3124
|
||||||
|
|
||||||
|
|
||||||
|
@ -44,11 +44,14 @@ However, it currently:
|
||||||
|
|
||||||
In addition, it is currently a common anti-pattern for Python code to
|
In addition, it is currently a common anti-pattern for Python code to
|
||||||
inspect the types of received arguments, in order to decide what to do
|
inspect the types of received arguments, in order to decide what to do
|
||||||
with the objects. For example, code may wish to accept either an object
|
with the objects.
|
||||||
of some type, or a sequence of objects of that type.
|
|
||||||
|
|
||||||
|
For example, code may wish to accept either an object
|
||||||
|
of some type, or a sequence of objects of that type.
|
||||||
Currently, the "obvious way" to do this is by type inspection, but this
|
Currently, the "obvious way" to do this is by type inspection, but this
|
||||||
is brittle and closed to extension. Abstract Base Classes make it easier
|
is brittle and closed to extension.
|
||||||
|
|
||||||
|
Abstract Base Classes make it easier
|
||||||
to discover present behaviour, but don't help adding new behaviour.
|
to discover present behaviour, but don't help adding new behaviour.
|
||||||
A developer using an already-written library may be unable to change how
|
A developer using an already-written library may be unable to change how
|
||||||
their objects are treated by such code, especially if the objects they
|
their objects are treated by such code, especially if the objects they
|
||||||
|
@ -63,7 +66,7 @@ User API
|
||||||
|
|
||||||
To define a generic function, decorate it with the ``@singledispatch``
|
To define a generic function, decorate it with the ``@singledispatch``
|
||||||
decorator. Note that the dispatch happens on the type of the first
|
decorator. Note that the dispatch happens on the type of the first
|
||||||
argument, create your function accordingly::
|
argument. Create your function accordingly::
|
||||||
|
|
||||||
>>> from functools import singledispatch
|
>>> from functools import singledispatch
|
||||||
>>> @singledispatch
|
>>> @singledispatch
|
||||||
|
@ -73,7 +76,7 @@ argument, create your function accordingly::
|
||||||
... print(arg)
|
... print(arg)
|
||||||
|
|
||||||
To add overloaded implementations to the function, use the
|
To add overloaded implementations to the function, use the
|
||||||
``register()`` attribute of the generic function. It is a decorator,
|
``register()`` attribute of the generic function. This is a decorator,
|
||||||
taking a type parameter and decorating a function implementing the
|
taking a type parameter and decorating a function implementing the
|
||||||
operation for that type::
|
operation for that type::
|
||||||
|
|
||||||
|
@ -98,7 +101,7 @@ To enable registering lambdas and pre-existing functions, the
|
||||||
...
|
...
|
||||||
>>> fun.register(type(None), nothing)
|
>>> fun.register(type(None), nothing)
|
||||||
|
|
||||||
The ``register()`` attribute returns the undecorated function which
|
The ``register()`` attribute returns the undecorated function. This
|
||||||
enables decorator stacking, pickling, as well as creating unit tests for
|
enables decorator stacking, pickling, as well as creating unit tests for
|
||||||
each variant independently::
|
each variant independently::
|
||||||
|
|
||||||
|
@ -134,13 +137,17 @@ argument::
|
||||||
|
|
||||||
Where there is no registered implementation for a specific type, its
|
Where there is no registered implementation for a specific type, its
|
||||||
method resolution order is used to find a more generic implementation.
|
method resolution order is used to find a more generic implementation.
|
||||||
|
The original function decorated with ``@singledispatch`` is registered
|
||||||
|
for the base ``object`` type, which means it is used if no better
|
||||||
|
implementation is found.
|
||||||
|
|
||||||
To check which implementation will the generic function choose for
|
To check which implementation will the generic function choose for
|
||||||
a given type, use the ``dispatch()`` attribute::
|
a given type, use the ``dispatch()`` attribute::
|
||||||
|
|
||||||
>>> fun.dispatch(float)
|
>>> fun.dispatch(float)
|
||||||
<function fun_num at 0x104319058>
|
<function fun_num at 0x104319058>
|
||||||
>>> fun.dispatch(dict)
|
>>> fun.dispatch(dict) # note: default implementation
|
||||||
<function fun at 0x103fe4788>
|
<function fun at 0x103fe0000>
|
||||||
|
|
||||||
To access all registered implementations, use the read-only ``registry``
|
To access all registered implementations, use the read-only ``registry``
|
||||||
attribute::
|
attribute::
|
||||||
|
@ -152,7 +159,7 @@ attribute::
|
||||||
>>> fun.registry[float]
|
>>> fun.registry[float]
|
||||||
<function fun_num at 0x1035a2840>
|
<function fun_num at 0x1035a2840>
|
||||||
>>> fun.registry[object]
|
>>> fun.registry[object]
|
||||||
<function fun at 0x103170788>
|
<function fun at 0x103fe0000>
|
||||||
|
|
||||||
The proposed API is intentionally limited and opinionated, as to ensure
|
The proposed API is intentionally limited and opinionated, as to ensure
|
||||||
it is easy to explain and use, as well as to maintain consistency with
|
it is easy to explain and use, as well as to maintain consistency with
|
||||||
|
@ -168,12 +175,12 @@ implementation is mature, the goal is to move it largely as-is. The
|
||||||
reference implementation is available on hg.python.org [#ref-impl]_.
|
reference implementation is available on hg.python.org [#ref-impl]_.
|
||||||
|
|
||||||
The dispatch type is specified as a decorator argument. An alternative
|
The dispatch type is specified as a decorator argument. An alternative
|
||||||
form using function annotations has been considered but its inclusion
|
form using function annotations was considered but its inclusion
|
||||||
has been deferred. As of May 2013, this usage pattern is out of scope
|
has been rejected. As of May 2013, this usage pattern is out of scope
|
||||||
for the standard library [#pep-0008]_ and the best practices for
|
for the standard library [#pep-0008]_, and the best practices for
|
||||||
annotation usage are still debated.
|
annotation usage are still debated.
|
||||||
|
|
||||||
Based on the current ``pkgutil.simplegeneric`` implementation and
|
Based on the current ``pkgutil.simplegeneric`` implementation, and
|
||||||
following the convention on registering virtual subclasses on Abstract
|
following the convention on registering virtual subclasses on Abstract
|
||||||
Base Classes, the dispatch registry will not be thread-safe.
|
Base Classes, the dispatch registry will not be thread-safe.
|
||||||
|
|
||||||
|
@ -186,48 +193,37 @@ handling of old-style classes and Zope's ExtensionClasses. More
|
||||||
importantly, it introduces support for Abstract Base Classes (ABC).
|
importantly, it introduces support for Abstract Base Classes (ABC).
|
||||||
|
|
||||||
When a generic function implementation is registered for an ABC, the
|
When a generic function implementation is registered for an ABC, the
|
||||||
dispatch algorithm switches to a mode of MRO calculation for the
|
dispatch algorithm switches to an extended form of C3 linearization,
|
||||||
provided argument which includes the relevant ABCs. The algorithm is as
|
which includes the relevant ABCs in the MRO of the provided argument.
|
||||||
follows::
|
The algorithm inserts ABCs where their functionality is introduced, i.e.
|
||||||
|
``issubclass(cls, abc)`` returns ``True`` for the class itself but
|
||||||
|
returns ``False`` for all its direct base classes. Implicit ABCs for
|
||||||
|
a given class (either registered or inferred from the presence of
|
||||||
|
a special method like ``__len__()``) are inserted directly after the
|
||||||
|
last ABC explicitly listed in the MRO of said class.
|
||||||
|
|
||||||
def _compose_mro(cls, haystack):
|
In its most basic form, this linearization returns the MRO for the given
|
||||||
"""Calculates the MRO for a given class `cls`, including relevant
|
type::
|
||||||
abstract base classes from `haystack`."""
|
|
||||||
bases = set(cls.__mro__)
|
|
||||||
mro = list(cls.__mro__)
|
|
||||||
for regcls in haystack:
|
|
||||||
if regcls in bases or not issubclass(cls, regcls):
|
|
||||||
continue # either present in the __mro__ or unrelated
|
|
||||||
for index, base in enumerate(mro):
|
|
||||||
if not issubclass(base, regcls):
|
|
||||||
break
|
|
||||||
if base in bases and not issubclass(regcls, base):
|
|
||||||
# Conflict resolution: put classes present in __mro__
|
|
||||||
# and their subclasses first.
|
|
||||||
index += 1
|
|
||||||
mro.insert(index, regcls)
|
|
||||||
return mro
|
|
||||||
|
|
||||||
In its most basic form, it returns the MRO for the given type::
|
|
||||||
|
|
||||||
>>> _compose_mro(dict, [])
|
>>> _compose_mro(dict, [])
|
||||||
[<class 'dict'>, <class 'object'>]
|
[<class 'dict'>, <class 'object'>]
|
||||||
|
|
||||||
When the haystack consists of ABCs that the specified type is a subclass
|
When the second argument contains ABCs that the specified type is
|
||||||
of, they are inserted in a predictable order::
|
a subclass of, they are inserted in a predictable order::
|
||||||
|
|
||||||
>>> _compose_mro(dict, [Sized, MutableMapping, str,
|
>>> _compose_mro(dict, [Sized, MutableMapping, str,
|
||||||
... Sequence, Iterable])
|
... Sequence, Iterable])
|
||||||
[<class 'dict'>, <class 'collections.abc.MutableMapping'>,
|
[<class 'dict'>, <class 'collections.abc.MutableMapping'>,
|
||||||
<class 'collections.abc.Iterable'>, <class 'collections.abc.Sized'>,
|
<class 'collections.abc.Mapping'>, <class 'collections.abc.Sized'>,
|
||||||
|
<class 'collections.abc.Iterable'>, <class 'collections.abc.Container'>,
|
||||||
<class 'object'>]
|
<class 'object'>]
|
||||||
|
|
||||||
While this mode of operation is significantly slower, all dispatch
|
While this mode of operation is significantly slower, all dispatch
|
||||||
decisions are cached. The cache is invalidated on registering new
|
decisions are cached. The cache is invalidated on registering new
|
||||||
implementations on the generic function or when user code calls
|
implementations on the generic function or when user code calls
|
||||||
``register()`` on an ABC to register a new virtual subclass. In the
|
``register()`` on an ABC to implicitly subclass it. In the latter case,
|
||||||
latter case, it is possible to create a situation with ambiguous
|
it is possible to create a situation with ambiguous dispatch, for
|
||||||
dispatch, for instance::
|
instance::
|
||||||
|
|
||||||
>>> from collections import Iterable, Container
|
>>> from collections import Iterable, Container
|
||||||
>>> class P:
|
>>> class P:
|
||||||
|
@ -254,27 +250,45 @@ guess::
|
||||||
RuntimeError: Ambiguous dispatch: <class 'collections.abc.Container'>
|
RuntimeError: Ambiguous dispatch: <class 'collections.abc.Container'>
|
||||||
or <class 'collections.abc.Iterable'>
|
or <class 'collections.abc.Iterable'>
|
||||||
|
|
||||||
Note that this exception would not be raised if ``Iterable`` and
|
Note that this exception would not be raised if one or more ABCs had
|
||||||
``Container`` had been provided as base classes during class definition.
|
been provided explicitly as base classes during class definition. In
|
||||||
In this case dispatch happens in the MRO order::
|
this case dispatch happens in the MRO order::
|
||||||
|
|
||||||
>>> class Ten(Iterable, Container):
|
>>> class Ten(Iterable, Container):
|
||||||
... def __iter__(self):
|
... def __iter__(self):
|
||||||
... for i in range(10):
|
... for i in range(10):
|
||||||
... yield i
|
... yield i
|
||||||
... def __contains__(self, value):
|
... def __contains__(self, value):
|
||||||
... return value in range(10)
|
... return value in range(10)
|
||||||
...
|
...
|
||||||
>>> g(Ten())
|
>>> g(Ten())
|
||||||
'iterable'
|
'iterable'
|
||||||
|
|
||||||
|
A similar conflict arises when subclassing an ABC is inferred from the
|
||||||
|
presence of a special method like ``__len__()`` or ``__contains__()``::
|
||||||
|
|
||||||
|
>>> class Q:
|
||||||
|
... def __contains__(self, value):
|
||||||
|
... return False
|
||||||
|
...
|
||||||
|
>>> issubclass(Q, Container)
|
||||||
|
True
|
||||||
|
>>> Iterable.register(Q)
|
||||||
|
>>> g(Q())
|
||||||
|
Traceback (most recent call last):
|
||||||
|
...
|
||||||
|
RuntimeError: Ambiguous dispatch: <class 'collections.abc.Container'>
|
||||||
|
or <class 'collections.abc.Iterable'>
|
||||||
|
|
||||||
|
An early version of the PEP contained a custom approach that was simpler
|
||||||
|
but created a number of edge cases with surprising results [#why-c3]_.
|
||||||
|
|
||||||
Usage Patterns
|
Usage Patterns
|
||||||
==============
|
==============
|
||||||
|
|
||||||
This PEP proposes extending behaviour only of functions specifically
|
This PEP proposes extending behaviour only of functions specifically
|
||||||
marked as generic. Just as a base class method may be overridden by
|
marked as generic. Just as a base class method may be overridden by
|
||||||
a subclass, so too may a function be overloaded to provide custom
|
a subclass, so too a function may be overloaded to provide custom
|
||||||
functionality for a given type.
|
functionality for a given type.
|
||||||
|
|
||||||
Universal overloading does not equal *arbitrary* overloading, in the
|
Universal overloading does not equal *arbitrary* overloading, in the
|
||||||
|
@ -371,6 +385,8 @@ References
|
||||||
a particular annotation style".
|
a particular annotation style".
|
||||||
(http://www.python.org/dev/peps/pep-0008)
|
(http://www.python.org/dev/peps/pep-0008)
|
||||||
|
|
||||||
|
.. [#why-c3] http://bugs.python.org/issue18244
|
||||||
|
|
||||||
.. [#pep-3124] http://www.python.org/dev/peps/pep-3124/
|
.. [#pep-3124] http://www.python.org/dev/peps/pep-3124/
|
||||||
|
|
||||||
.. [#peak-rules] http://peak.telecommunity.com/DevCenter/PEAK_2dRules
|
.. [#peak-rules] http://peak.telecommunity.com/DevCenter/PEAK_2dRules
|
||||||
|
|
|
@ -0,0 +1,773 @@
|
||||||
|
PEP: 445
|
||||||
|
Title: Add new APIs to customize Python memory allocators
|
||||||
|
Version: $Revision$
|
||||||
|
Last-Modified: $Date$
|
||||||
|
Author: Victor Stinner <victor.stinner@gmail.com>
|
||||||
|
BDFL-Delegate: Antoine Pitrou <solipsis@pitrou.net>
|
||||||
|
Status: Accepted
|
||||||
|
Type: Standards Track
|
||||||
|
Content-Type: text/x-rst
|
||||||
|
Created: 15-june-2013
|
||||||
|
Python-Version: 3.4
|
||||||
|
Resolution: http://mail.python.org/pipermail/python-dev/2013-July/127222.html
|
||||||
|
|
||||||
|
Abstract
|
||||||
|
========
|
||||||
|
|
||||||
|
This PEP proposes new Application Programming Interfaces (API) to customize
|
||||||
|
Python memory allocators. The only implementation required to conform to
|
||||||
|
this PEP is CPython, but other implementations may choose to be compatible,
|
||||||
|
or to re-use a similar scheme.
|
||||||
|
|
||||||
|
|
||||||
|
Rationale
|
||||||
|
=========
|
||||||
|
|
||||||
|
Use cases:
|
||||||
|
|
||||||
|
* Applications embedding Python which want to isolate Python memory from
|
||||||
|
the memory of the application, or want to use a different memory
|
||||||
|
allocator optimized for its Python usage
|
||||||
|
* Python running on embedded devices with low memory and slow CPU.
|
||||||
|
A custom memory allocator can be used for efficiency and/or to get
|
||||||
|
access all the memory of the device.
|
||||||
|
* Debug tools for memory allocators:
|
||||||
|
|
||||||
|
- track the memory usage (find memory leaks)
|
||||||
|
- get the location of a memory allocation: Python filename and line
|
||||||
|
number, and the size of a memory block
|
||||||
|
- detect buffer underflow, buffer overflow and misuse of Python
|
||||||
|
allocator APIs (see `Redesign Debug Checks on Memory Block
|
||||||
|
Allocators as Hooks`_)
|
||||||
|
- force memory allocations to fail to test handling of the
|
||||||
|
``MemoryError`` exception
|
||||||
|
|
||||||
|
|
||||||
|
Proposal
|
||||||
|
========
|
||||||
|
|
||||||
|
New Functions and Structures
|
||||||
|
----------------------------
|
||||||
|
|
||||||
|
* Add a new GIL-free (no need to hold the GIL) memory allocator:
|
||||||
|
|
||||||
|
- ``void* PyMem_RawMalloc(size_t size)``
|
||||||
|
- ``void* PyMem_RawRealloc(void *ptr, size_t new_size)``
|
||||||
|
- ``void PyMem_RawFree(void *ptr)``
|
||||||
|
- The newly allocated memory will not have been initialized in any
|
||||||
|
way.
|
||||||
|
- Requesting zero bytes returns a distinct non-*NULL* pointer if
|
||||||
|
possible, as if ``PyMem_Malloc(1)`` had been called instead.
|
||||||
|
|
||||||
|
* Add a new ``PyMemAllocator`` structure::
|
||||||
|
|
||||||
|
typedef struct {
|
||||||
|
/* user context passed as the first argument to the 3 functions */
|
||||||
|
void *ctx;
|
||||||
|
|
||||||
|
/* allocate a memory block */
|
||||||
|
void* (*malloc) (void *ctx, size_t size);
|
||||||
|
|
||||||
|
/* allocate or resize a memory block */
|
||||||
|
void* (*realloc) (void *ctx, void *ptr, size_t new_size);
|
||||||
|
|
||||||
|
/* release a memory block */
|
||||||
|
void (*free) (void *ctx, void *ptr);
|
||||||
|
} PyMemAllocator;
|
||||||
|
|
||||||
|
* Add a new ``PyMemAllocatorDomain`` enum to choose the Python
|
||||||
|
allocator domain. Domains:
|
||||||
|
|
||||||
|
- ``PYMEM_DOMAIN_RAW``: ``PyMem_RawMalloc()``, ``PyMem_RawRealloc()``
|
||||||
|
and ``PyMem_RawFree()``
|
||||||
|
|
||||||
|
- ``PYMEM_DOMAIN_MEM``: ``PyMem_Malloc()``, ``PyMem_Realloc()`` and
|
||||||
|
``PyMem_Free()``
|
||||||
|
|
||||||
|
- ``PYMEM_DOMAIN_OBJ``: ``PyObject_Malloc()``, ``PyObject_Realloc()``
|
||||||
|
and ``PyObject_Free()``
|
||||||
|
|
||||||
|
* Add new functions to get and set memory block allocators:
|
||||||
|
|
||||||
|
- ``void PyMem_GetAllocator(PyMemAllocatorDomain domain, PyMemAllocator *allocator)``
|
||||||
|
- ``void PyMem_SetAllocator(PyMemAllocatorDomain domain, PyMemAllocator *allocator)``
|
||||||
|
- The new allocator must return a distinct non-*NULL* pointer when
|
||||||
|
requesting zero bytes
|
||||||
|
- For the ``PYMEM_DOMAIN_RAW`` domain, the allocator must be
|
||||||
|
thread-safe: the GIL is not held when the allocator is called.
|
||||||
|
|
||||||
|
* Add a new ``PyObjectArenaAllocator`` structure::
|
||||||
|
|
||||||
|
typedef struct {
|
||||||
|
/* user context passed as the first argument to the 2 functions */
|
||||||
|
void *ctx;
|
||||||
|
|
||||||
|
/* allocate an arena */
|
||||||
|
void* (*alloc) (void *ctx, size_t size);
|
||||||
|
|
||||||
|
/* release an arena */
|
||||||
|
void (*free) (void *ctx, void *ptr, size_t size);
|
||||||
|
} PyObjectArenaAllocator;
|
||||||
|
|
||||||
|
* Add new functions to get and set the arena allocator used by
|
||||||
|
*pymalloc*:
|
||||||
|
|
||||||
|
- ``void PyObject_GetArenaAllocator(PyObjectArenaAllocator *allocator)``
|
||||||
|
- ``void PyObject_SetArenaAllocator(PyObjectArenaAllocator *allocator)``
|
||||||
|
|
||||||
|
* Add a new function to reinstall the debug checks on memory allocators when
|
||||||
|
a memory allocator is replaced with ``PyMem_SetAllocator()``:
|
||||||
|
|
||||||
|
- ``void PyMem_SetupDebugHooks(void)``
|
||||||
|
- Install the debug hooks on all memory block allocators. The function can be
|
||||||
|
called more than once, hooks are only installed once.
|
||||||
|
- The function does nothing is Python is not compiled in debug mode.
|
||||||
|
|
||||||
|
* Memory block allocators always return *NULL* if *size* is greater than
|
||||||
|
``PY_SSIZE_T_MAX``. The check is done before calling the inner
|
||||||
|
function.
|
||||||
|
|
||||||
|
.. note::
|
||||||
|
The *pymalloc* allocator is optimized for objects smaller than 512 bytes
|
||||||
|
with a short lifetime. It uses memory mappings with a fixed size of 256
|
||||||
|
KB called "arenas".
|
||||||
|
|
||||||
|
Here is how the allocators are set up by default:
|
||||||
|
|
||||||
|
* ``PYMEM_DOMAIN_RAW``, ``PYMEM_DOMAIN_MEM``: ``malloc()``,
|
||||||
|
``realloc()`` and ``free()``; call ``malloc(1)`` when requesting zero
|
||||||
|
bytes
|
||||||
|
* ``PYMEM_DOMAIN_OBJ``: *pymalloc* allocator which falls back on
|
||||||
|
``PyMem_Malloc()`` for allocations larger than 512 bytes
|
||||||
|
* *pymalloc* arena allocator: ``VirtualAlloc()`` and ``VirtualFree()`` on
|
||||||
|
Windows, ``mmap()`` and ``munmap()`` when available, or ``malloc()``
|
||||||
|
and ``free()``
|
||||||
|
|
||||||
|
|
||||||
|
Redesign Debug Checks on Memory Block Allocators as Hooks
|
||||||
|
---------------------------------------------------------
|
||||||
|
|
||||||
|
Since Python 2.3, Python implements different checks on memory
|
||||||
|
allocators in debug mode:
|
||||||
|
|
||||||
|
* Newly allocated memory is filled with the byte ``0xCB``, freed memory
|
||||||
|
is filled with the byte ``0xDB``.
|
||||||
|
* Detect API violations, ex: ``PyObject_Free()`` called on a memory
|
||||||
|
block allocated by ``PyMem_Malloc()``
|
||||||
|
* Detect write before the start of the buffer (buffer underflow)
|
||||||
|
* Detect write after the end of the buffer (buffer overflow)
|
||||||
|
|
||||||
|
In Python 3.3, the checks are installed by replacing ``PyMem_Malloc()``,
|
||||||
|
``PyMem_Realloc()``, ``PyMem_Free()``, ``PyObject_Malloc()``,
|
||||||
|
``PyObject_Realloc()`` and ``PyObject_Free()`` using macros. The new
|
||||||
|
allocator allocates a larger buffer and writes a pattern to detect buffer
|
||||||
|
underflow, buffer overflow and use after free (by filling the buffer with
|
||||||
|
the byte ``0xDB``). It uses the original ``PyObject_Malloc()``
|
||||||
|
function to allocate memory. So ``PyMem_Malloc()`` and
|
||||||
|
``PyMem_Realloc()`` indirectly call``PyObject_Malloc()`` and
|
||||||
|
``PyObject_Realloc()``.
|
||||||
|
|
||||||
|
This PEP redesigns the debug checks as hooks on the existing allocators
|
||||||
|
in debug mode. Examples of call traces without the hooks:
|
||||||
|
|
||||||
|
* ``PyMem_RawMalloc()`` => ``_PyMem_RawMalloc()`` => ``malloc()``
|
||||||
|
* ``PyMem_Realloc()`` => ``_PyMem_RawRealloc()`` => ``realloc()``
|
||||||
|
* ``PyObject_Free()`` => ``_PyObject_Free()``
|
||||||
|
|
||||||
|
Call traces when the hooks are installed (debug mode):
|
||||||
|
|
||||||
|
* ``PyMem_RawMalloc()`` => ``_PyMem_DebugMalloc()``
|
||||||
|
=> ``_PyMem_RawMalloc()`` => ``malloc()``
|
||||||
|
* ``PyMem_Realloc()`` => ``_PyMem_DebugRealloc()``
|
||||||
|
=> ``_PyMem_RawRealloc()`` => ``realloc()``
|
||||||
|
* ``PyObject_Free()`` => ``_PyMem_DebugFree()``
|
||||||
|
=> ``_PyObject_Free()``
|
||||||
|
|
||||||
|
As a result, ``PyMem_Malloc()`` and ``PyMem_Realloc()`` now call
|
||||||
|
``malloc()`` and ``realloc()`` in both release mode and debug mode,
|
||||||
|
instead of calling ``PyObject_Malloc()`` and ``PyObject_Realloc()`` in
|
||||||
|
debug mode.
|
||||||
|
|
||||||
|
When at least one memory allocator is replaced with
|
||||||
|
``PyMem_SetAllocator()``, the ``PyMem_SetupDebugHooks()`` function must
|
||||||
|
be called to reinstall the debug hooks on top on the new allocator.
|
||||||
|
|
||||||
|
|
||||||
|
Don't call malloc() directly anymore
|
||||||
|
------------------------------------
|
||||||
|
|
||||||
|
``PyObject_Malloc()`` falls back on ``PyMem_Malloc()`` instead of
|
||||||
|
``malloc()`` if size is greater or equal than 512 bytes, and
|
||||||
|
``PyObject_Realloc()`` falls back on ``PyMem_Realloc()`` instead of
|
||||||
|
``realloc()``
|
||||||
|
|
||||||
|
Direct calls to ``malloc()`` are replaced with ``PyMem_Malloc()``, or
|
||||||
|
``PyMem_RawMalloc()`` if the GIL is not held.
|
||||||
|
|
||||||
|
External libraries like zlib or OpenSSL can be configured to allocate memory
|
||||||
|
using ``PyMem_Malloc()`` or ``PyMem_RawMalloc()``. If the allocator of a
|
||||||
|
library can only be replaced globally (rather than on an object-by-object
|
||||||
|
basis), it shouldn't be replaced when Python is embedded in an application.
|
||||||
|
|
||||||
|
For the "track memory usage" use case, it is important to track memory
|
||||||
|
allocated in external libraries to have accurate reports, because these
|
||||||
|
allocations can be large (e.g. they can raise a ``MemoryError`` exception)
|
||||||
|
and would otherwise be missed in memory usage reports.
|
||||||
|
|
||||||
|
|
||||||
|
Examples
|
||||||
|
========
|
||||||
|
|
||||||
|
Use case 1: Replace Memory Allocators, keep pymalloc
|
||||||
|
----------------------------------------------------
|
||||||
|
|
||||||
|
Dummy example wasting 2 bytes per memory block,
|
||||||
|
and 10 bytes per *pymalloc* arena::
|
||||||
|
|
||||||
|
#include <stdlib.h>
|
||||||
|
|
||||||
|
size_t alloc_padding = 2;
|
||||||
|
size_t arena_padding = 10;
|
||||||
|
|
||||||
|
void* my_malloc(void *ctx, size_t size)
|
||||||
|
{
|
||||||
|
int padding = *(int *)ctx;
|
||||||
|
return malloc(size + padding);
|
||||||
|
}
|
||||||
|
|
||||||
|
void* my_realloc(void *ctx, void *ptr, size_t new_size)
|
||||||
|
{
|
||||||
|
int padding = *(int *)ctx;
|
||||||
|
return realloc(ptr, new_size + padding);
|
||||||
|
}
|
||||||
|
|
||||||
|
void my_free(void *ctx, void *ptr)
|
||||||
|
{
|
||||||
|
free(ptr);
|
||||||
|
}
|
||||||
|
|
||||||
|
void* my_alloc_arena(void *ctx, size_t size)
|
||||||
|
{
|
||||||
|
int padding = *(int *)ctx;
|
||||||
|
return malloc(size + padding);
|
||||||
|
}
|
||||||
|
|
||||||
|
void my_free_arena(void *ctx, void *ptr, size_t size)
|
||||||
|
{
|
||||||
|
free(ptr);
|
||||||
|
}
|
||||||
|
|
||||||
|
void setup_custom_allocator(void)
|
||||||
|
{
|
||||||
|
PyMemAllocator alloc;
|
||||||
|
PyObjectArenaAllocator arena;
|
||||||
|
|
||||||
|
alloc.ctx = &alloc_padding;
|
||||||
|
alloc.malloc = my_malloc;
|
||||||
|
alloc.realloc = my_realloc;
|
||||||
|
alloc.free = my_free;
|
||||||
|
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_RAW, &alloc);
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_MEM, &alloc);
|
||||||
|
/* leave PYMEM_DOMAIN_OBJ unchanged, use pymalloc */
|
||||||
|
|
||||||
|
arena.ctx = &arena_padding;
|
||||||
|
arena.alloc = my_alloc_arena;
|
||||||
|
arena.free = my_free_arena;
|
||||||
|
PyObject_SetArenaAllocator(&arena);
|
||||||
|
|
||||||
|
PyMem_SetupDebugHooks();
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
|
Use case 2: Replace Memory Allocators, override pymalloc
|
||||||
|
--------------------------------------------------------
|
||||||
|
|
||||||
|
If you have a dedicated allocator optimized for allocations of objects
|
||||||
|
smaller than 512 bytes with a short lifetime, pymalloc can be overriden
|
||||||
|
(replace ``PyObject_Malloc()``).
|
||||||
|
|
||||||
|
Dummy example wasting 2 bytes per memory block::
|
||||||
|
|
||||||
|
#include <stdlib.h>
|
||||||
|
|
||||||
|
size_t padding = 2;
|
||||||
|
|
||||||
|
void* my_malloc(void *ctx, size_t size)
|
||||||
|
{
|
||||||
|
int padding = *(int *)ctx;
|
||||||
|
return malloc(size + padding);
|
||||||
|
}
|
||||||
|
|
||||||
|
void* my_realloc(void *ctx, void *ptr, size_t new_size)
|
||||||
|
{
|
||||||
|
int padding = *(int *)ctx;
|
||||||
|
return realloc(ptr, new_size + padding);
|
||||||
|
}
|
||||||
|
|
||||||
|
void my_free(void *ctx, void *ptr)
|
||||||
|
{
|
||||||
|
free(ptr);
|
||||||
|
}
|
||||||
|
|
||||||
|
void setup_custom_allocator(void)
|
||||||
|
{
|
||||||
|
PyMemAllocator alloc;
|
||||||
|
alloc.ctx = &padding;
|
||||||
|
alloc.malloc = my_malloc;
|
||||||
|
alloc.realloc = my_realloc;
|
||||||
|
alloc.free = my_free;
|
||||||
|
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_RAW, &alloc);
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_MEM, &alloc);
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_OBJ, &alloc);
|
||||||
|
|
||||||
|
PyMem_SetupDebugHooks();
|
||||||
|
}
|
||||||
|
|
||||||
|
The *pymalloc* arena does not need to be replaced, because it is no more
|
||||||
|
used by the new allocator.
|
||||||
|
|
||||||
|
|
||||||
|
Use case 3: Setup Hooks On Memory Block Allocators
|
||||||
|
--------------------------------------------------
|
||||||
|
|
||||||
|
Example to setup hooks on all memory block allocators::
|
||||||
|
|
||||||
|
struct {
|
||||||
|
PyMemAllocator raw;
|
||||||
|
PyMemAllocator mem;
|
||||||
|
PyMemAllocator obj;
|
||||||
|
/* ... */
|
||||||
|
} hook;
|
||||||
|
|
||||||
|
static void* hook_malloc(void *ctx, size_t size)
|
||||||
|
{
|
||||||
|
PyMemAllocator *alloc = (PyMemAllocator *)ctx;
|
||||||
|
void *ptr;
|
||||||
|
/* ... */
|
||||||
|
ptr = alloc->malloc(alloc->ctx, size);
|
||||||
|
/* ... */
|
||||||
|
return ptr;
|
||||||
|
}
|
||||||
|
|
||||||
|
static void* hook_realloc(void *ctx, void *ptr, size_t new_size)
|
||||||
|
{
|
||||||
|
PyMemAllocator *alloc = (PyMemAllocator *)ctx;
|
||||||
|
void *ptr2;
|
||||||
|
/* ... */
|
||||||
|
ptr2 = alloc->realloc(alloc->ctx, ptr, new_size);
|
||||||
|
/* ... */
|
||||||
|
return ptr2;
|
||||||
|
}
|
||||||
|
|
||||||
|
static void hook_free(void *ctx, void *ptr)
|
||||||
|
{
|
||||||
|
PyMemAllocator *alloc = (PyMemAllocator *)ctx;
|
||||||
|
/* ... */
|
||||||
|
alloc->free(alloc->ctx, ptr);
|
||||||
|
/* ... */
|
||||||
|
}
|
||||||
|
|
||||||
|
void setup_hooks(void)
|
||||||
|
{
|
||||||
|
PyMemAllocator alloc;
|
||||||
|
static int installed = 0;
|
||||||
|
|
||||||
|
if (installed)
|
||||||
|
return;
|
||||||
|
installed = 1;
|
||||||
|
|
||||||
|
alloc.malloc = hook_malloc;
|
||||||
|
alloc.realloc = hook_realloc;
|
||||||
|
alloc.free = hook_free;
|
||||||
|
PyMem_GetAllocator(PYMEM_DOMAIN_RAW, &hook.raw);
|
||||||
|
PyMem_GetAllocator(PYMEM_DOMAIN_MEM, &hook.mem);
|
||||||
|
PyMem_GetAllocator(PYMEM_DOMAIN_OBJ, &hook.obj);
|
||||||
|
|
||||||
|
alloc.ctx = &hook.raw;
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_RAW, &alloc);
|
||||||
|
|
||||||
|
alloc.ctx = &hook.mem;
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_MEM, &alloc);
|
||||||
|
|
||||||
|
alloc.ctx = &hook.obj;
|
||||||
|
PyMem_SetAllocator(PYMEM_DOMAIN_OBJ, &alloc);
|
||||||
|
}
|
||||||
|
|
||||||
|
.. note::
|
||||||
|
``PyMem_SetupDebugHooks()`` does not need to be called because
|
||||||
|
memory allocator are not replaced: the debug checks on memory
|
||||||
|
block allocators are installed automatically at startup.
|
||||||
|
|
||||||
|
|
||||||
|
Performances
|
||||||
|
============
|
||||||
|
|
||||||
|
The implementation of this PEP (issue #3329) has no visible overhead on
|
||||||
|
the Python benchmark suite.
|
||||||
|
|
||||||
|
Results of the `Python benchmarks suite
|
||||||
|
<http://hg.python.org/benchmarks>`_ (-b 2n3): some tests are 1.04x
|
||||||
|
faster, some tests are 1.04 slower. Results of pybench microbenchmark:
|
||||||
|
"+0.1%" slower globally (diff between -4.9% and +5.6%).
|
||||||
|
|
||||||
|
The full output of benchmarks is attached to the issue #3329.
|
||||||
|
|
||||||
|
|
||||||
|
Rejected Alternatives
|
||||||
|
=====================
|
||||||
|
|
||||||
|
More specific functions to get/set memory allocators
|
||||||
|
----------------------------------------------------
|
||||||
|
|
||||||
|
It was originally proposed a larger set of C API functions, with one pair
|
||||||
|
of functions for each allocator domain:
|
||||||
|
|
||||||
|
* ``void PyMem_GetRawAllocator(PyMemAllocator *allocator)``
|
||||||
|
* ``void PyMem_GetAllocator(PyMemAllocator *allocator)``
|
||||||
|
* ``void PyObject_GetAllocator(PyMemAllocator *allocator)``
|
||||||
|
* ``void PyMem_SetRawAllocator(PyMemAllocator *allocator)``
|
||||||
|
* ``void PyMem_SetAllocator(PyMemAllocator *allocator)``
|
||||||
|
* ``void PyObject_SetAllocator(PyMemAllocator *allocator)``
|
||||||
|
|
||||||
|
This alternative was rejected because it is not possible to write
|
||||||
|
generic code with more specific functions: code must be duplicated for
|
||||||
|
each memory allocator domain.
|
||||||
|
|
||||||
|
|
||||||
|
Make PyMem_Malloc() reuse PyMem_RawMalloc() by default
|
||||||
|
------------------------------------------------------
|
||||||
|
|
||||||
|
If ``PyMem_Malloc()`` called ``PyMem_RawMalloc()`` by default,
|
||||||
|
calling ``PyMem_SetAllocator(PYMEM_DOMAIN_RAW, alloc)`` would also
|
||||||
|
patch ``PyMem_Malloc()`` indirectly.
|
||||||
|
|
||||||
|
This alternative was rejected because ``PyMem_SetAllocator()`` would
|
||||||
|
have a different behaviour depending on the domain. Always having the
|
||||||
|
same behaviour is less error-prone.
|
||||||
|
|
||||||
|
|
||||||
|
Add a new PYDEBUGMALLOC environment variable
|
||||||
|
--------------------------------------------
|
||||||
|
|
||||||
|
It was proposed to add a new ``PYDEBUGMALLOC`` environment variable to
|
||||||
|
enable debug checks on memory block allocators. It would have had the same
|
||||||
|
effect as calling the ``PyMem_SetupDebugHooks()``, without the need
|
||||||
|
to write any C code. Another advantage is to allow to enable debug checks
|
||||||
|
even in release mode: debug checks would always be compiled in, but only
|
||||||
|
enabled when the environment variable is present and non-empty.
|
||||||
|
|
||||||
|
This alternative was rejected because a new environment variable would
|
||||||
|
make Python initialization even more complex. `PEP 432
|
||||||
|
<http://www.python.org/dev/peps/pep-0432/>`_ tries to simplify the
|
||||||
|
CPython startup sequence.
|
||||||
|
|
||||||
|
|
||||||
|
Use macros to get customizable allocators
|
||||||
|
-----------------------------------------
|
||||||
|
|
||||||
|
To have no overhead in the default configuration, customizable
|
||||||
|
allocators would be an optional feature enabled by a configuration
|
||||||
|
option or by macros.
|
||||||
|
|
||||||
|
This alternative was rejected because the use of macros implies having
|
||||||
|
to recompile extensions modules to use the new allocator and allocator
|
||||||
|
hooks. Not having to recompile Python nor extension modules makes debug
|
||||||
|
hooks easier to use in practice.
|
||||||
|
|
||||||
|
|
||||||
|
Pass the C filename and line number
|
||||||
|
-----------------------------------
|
||||||
|
|
||||||
|
Define allocator functions as macros using ``__FILE__`` and ``__LINE__``
|
||||||
|
to get the C filename and line number of a memory allocation.
|
||||||
|
|
||||||
|
Example of ``PyMem_Malloc`` macro with the modified
|
||||||
|
``PyMemAllocator`` structure::
|
||||||
|
|
||||||
|
typedef struct {
|
||||||
|
/* user context passed as the first argument
|
||||||
|
to the 3 functions */
|
||||||
|
void *ctx;
|
||||||
|
|
||||||
|
/* allocate a memory block */
|
||||||
|
void* (*malloc) (void *ctx, const char *filename, int lineno,
|
||||||
|
size_t size);
|
||||||
|
|
||||||
|
/* allocate or resize a memory block */
|
||||||
|
void* (*realloc) (void *ctx, const char *filename, int lineno,
|
||||||
|
void *ptr, size_t new_size);
|
||||||
|
|
||||||
|
/* release a memory block */
|
||||||
|
void (*free) (void *ctx, const char *filename, int lineno,
|
||||||
|
void *ptr);
|
||||||
|
} PyMemAllocator;
|
||||||
|
|
||||||
|
void* _PyMem_MallocTrace(const char *filename, int lineno,
|
||||||
|
size_t size);
|
||||||
|
|
||||||
|
/* the function is still needed for the Python stable ABI */
|
||||||
|
void* PyMem_Malloc(size_t size);
|
||||||
|
|
||||||
|
#define PyMem_Malloc(size) \
|
||||||
|
_PyMem_MallocTrace(__FILE__, __LINE__, size)
|
||||||
|
|
||||||
|
The GC allocator functions would also have to be patched. For example,
|
||||||
|
``_PyObject_GC_Malloc()`` is used in many C functions and so objects of
|
||||||
|
different types would have the same allocation location.
|
||||||
|
|
||||||
|
This alternative was rejected because passing a filename and a line
|
||||||
|
number to each allocator makes the API more complex: pass 3 new
|
||||||
|
arguments (ctx, filename, lineno) to each allocator function, instead of
|
||||||
|
just a context argument (ctx). Having to also modify GC allocator
|
||||||
|
functions adds too much complexity for a little gain.
|
||||||
|
|
||||||
|
|
||||||
|
GIL-free PyMem_Malloc()
|
||||||
|
-----------------------
|
||||||
|
|
||||||
|
In Python 3.3, when Python is compiled in debug mode, ``PyMem_Malloc()``
|
||||||
|
indirectly calls ``PyObject_Malloc()`` which requires the GIL to be
|
||||||
|
held (it isn't thread-safe). That's why ``PyMem_Malloc()`` must be called
|
||||||
|
with the GIL held.
|
||||||
|
|
||||||
|
This PEP changes ``PyMem_Malloc()``: it now always calls ``malloc()``
|
||||||
|
rather than ``PyObject_Malloc()``. The "GIL must be held" restriction
|
||||||
|
could therefore be removed from ``PyMem_Malloc()``.
|
||||||
|
|
||||||
|
This alternative was rejected because allowing to call
|
||||||
|
``PyMem_Malloc()`` without holding the GIL can break applications
|
||||||
|
which setup their own allocators or allocator hooks. Holding the GIL is
|
||||||
|
convenient to develop a custom allocator: no need to care about other
|
||||||
|
threads. It is also convenient for a debug allocator hook: Python
|
||||||
|
objects can be safely inspected, and the C API may be used for reporting.
|
||||||
|
|
||||||
|
Moreover, calling ``PyGILState_Ensure()`` in a memory allocator has
|
||||||
|
unexpected behaviour, especially at Python startup and when creating of a
|
||||||
|
new Python thread state. It is better to free custom allocators of
|
||||||
|
the responsibility of acquiring the GIL.
|
||||||
|
|
||||||
|
|
||||||
|
Don't add PyMem_RawMalloc()
|
||||||
|
---------------------------
|
||||||
|
|
||||||
|
Replace ``malloc()`` with ``PyMem_Malloc()``, but only if the GIL is
|
||||||
|
held. Otherwise, keep ``malloc()`` unchanged.
|
||||||
|
|
||||||
|
The ``PyMem_Malloc()`` is used without the GIL held in some Python
|
||||||
|
functions. For example, the ``main()`` and ``Py_Main()`` functions of
|
||||||
|
Python call ``PyMem_Malloc()`` whereas the GIL do not exist yet. In this
|
||||||
|
case, ``PyMem_Malloc()`` would be replaced with ``malloc()`` (or
|
||||||
|
``PyMem_RawMalloc()``).
|
||||||
|
|
||||||
|
This alternative was rejected because ``PyMem_RawMalloc()`` is required
|
||||||
|
for accurate reports of the memory usage. When a debug hook is used to
|
||||||
|
track the memory usage, the memory allocated by direct calls to
|
||||||
|
``malloc()`` cannot be tracked. ``PyMem_RawMalloc()`` can be hooked and
|
||||||
|
so all the memory allocated by Python can be tracked, including
|
||||||
|
memory allocated without holding the GIL.
|
||||||
|
|
||||||
|
|
||||||
|
Use existing debug tools to analyze memory use
|
||||||
|
----------------------------------------------
|
||||||
|
|
||||||
|
There are many existing debug tools to analyze memory use. Some
|
||||||
|
examples: `Valgrind <http://valgrind.org/>`_, `Purify
|
||||||
|
<http://ibm.com/software/awdtools/purify/>`_, `Clang AddressSanitizer
|
||||||
|
<http://code.google.com/p/address-sanitizer/>`_, `failmalloc
|
||||||
|
<http://www.nongnu.org/failmalloc/>`_, etc.
|
||||||
|
|
||||||
|
The problem is to retrieve the Python object related to a memory pointer
|
||||||
|
to read its type and/or its content. Another issue is to retrieve the
|
||||||
|
source of the memory allocation: the C backtrace is usually useless
|
||||||
|
(same reasoning than macros using ``__FILE__`` and ``__LINE__``, see
|
||||||
|
`Pass the C filename and line number`_), the Python filename and line
|
||||||
|
number (or even the Python traceback) is more useful.
|
||||||
|
|
||||||
|
This alternative was rejected because classic tools are unable to
|
||||||
|
introspect Python internals to collect such information. Being able to
|
||||||
|
setup a hook on allocators called with the GIL held allows to collect a
|
||||||
|
lot of useful data from Python internals.
|
||||||
|
|
||||||
|
|
||||||
|
Add a msize() function
|
||||||
|
----------------------
|
||||||
|
|
||||||
|
Add another function to ``PyMemAllocator`` and
|
||||||
|
``PyObjectArenaAllocator`` structures::
|
||||||
|
|
||||||
|
size_t msize(void *ptr);
|
||||||
|
|
||||||
|
This function returns the size of a memory block or a memory mapping.
|
||||||
|
Return (size_t)-1 if the function is not implemented or if the pointer
|
||||||
|
is unknown (ex: NULL pointer).
|
||||||
|
|
||||||
|
On Windows, this function can be implemented using ``_msize()`` and
|
||||||
|
``VirtualQuery()``.
|
||||||
|
|
||||||
|
The function can be used to implement a hook tracking the memory usage.
|
||||||
|
The ``free()`` method of an allocator only gets the address of a memory
|
||||||
|
block, whereas the size of the memory block is required to update the
|
||||||
|
memory usage.
|
||||||
|
|
||||||
|
The additional ``msize()`` function was rejected because only few
|
||||||
|
platforms implement it. For example, Linux with the GNU libc does not
|
||||||
|
provide a function to get the size of a memory block. ``msize()`` is not
|
||||||
|
currently used in the Python source code. The function would only be
|
||||||
|
used to track memory use, and make the API more complex. A debug hook
|
||||||
|
can implement the function internally, there is no need to add it to
|
||||||
|
``PyMemAllocator`` and ``PyObjectArenaAllocator`` structures.
|
||||||
|
|
||||||
|
|
||||||
|
No context argument
|
||||||
|
-------------------
|
||||||
|
|
||||||
|
Simplify the signature of allocator functions, remove the context
|
||||||
|
argument:
|
||||||
|
|
||||||
|
* ``void* malloc(size_t size)``
|
||||||
|
* ``void* realloc(void *ptr, size_t new_size)``
|
||||||
|
* ``void free(void *ptr)``
|
||||||
|
|
||||||
|
It is likely for an allocator hook to be reused for
|
||||||
|
``PyMem_SetAllocator()`` and ``PyObject_SetAllocator()``, or even
|
||||||
|
``PyMem_SetRawAllocator()``, but the hook must call a different function
|
||||||
|
depending on the allocator. The context is a convenient way to reuse the
|
||||||
|
same custom allocator or hook for different Python allocators.
|
||||||
|
|
||||||
|
In C++, the context can be used to pass *this*.
|
||||||
|
|
||||||
|
|
||||||
|
External Libraries
|
||||||
|
==================
|
||||||
|
|
||||||
|
Examples of API used to customize memory allocators.
|
||||||
|
|
||||||
|
Libraries used by Python:
|
||||||
|
|
||||||
|
* OpenSSL: `CRYPTO_set_mem_functions()
|
||||||
|
<http://git.openssl.org/gitweb/?p=openssl.git;a=blob;f=crypto/mem.c;h=f7984fa958eb1edd6c61f6667f3f2b29753be662;hb=HEAD#l124>`_
|
||||||
|
to set memory management functions globally
|
||||||
|
* expat: `parserCreate()
|
||||||
|
<http://hg.python.org/cpython/file/cc27d50bd91a/Modules/expat/xmlparse.c#l724>`_
|
||||||
|
has a per-instance memory handler
|
||||||
|
* zlib: `zlib 1.2.8 Manual <http://www.zlib.net/manual.html#Usage>`_,
|
||||||
|
pass an opaque pointer
|
||||||
|
* bz2: `bzip2 and libbzip2, version 1.0.5
|
||||||
|
<http://www.bzip.org/1.0.5/bzip2-manual-1.0.5.html>`_,
|
||||||
|
pass an opaque pointer
|
||||||
|
* lzma: `LZMA SDK - How to Use
|
||||||
|
<http://www.asawicki.info/news_1368_lzma_sdk_-_how_to_use.html>`_,
|
||||||
|
pass an opaque pointer
|
||||||
|
* lipmpdec: no opaque pointer (classic malloc API)
|
||||||
|
|
||||||
|
Other libraries:
|
||||||
|
|
||||||
|
* glib: `g_mem_set_vtable()
|
||||||
|
<http://developer.gnome.org/glib/unstable/glib-Memory-Allocation.html#g-mem-set-vtable>`_
|
||||||
|
* libxml2:
|
||||||
|
`xmlGcMemSetup() <http://xmlsoft.org/html/libxml-xmlmemory.html>`_,
|
||||||
|
global
|
||||||
|
* Oracle's OCI: `Oracle Call Interface Programmer's Guide,
|
||||||
|
Release 2 (9.2)
|
||||||
|
<http://docs.oracle.com/cd/B10501_01/appdev.920/a96584/oci15re4.htm>`_,
|
||||||
|
pass an opaque pointer
|
||||||
|
|
||||||
|
The new *ctx* parameter of this PEP was inspired by the API of zlib and
|
||||||
|
Oracle's OCI libraries.
|
||||||
|
|
||||||
|
See also the `GNU libc: Memory Allocation Hooks
|
||||||
|
<http://www.gnu.org/software/libc/manual/html_node/Hooks-for-Malloc.html>`_
|
||||||
|
which uses a different approach to hook memory allocators.
|
||||||
|
|
||||||
|
|
||||||
|
Memory Allocators
|
||||||
|
=================
|
||||||
|
|
||||||
|
The C standard library provides the well known ``malloc()`` function.
|
||||||
|
Its implementation depends on the platform and of the C library. The GNU
|
||||||
|
C library uses a modified ptmalloc2, based on "Doug Lea's Malloc"
|
||||||
|
(dlmalloc). FreeBSD uses `jemalloc
|
||||||
|
<http://www.canonware.com/jemalloc/>`_. Google provides *tcmalloc* which
|
||||||
|
is part of `gperftools <http://code.google.com/p/gperftools/>`_.
|
||||||
|
|
||||||
|
``malloc()`` uses two kinds of memory: heap and memory mappings. Memory
|
||||||
|
mappings are usually used for large allocations (ex: larger than 256
|
||||||
|
KB), whereas the heap is used for small allocations.
|
||||||
|
|
||||||
|
On UNIX, the heap is handled by ``brk()`` and ``sbrk()`` system calls,
|
||||||
|
and it is contiguous. On Windows, the heap is handled by
|
||||||
|
``HeapAlloc()`` and can be discontiguous. Memory mappings are handled by
|
||||||
|
``mmap()`` on UNIX and ``VirtualAlloc()`` on Windows, they can be
|
||||||
|
discontiguous.
|
||||||
|
|
||||||
|
Releasing a memory mapping gives back immediatly the memory to the
|
||||||
|
system. On UNIX, the heap memory is only given back to the system if the
|
||||||
|
released block is located at the end of the heap. Otherwise, the memory
|
||||||
|
will only be given back to the system when all the memory located after
|
||||||
|
the released memory is also released.
|
||||||
|
|
||||||
|
To allocate memory on the heap, an allocator tries to reuse free space.
|
||||||
|
If there is no contiguous space big enough, the heap must be enlarged,
|
||||||
|
even if there is more free space than required size. This issue is
|
||||||
|
called the "memory fragmentation": the memory usage seen by the system
|
||||||
|
is higher than real usage. On Windows, ``HeapAlloc()`` creates
|
||||||
|
a new memory mapping with ``VirtualAlloc()`` if there is not enough free
|
||||||
|
contiguous memory.
|
||||||
|
|
||||||
|
CPython has a *pymalloc* allocator for allocations smaller than 512
|
||||||
|
bytes. This allocator is optimized for small objects with a short
|
||||||
|
lifetime. It uses memory mappings called "arenas" with a fixed size of
|
||||||
|
256 KB.
|
||||||
|
|
||||||
|
Other allocators:
|
||||||
|
|
||||||
|
* Windows provides a `Low-fragmentation Heap
|
||||||
|
<http://msdn.microsoft.com/en-us/library/windows/desktop/aa366750%28v=vs.85%29.aspx>`_.
|
||||||
|
|
||||||
|
* The Linux kernel uses `slab allocation
|
||||||
|
<http://en.wikipedia.org/wiki/Slab_allocation>`_.
|
||||||
|
|
||||||
|
* The glib library has a `Memory Slice API
|
||||||
|
<https://developer.gnome.org/glib/unstable/glib-Memory-Slices.html>`_:
|
||||||
|
efficient way to allocate groups of equal-sized chunks of memory
|
||||||
|
|
||||||
|
This PEP allows to choose exactly which memory allocator is used for your
|
||||||
|
application depending on its usage of the memory (number of allocations,
|
||||||
|
size of allocations, lifetime of objects, etc.).
|
||||||
|
|
||||||
|
|
||||||
|
Links
|
||||||
|
=====
|
||||||
|
|
||||||
|
CPython issues related to memory allocation:
|
||||||
|
|
||||||
|
* `Issue #3329: Add new APIs to customize memory allocators
|
||||||
|
<http://bugs.python.org/issue3329>`_
|
||||||
|
* `Issue #13483: Use VirtualAlloc to allocate memory arenas
|
||||||
|
<http://bugs.python.org/issue13483>`_
|
||||||
|
* `Issue #16742: PyOS_Readline drops GIL and calls PyOS_StdioReadline,
|
||||||
|
which isn't thread safe <http://bugs.python.org/issue16742>`_
|
||||||
|
* `Issue #18203: Replace calls to malloc() with PyMem_Malloc() or
|
||||||
|
PyMem_RawMalloc() <http://bugs.python.org/issue18203>`_
|
||||||
|
* `Issue #18227: Use Python memory allocators in external libraries like
|
||||||
|
zlib or OpenSSL <http://bugs.python.org/issue18227>`_
|
||||||
|
|
||||||
|
Projects analyzing the memory usage of Python applications:
|
||||||
|
|
||||||
|
* `pytracemalloc
|
||||||
|
<https://pypi.python.org/pypi/pytracemalloc>`_
|
||||||
|
* `Meliae: Python Memory Usage Analyzer
|
||||||
|
<https://pypi.python.org/pypi/meliae>`_
|
||||||
|
* `Guppy-PE: umbrella package combining Heapy and GSL
|
||||||
|
<http://guppy-pe.sourceforge.net/>`_
|
||||||
|
* `PySizer (developed for Python 2.4)
|
||||||
|
<http://pysizer.8325.org/>`_
|
||||||
|
|
||||||
|
|
||||||
|
Copyright
|
||||||
|
=========
|
||||||
|
|
||||||
|
This document has been placed into the public domain.
|
||||||
|
|
|
@ -0,0 +1,248 @@
|
||||||
|
PEP: 446
|
||||||
|
Title: Add new parameters to configure the inheritance of files and for non-blocking sockets
|
||||||
|
Version: $Revision$
|
||||||
|
Last-Modified: $Date$
|
||||||
|
Author: Victor Stinner <victor.stinner@gmail.com>
|
||||||
|
Status: Draft
|
||||||
|
Type: Standards Track
|
||||||
|
Content-Type: text/x-rst
|
||||||
|
Created: 3-July-2013
|
||||||
|
Python-Version: 3.4
|
||||||
|
|
||||||
|
|
||||||
|
Abstract
|
||||||
|
========
|
||||||
|
|
||||||
|
This PEP proposes new portable parameters and functions to configure the
|
||||||
|
inheritance of file descriptors and the non-blocking flag of sockets.
|
||||||
|
|
||||||
|
|
||||||
|
Rationale
|
||||||
|
=========
|
||||||
|
|
||||||
|
Inheritance of file descriptors
|
||||||
|
-------------------------------
|
||||||
|
|
||||||
|
The inheritance of file descriptors in child processes can be configured
|
||||||
|
on each file descriptor using a *close-on-exec* flag. By default, the
|
||||||
|
close-on-exec flag is not set.
|
||||||
|
|
||||||
|
On Windows, the close-on-exec flag is the inverse of ``HANDLE_FLAG_INHERIT``. File
|
||||||
|
descriptors are not inherited if the ``bInheritHandles`` parameter of
|
||||||
|
the ``CreateProcess()`` function is ``FALSE``, even if the
|
||||||
|
``HANDLE_FLAG_INHERIT`` flag is set. If ``bInheritHandles`` is ``TRUE``,
|
||||||
|
only file descriptors with ``HANDLE_FLAG_INHERIT`` flag set are
|
||||||
|
inherited, others are not.
|
||||||
|
|
||||||
|
On UNIX, the close-on-exec flag is ``O_CLOEXEC``. File descriptors with
|
||||||
|
the ``O_CLOEXEC`` flag set are closed at the execution of a new program
|
||||||
|
(ex: when calling ``execv()``).
|
||||||
|
|
||||||
|
The ``O_CLOEXEC`` flag has no effect on ``fork()``, all file descriptors
|
||||||
|
are inherited by the child process. Futhermore, most properties file
|
||||||
|
descriptors are shared between the parent and the child processes,
|
||||||
|
except file attributes which are duplicated (``O_CLOEXEC`` is the only
|
||||||
|
file attribute). Setting ``O_CLOEXEC`` flag of a file descriptor in the
|
||||||
|
child process does not change the ``O_CLOEXEC`` flag of the file
|
||||||
|
descriptor in the parent process.
|
||||||
|
|
||||||
|
|
||||||
|
Issues of the inheritance of file descriptors
|
||||||
|
---------------------------------------------
|
||||||
|
|
||||||
|
Inheritance of file descriptors causes issues. For example, closing a
|
||||||
|
file descriptor in the parent process does not release the resource
|
||||||
|
(file, socket, ...), because the file descriptor is still open in the
|
||||||
|
child process.
|
||||||
|
|
||||||
|
Leaking file descriptors is also a major security vulnerability. An
|
||||||
|
untrusted child process can read sensitive data like passwords and take
|
||||||
|
control of the parent process though leaked file descriptors. It is for
|
||||||
|
example a known vulnerability to escape from a chroot.
|
||||||
|
|
||||||
|
|
||||||
|
Non-blocking sockets
|
||||||
|
--------------------
|
||||||
|
|
||||||
|
To handle multiple network clients in a single thread, a multiplexing
|
||||||
|
function like ``select()`` can be used. For best performances, sockets
|
||||||
|
must be configured as non-blocking. Operations like ``send()`` and
|
||||||
|
``recv()`` return an ``EAGAIN`` or ``EWOULDBLOCK`` error if the
|
||||||
|
operation would block.
|
||||||
|
|
||||||
|
By default, newly created sockets are blocking. Setting the non-blocking
|
||||||
|
mode requires additional system calls.
|
||||||
|
|
||||||
|
On UNIX, the blocking flag is ``O_NONBLOCK``: a pipe and a socket are
|
||||||
|
non-blocking if the ``O_NONBLOCK`` flag is set.
|
||||||
|
|
||||||
|
|
||||||
|
Setting flags at the creation of the file descriptor
|
||||||
|
----------------------------------------------------
|
||||||
|
|
||||||
|
Windows and recent versions of other operating systems like Linux
|
||||||
|
support setting the close-on-exec flag directly at the creation of file
|
||||||
|
descriptors, and close-on-exec and blocking flags at the creation of
|
||||||
|
sockets.
|
||||||
|
|
||||||
|
Setting these flags at the creation is atomic and avoids additional
|
||||||
|
system calls.
|
||||||
|
|
||||||
|
|
||||||
|
Proposal
|
||||||
|
========
|
||||||
|
|
||||||
|
New cloexec And blocking Parameters
|
||||||
|
-----------------------------------
|
||||||
|
|
||||||
|
Add a new optional *cloexec* on functions creating file descriptors:
|
||||||
|
|
||||||
|
* ``io.FileIO``
|
||||||
|
* ``io.open()``
|
||||||
|
* ``open()``
|
||||||
|
* ``os.dup()``
|
||||||
|
* ``os.dup2()``
|
||||||
|
* ``os.fdopen()``
|
||||||
|
* ``os.open()``
|
||||||
|
* ``os.openpty()``
|
||||||
|
* ``os.pipe()``
|
||||||
|
* ``select.devpoll()``
|
||||||
|
* ``select.epoll()``
|
||||||
|
* ``select.kqueue()``
|
||||||
|
|
||||||
|
Add new optional *cloexec* and *blocking* parameters to functions
|
||||||
|
creating sockets:
|
||||||
|
|
||||||
|
* ``asyncore.dispatcher.create_socket()``
|
||||||
|
* ``socket.socket()``
|
||||||
|
* ``socket.socket.accept()``
|
||||||
|
* ``socket.socket.dup()``
|
||||||
|
* ``socket.socket.fromfd``
|
||||||
|
* ``socket.socketpair()``
|
||||||
|
|
||||||
|
The default value of *cloexec* is ``False`` and the default value of
|
||||||
|
*blocking* is ``True``.
|
||||||
|
|
||||||
|
The atomicity is not guaranteed. If the platform does not support
|
||||||
|
setting close-on-exec and blocking flags at the creation of the file
|
||||||
|
descriptor or socket, the flags are set using additional system calls.
|
||||||
|
|
||||||
|
|
||||||
|
New Functions
|
||||||
|
-------------
|
||||||
|
|
||||||
|
Add new functions the get and set the close-on-exec flag of a file
|
||||||
|
descriptor, available on all platforms:
|
||||||
|
|
||||||
|
* ``os.get_cloexec(fd:int) -> bool``
|
||||||
|
* ``os.set_cloexec(fd:int, cloexec: bool)``
|
||||||
|
|
||||||
|
Add new functions the get and set the blocking flag of a file
|
||||||
|
descriptor, only available on UNIX:
|
||||||
|
|
||||||
|
* ``os.get_blocking(fd:int) -> bool``
|
||||||
|
* ``os.set_blocking(fd:int, blocking: bool)``
|
||||||
|
|
||||||
|
|
||||||
|
Other Changes
|
||||||
|
-------------
|
||||||
|
|
||||||
|
The ``subprocess.Popen`` class must clear the close-on-exec flag of file
|
||||||
|
descriptors of the ``pass_fds`` parameter. The flag is cleared in the
|
||||||
|
child process before executing the program; the change does not change
|
||||||
|
the flag in the parent process.
|
||||||
|
|
||||||
|
The close-on-exec flag must also be set on private file descriptors and
|
||||||
|
sockets in the Python standard library. For example, on UNIX,
|
||||||
|
os.urandom() opens ``/dev/urandom`` to read some random bytes and the
|
||||||
|
file descriptor is closed at function exit. The file descriptor is not
|
||||||
|
expected to be inherited by child processes.
|
||||||
|
|
||||||
|
|
||||||
|
Rejected Alternatives
|
||||||
|
=====================
|
||||||
|
|
||||||
|
PEP 433
|
||||||
|
-------
|
||||||
|
|
||||||
|
The PEP 433 entitled "Easier suppression of file descriptor inheritance"
|
||||||
|
is a previous attempt proposing various other alternatives, but no
|
||||||
|
consensus could be reached.
|
||||||
|
|
||||||
|
This PEP has a well defined behaviour (the default value of the new
|
||||||
|
*cloexec* parameter is not configurable), is more conservative (no
|
||||||
|
backward compatibility issue), and is much simpler.
|
||||||
|
|
||||||
|
|
||||||
|
Add blocking parameter for file descriptors and use Windows overlapped I/O
|
||||||
|
--------------------------------------------------------------------------
|
||||||
|
|
||||||
|
Windows supports non-blocking operations on files using an extension of
|
||||||
|
the Windows API called "Overlapped I/O". Using this extension requires
|
||||||
|
to modify the Python standard library and applications to pass a
|
||||||
|
``OVERLAPPED`` structure and an event loop to wait for the completion of
|
||||||
|
operations.
|
||||||
|
|
||||||
|
This PEP only tries to expose portable flags on file descriptors and
|
||||||
|
sockets. Supporting overlapped I/O requires an abstraction providing a
|
||||||
|
high-level and portable API for asynchronous operations on files and
|
||||||
|
sockets. Overlapped I/O are out of the scope of this PEP.
|
||||||
|
|
||||||
|
UNIX supports non-blocking files, moreover recent versions of operating
|
||||||
|
systems support setting the non-blocking flag at the creation of a file
|
||||||
|
descriptor. It would be possible to add a new optional *blocking*
|
||||||
|
parameter to Python functions creating file descriptors. On Windows,
|
||||||
|
creating a file descriptor with ``blocking=False`` would raise a
|
||||||
|
``NotImplementedError``. This behaviour is not acceptable for the ``os``
|
||||||
|
module which is designed as a thin wrapper on the C functions of the
|
||||||
|
operating system. If a platform does not support a function, the
|
||||||
|
function should not be available on the platform. For example,
|
||||||
|
the ``os.fork()`` function is not available on Windows.
|
||||||
|
|
||||||
|
UNIX has more flag on file descriptors: ``O_DSYNC``, ``O_SYNC``,
|
||||||
|
``O_DIRECT``, etc. Adding all these flags complicates the signature and
|
||||||
|
the implementation of functions creating file descriptor like open().
|
||||||
|
Moreover, these flags do not work on any file type, and are not
|
||||||
|
portable.
|
||||||
|
|
||||||
|
For all these reasons, this alternative was rejected. The PEP 3156
|
||||||
|
proposes an abstraction for asynchronous I/O supporting non-blocking
|
||||||
|
files on Windows.
|
||||||
|
|
||||||
|
|
||||||
|
Links
|
||||||
|
=====
|
||||||
|
|
||||||
|
Python issues:
|
||||||
|
|
||||||
|
* `#10115: Support accept4() for atomic setting of flags at socket
|
||||||
|
creation <http://bugs.python.org/issue10115>`_
|
||||||
|
* `#12105: open() does not able to set flags, such as O_CLOEXEC
|
||||||
|
<http://bugs.python.org/issue12105>`_
|
||||||
|
* `#12107: TCP listening sockets created without FD_CLOEXEC flag
|
||||||
|
<http://bugs.python.org/issue12107>`_
|
||||||
|
* `#16850: Add "e" mode to open(): close-and-exec
|
||||||
|
(O_CLOEXEC) / O_NOINHERIT <http://bugs.python.org/issue16850>`_
|
||||||
|
* `#16860: Use O_CLOEXEC in the tempfile module
|
||||||
|
<http://bugs.python.org/issue16860>`_
|
||||||
|
* `#16946: subprocess: _close_open_fd_range_safe() does not set
|
||||||
|
close-on-exec flag on Linux < 2.6.23 if O_CLOEXEC is defined
|
||||||
|
<http://bugs.python.org/issue16946>`_
|
||||||
|
* `#17070: Use the new cloexec to improve security and avoid bugs
|
||||||
|
<http://bugs.python.org/issue17070>`_
|
||||||
|
|
||||||
|
Other links:
|
||||||
|
|
||||||
|
* `Secure File Descriptor Handling
|
||||||
|
<http://udrepper.livejournal.com/20407.html>`_ (Ulrich Drepper,
|
||||||
|
2008)
|
||||||
|
* `Ghosts of Unix past, part 2: Conflated designs
|
||||||
|
<http://lwn.net/Articles/412131/>`_ (Neil Brown, 2010) explains the
|
||||||
|
history of ``O_CLOEXEC`` and ``O_NONBLOCK`` flags
|
||||||
|
|
||||||
|
|
||||||
|
Copyright
|
||||||
|
=========
|
||||||
|
|
||||||
|
This document has been placed into the public domain.
|
||||||
|
|
|
@ -0,0 +1,408 @@
|
||||||
|
PEP: 447
|
||||||
|
Title: Add __locallookup__ method to metaclass
|
||||||
|
Version: $Revision$
|
||||||
|
Last-Modified: $Date$
|
||||||
|
Author: Ronald Oussoren <ronaldoussoren@mac.com>
|
||||||
|
Status: Draft
|
||||||
|
Type: Standards Track
|
||||||
|
Content-Type: text/x-rst
|
||||||
|
Created: 12-Jun-2013
|
||||||
|
Post-History: 2-Jul-2013, 15-Jul-2013, 29-Jul-2013
|
||||||
|
|
||||||
|
|
||||||
|
Abstract
|
||||||
|
========
|
||||||
|
|
||||||
|
Currently ``object.__getattribute__`` and ``super.__getattribute__`` peek
|
||||||
|
in the ``__dict__`` of classes on the MRO for a class when looking for
|
||||||
|
an attribute. This PEP adds an optional ``__locallookup__`` method to
|
||||||
|
a metaclass that can be used to override this behavior.
|
||||||
|
|
||||||
|
Rationale
|
||||||
|
=========
|
||||||
|
|
||||||
|
It is currently not possible to influence how the `super class`_ looks
|
||||||
|
up attributes (that is, ``super.__getattribute__`` unconditionally
|
||||||
|
peeks in the class ``__dict__``), and that can be problematic for
|
||||||
|
dynamic classes that can grow new methods on demand.
|
||||||
|
|
||||||
|
The ``__locallookup__`` method makes it possible to dynamicly add
|
||||||
|
attributes even when looking them up using the `super class`_.
|
||||||
|
|
||||||
|
The new method affects ``object.__getattribute__`` (and
|
||||||
|
`PyObject_GenericGetAttr`_) as well for consistency.
|
||||||
|
|
||||||
|
Background
|
||||||
|
----------
|
||||||
|
|
||||||
|
The current behavior of ``super.__getattribute__`` causes problems for
|
||||||
|
classes that are dynamic proxies for other (non-Python) classes or types,
|
||||||
|
an example of which is `PyObjC`_. PyObjC creates a Python class for every
|
||||||
|
class in the Objective-C runtime, and looks up methods in the Objective-C
|
||||||
|
runtime when they are used. This works fine for normal access, but doesn't
|
||||||
|
work for access with ``super`` objects. Because of this PyObjC currently
|
||||||
|
includes a custom ``super`` that must be used with its classes.
|
||||||
|
|
||||||
|
The API in this PEP makes it possible to remove the custom ``super`` and
|
||||||
|
simplifies the implementation because the custom lookup behavior can be
|
||||||
|
added in a central location.
|
||||||
|
|
||||||
|
|
||||||
|
The superclass attribute lookup hook
|
||||||
|
====================================
|
||||||
|
|
||||||
|
Both ``super.__getattribute__`` and ``object.__getattribute__`` (or
|
||||||
|
`PyObject_GenericGetAttr`_ in C code) walk an object's MRO and peek in the
|
||||||
|
class' ``__dict__`` to look up attributes. A way to affect this lookup is
|
||||||
|
using a method on the meta class for the type, that by default looks up
|
||||||
|
the name in the class ``__dict__``.
|
||||||
|
|
||||||
|
In Python code
|
||||||
|
--------------
|
||||||
|
|
||||||
|
A meta type can define a method ``__locallookup__`` that is called during
|
||||||
|
attribute resolution by both ``super.__getattribute__`` and ``object.__getattribute``::
|
||||||
|
|
||||||
|
class MetaType(type):
|
||||||
|
def __locallookup__(cls, name):
|
||||||
|
try:
|
||||||
|
return cls.__dict__[name]
|
||||||
|
except KeyError:
|
||||||
|
raise AttributeError(name) from None
|
||||||
|
|
||||||
|
The ``__locallookup__`` method has as its arguments a class and the name of the attribute
|
||||||
|
that is looked up. It should return the value of the attribute without invoking descriptors,
|
||||||
|
or raise `AttributeError`_ when the name cannot be found.
|
||||||
|
|
||||||
|
The `type`_ class provides a default implementation for ``__locallookup__``, that
|
||||||
|
looks up the name in the class dictionary.
|
||||||
|
|
||||||
|
Example usage
|
||||||
|
.............
|
||||||
|
|
||||||
|
The code below implements a silly metaclass that redirects attribute lookup to uppercase
|
||||||
|
versions of names::
|
||||||
|
|
||||||
|
class UpperCaseAccess (type):
|
||||||
|
def __locallookup__(cls, name):
|
||||||
|
return cls.__dict__[name.upper()]
|
||||||
|
|
||||||
|
class SillyObject (metaclass=UpperCaseAccess):
|
||||||
|
def m(self):
|
||||||
|
return 42
|
||||||
|
|
||||||
|
def M(self):
|
||||||
|
return "fourtytwo"
|
||||||
|
|
||||||
|
obj = SillyObject()
|
||||||
|
assert obj.m() == "fortytwo"
|
||||||
|
|
||||||
|
|
||||||
|
In C code
|
||||||
|
---------
|
||||||
|
|
||||||
|
A new slot ``tp_locallookup`` is added to the ``PyTypeObject`` struct, this slot
|
||||||
|
corresponds to the ``__locallookup__`` method on `type`_.
|
||||||
|
|
||||||
|
The slot has the following prototype::
|
||||||
|
|
||||||
|
PyObject* (*locallookupfunc)(PyTypeObject* cls, PyObject* name);
|
||||||
|
|
||||||
|
This method should lookup *name* in the namespace of *cls*, without looking at superclasses,
|
||||||
|
and should not invoke descriptors. The method returns ``NULL`` without setting an exception
|
||||||
|
when the *name* cannot be found, and returns a new reference otherwise (not a borrowed reference).
|
||||||
|
|
||||||
|
Use of this hook by the interpreter
|
||||||
|
-----------------------------------
|
||||||
|
|
||||||
|
The new method is required for metatypes and as such is defined on `type_`. Both
|
||||||
|
``super.__getattribute__`` and ``object.__getattribute__``/`PyObject_GenericGetAttr`_
|
||||||
|
(through ``_PyType_Lookup``) use the this ``__locallookup__`` method when walking
|
||||||
|
the MRO.
|
||||||
|
|
||||||
|
Other changes to the implementation
|
||||||
|
-----------------------------------
|
||||||
|
|
||||||
|
The change for `PyObject_GenericGetAttr`_ will be done by changing the private function
|
||||||
|
``_PyType_Lookup``. This currently returns a borrowed reference, but must return a new
|
||||||
|
reference when the ``__locallookup__`` method is present. Because of this ``_PyType_Lookup``
|
||||||
|
will be renamed to ``_PyType_LookupName``, this will cause compile-time errors for all out-of-tree
|
||||||
|
users of this private API.
|
||||||
|
|
||||||
|
The attribute lookup cache in ``Objects/typeobject.c`` is disabled for classes that have a
|
||||||
|
metaclass that overrides ``__locallookup__``, because using the cache might not be valid
|
||||||
|
for such classes.
|
||||||
|
|
||||||
|
Performance impact
|
||||||
|
------------------
|
||||||
|
|
||||||
|
The pybench output below compares an implementation of this PEP with the regular
|
||||||
|
source tree, both based on changeset a5681f50bae2, run on an idle machine an
|
||||||
|
Core i7 processor running Centos 6.4.
|
||||||
|
|
||||||
|
Even though the machine was idle there were clear differences between runs,
|
||||||
|
I've seen difference in "minimum time" vary from -0.1% to +1.5%, with simular
|
||||||
|
(but slightly smaller) differences in the "average time" difference.
|
||||||
|
|
||||||
|
::
|
||||||
|
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
PYBENCH 2.1
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
* using CPython 3.4.0a0 (default, Jul 29 2013, 13:01:34) [GCC 4.4.7 20120313 (Red Hat 4.4.7-3)]
|
||||||
|
* disabled garbage collection
|
||||||
|
* system check interval set to maximum: 2147483647
|
||||||
|
* using timer: time.perf_counter
|
||||||
|
* timer: resolution=1e-09, implementation=clock_gettime(CLOCK_MONOTONIC)
|
||||||
|
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
Benchmark: pep447.pybench
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
|
||||||
|
Rounds: 10
|
||||||
|
Warp: 10
|
||||||
|
Timer: time.perf_counter
|
||||||
|
|
||||||
|
Machine Details:
|
||||||
|
Platform ID: Linux-2.6.32-358.114.1.openstack.el6.x86_64-x86_64-with-centos-6.4-Final
|
||||||
|
Processor: x86_64
|
||||||
|
|
||||||
|
Python:
|
||||||
|
Implementation: CPython
|
||||||
|
Executable: /tmp/default-pep447/bin/python3
|
||||||
|
Version: 3.4.0a0
|
||||||
|
Compiler: GCC 4.4.7 20120313 (Red Hat 4.4.7-3)
|
||||||
|
Bits: 64bit
|
||||||
|
Build: Jul 29 2013 14:09:12 (#default)
|
||||||
|
Unicode: UCS4
|
||||||
|
|
||||||
|
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
Comparing with: default.pybench
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
|
||||||
|
Rounds: 10
|
||||||
|
Warp: 10
|
||||||
|
Timer: time.perf_counter
|
||||||
|
|
||||||
|
Machine Details:
|
||||||
|
Platform ID: Linux-2.6.32-358.114.1.openstack.el6.x86_64-x86_64-with-centos-6.4-Final
|
||||||
|
Processor: x86_64
|
||||||
|
|
||||||
|
Python:
|
||||||
|
Implementation: CPython
|
||||||
|
Executable: /tmp/default/bin/python3
|
||||||
|
Version: 3.4.0a0
|
||||||
|
Compiler: GCC 4.4.7 20120313 (Red Hat 4.4.7-3)
|
||||||
|
Bits: 64bit
|
||||||
|
Build: Jul 29 2013 13:01:34 (#default)
|
||||||
|
Unicode: UCS4
|
||||||
|
|
||||||
|
|
||||||
|
Test minimum run-time average run-time
|
||||||
|
this other diff this other diff
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
BuiltinFunctionCalls: 45ms 44ms +1.3% 45ms 44ms +1.3%
|
||||||
|
BuiltinMethodLookup: 26ms 27ms -2.4% 27ms 27ms -2.2%
|
||||||
|
CompareFloats: 33ms 34ms -0.7% 33ms 34ms -1.1%
|
||||||
|
CompareFloatsIntegers: 66ms 67ms -0.9% 66ms 67ms -0.8%
|
||||||
|
CompareIntegers: 51ms 50ms +0.9% 51ms 50ms +0.8%
|
||||||
|
CompareInternedStrings: 34ms 33ms +0.4% 34ms 34ms -0.4%
|
||||||
|
CompareLongs: 29ms 29ms -0.1% 29ms 29ms -0.0%
|
||||||
|
CompareStrings: 43ms 44ms -1.8% 44ms 44ms -1.8%
|
||||||
|
ComplexPythonFunctionCalls: 44ms 42ms +3.9% 44ms 42ms +4.1%
|
||||||
|
ConcatStrings: 33ms 33ms -0.4% 33ms 33ms -1.0%
|
||||||
|
CreateInstances: 47ms 48ms -2.9% 47ms 49ms -3.4%
|
||||||
|
CreateNewInstances: 35ms 36ms -2.5% 36ms 36ms -2.5%
|
||||||
|
CreateStringsWithConcat: 69ms 70ms -0.7% 69ms 70ms -0.9%
|
||||||
|
DictCreation: 52ms 50ms +3.1% 52ms 50ms +3.0%
|
||||||
|
DictWithFloatKeys: 40ms 44ms -10.1% 43ms 45ms -5.8%
|
||||||
|
DictWithIntegerKeys: 32ms 36ms -11.2% 35ms 37ms -4.6%
|
||||||
|
DictWithStringKeys: 29ms 34ms -15.7% 35ms 40ms -11.0%
|
||||||
|
ForLoops: 30ms 29ms +2.2% 30ms 29ms +2.2%
|
||||||
|
IfThenElse: 38ms 41ms -6.7% 38ms 41ms -6.9%
|
||||||
|
ListSlicing: 36ms 36ms -0.7% 36ms 37ms -1.3%
|
||||||
|
NestedForLoops: 43ms 45ms -3.1% 43ms 45ms -3.2%
|
||||||
|
NestedListComprehensions: 39ms 40ms -1.7% 39ms 40ms -2.1%
|
||||||
|
NormalClassAttribute: 86ms 82ms +5.1% 86ms 82ms +5.0%
|
||||||
|
NormalInstanceAttribute: 42ms 42ms +0.3% 42ms 42ms +0.0%
|
||||||
|
PythonFunctionCalls: 39ms 38ms +3.5% 39ms 38ms +2.8%
|
||||||
|
PythonMethodCalls: 51ms 49ms +3.0% 51ms 50ms +2.8%
|
||||||
|
Recursion: 67ms 68ms -1.4% 67ms 68ms -1.4%
|
||||||
|
SecondImport: 41ms 36ms +12.5% 41ms 36ms +12.6%
|
||||||
|
SecondPackageImport: 45ms 40ms +13.1% 45ms 40ms +13.2%
|
||||||
|
SecondSubmoduleImport: 92ms 95ms -2.4% 95ms 98ms -3.6%
|
||||||
|
SimpleComplexArithmetic: 28ms 28ms -0.1% 28ms 28ms -0.2%
|
||||||
|
SimpleDictManipulation: 57ms 57ms -1.0% 57ms 58ms -1.0%
|
||||||
|
SimpleFloatArithmetic: 29ms 28ms +4.7% 29ms 28ms +4.9%
|
||||||
|
SimpleIntFloatArithmetic: 37ms 41ms -8.5% 37ms 41ms -8.7%
|
||||||
|
SimpleIntegerArithmetic: 37ms 41ms -9.4% 37ms 42ms -10.2%
|
||||||
|
SimpleListComprehensions: 33ms 33ms -1.9% 33ms 34ms -2.9%
|
||||||
|
SimpleListManipulation: 28ms 30ms -4.3% 29ms 30ms -4.1%
|
||||||
|
SimpleLongArithmetic: 26ms 26ms +0.5% 26ms 26ms +0.5%
|
||||||
|
SmallLists: 40ms 40ms +0.1% 40ms 40ms +0.1%
|
||||||
|
SmallTuples: 46ms 47ms -2.4% 46ms 48ms -3.0%
|
||||||
|
SpecialClassAttribute: 126ms 120ms +4.7% 126ms 121ms +4.4%
|
||||||
|
SpecialInstanceAttribute: 42ms 42ms +0.6% 42ms 42ms +0.8%
|
||||||
|
StringMappings: 94ms 91ms +3.9% 94ms 91ms +3.8%
|
||||||
|
StringPredicates: 48ms 49ms -1.7% 48ms 49ms -2.1%
|
||||||
|
StringSlicing: 45ms 45ms +1.4% 46ms 45ms +1.5%
|
||||||
|
TryExcept: 23ms 22ms +4.9% 23ms 22ms +4.8%
|
||||||
|
TryFinally: 32ms 32ms -0.1% 32ms 32ms +0.1%
|
||||||
|
TryRaiseExcept: 17ms 17ms +0.9% 17ms 17ms +0.5%
|
||||||
|
TupleSlicing: 49ms 48ms +1.1% 49ms 49ms +1.0%
|
||||||
|
WithFinally: 48ms 47ms +2.3% 48ms 47ms +2.4%
|
||||||
|
WithRaiseExcept: 45ms 44ms +0.8% 45ms 45ms +0.5%
|
||||||
|
-------------------------------------------------------------------------------
|
||||||
|
Totals: 2284ms 2287ms -0.1% 2306ms 2308ms -0.1%
|
||||||
|
|
||||||
|
(this=pep447.pybench, other=default.pybench)
|
||||||
|
|
||||||
|
|
||||||
|
A run of the benchmark suite (with option "-b 2n3") also seems to indicate that
|
||||||
|
the performance impact is minimal::
|
||||||
|
|
||||||
|
Report on Linux fangorn.local 2.6.32-358.114.1.openstack.el6.x86_64 #1 SMP Wed Jul 3 02:11:25 EDT 2013 x86_64 x86_64
|
||||||
|
Total CPU cores: 8
|
||||||
|
|
||||||
|
### call_method_slots ###
|
||||||
|
Min: 0.304120 -> 0.282791: 1.08x faster
|
||||||
|
Avg: 0.304394 -> 0.282906: 1.08x faster
|
||||||
|
Significant (t=2329.92)
|
||||||
|
Stddev: 0.00016 -> 0.00004: 4.1814x smaller
|
||||||
|
|
||||||
|
### call_simple ###
|
||||||
|
Min: 0.249268 -> 0.221175: 1.13x faster
|
||||||
|
Avg: 0.249789 -> 0.221387: 1.13x faster
|
||||||
|
Significant (t=2770.11)
|
||||||
|
Stddev: 0.00012 -> 0.00013: 1.1101x larger
|
||||||
|
|
||||||
|
### django_v2 ###
|
||||||
|
Min: 0.632590 -> 0.601519: 1.05x faster
|
||||||
|
Avg: 0.635085 -> 0.602653: 1.05x faster
|
||||||
|
Significant (t=321.32)
|
||||||
|
Stddev: 0.00087 -> 0.00051: 1.6933x smaller
|
||||||
|
|
||||||
|
### fannkuch ###
|
||||||
|
Min: 1.033181 -> 0.999779: 1.03x faster
|
||||||
|
Avg: 1.036457 -> 1.001840: 1.03x faster
|
||||||
|
Significant (t=260.31)
|
||||||
|
Stddev: 0.00113 -> 0.00070: 1.6112x smaller
|
||||||
|
|
||||||
|
### go ###
|
||||||
|
Min: 0.526714 -> 0.544428: 1.03x slower
|
||||||
|
Avg: 0.529649 -> 0.547626: 1.03x slower
|
||||||
|
Significant (t=-93.32)
|
||||||
|
Stddev: 0.00136 -> 0.00136: 1.0028x smaller
|
||||||
|
|
||||||
|
### iterative_count ###
|
||||||
|
Min: 0.109748 -> 0.116513: 1.06x slower
|
||||||
|
Avg: 0.109816 -> 0.117202: 1.07x slower
|
||||||
|
Significant (t=-357.08)
|
||||||
|
Stddev: 0.00008 -> 0.00019: 2.3664x larger
|
||||||
|
|
||||||
|
### json_dump_v2 ###
|
||||||
|
Min: 2.554462 -> 2.609141: 1.02x slower
|
||||||
|
Avg: 2.564472 -> 2.620013: 1.02x slower
|
||||||
|
Significant (t=-76.93)
|
||||||
|
Stddev: 0.00538 -> 0.00481: 1.1194x smaller
|
||||||
|
|
||||||
|
### meteor_contest ###
|
||||||
|
Min: 0.196336 -> 0.191925: 1.02x faster
|
||||||
|
Avg: 0.196878 -> 0.192698: 1.02x faster
|
||||||
|
Significant (t=61.86)
|
||||||
|
Stddev: 0.00053 -> 0.00041: 1.2925x smaller
|
||||||
|
|
||||||
|
### nbody ###
|
||||||
|
Min: 0.228039 -> 0.235551: 1.03x slower
|
||||||
|
Avg: 0.228857 -> 0.236052: 1.03x slower
|
||||||
|
Significant (t=-54.15)
|
||||||
|
Stddev: 0.00130 -> 0.00029: 4.4810x smaller
|
||||||
|
|
||||||
|
### pathlib ###
|
||||||
|
Min: 0.108501 -> 0.105339: 1.03x faster
|
||||||
|
Avg: 0.109084 -> 0.105619: 1.03x faster
|
||||||
|
Significant (t=311.08)
|
||||||
|
Stddev: 0.00022 -> 0.00011: 1.9314x smaller
|
||||||
|
|
||||||
|
### regex_effbot ###
|
||||||
|
Min: 0.057905 -> 0.056447: 1.03x faster
|
||||||
|
Avg: 0.058055 -> 0.056760: 1.02x faster
|
||||||
|
Significant (t=79.22)
|
||||||
|
Stddev: 0.00006 -> 0.00015: 2.7741x larger
|
||||||
|
|
||||||
|
### silent_logging ###
|
||||||
|
Min: 0.070810 -> 0.072436: 1.02x slower
|
||||||
|
Avg: 0.070899 -> 0.072609: 1.02x slower
|
||||||
|
Significant (t=-191.59)
|
||||||
|
Stddev: 0.00004 -> 0.00008: 2.2640x larger
|
||||||
|
|
||||||
|
### spectral_norm ###
|
||||||
|
Min: 0.290255 -> 0.299286: 1.03x slower
|
||||||
|
Avg: 0.290335 -> 0.299541: 1.03x slower
|
||||||
|
Significant (t=-572.10)
|
||||||
|
Stddev: 0.00005 -> 0.00015: 2.8547x larger
|
||||||
|
|
||||||
|
### threaded_count ###
|
||||||
|
Min: 0.107215 -> 0.115206: 1.07x slower
|
||||||
|
Avg: 0.107488 -> 0.115996: 1.08x slower
|
||||||
|
Significant (t=-109.39)
|
||||||
|
Stddev: 0.00016 -> 0.00076: 4.8665x larger
|
||||||
|
|
||||||
|
The following not significant results are hidden, use -v to show them:
|
||||||
|
call_method, call_method_unknown, chaos, fastpickle, fastunpickle, float, formatted_logging, hexiom2, json_load, normal_startup, nqueens, pidigits, raytrace, regex_compile, regex_v8, richards, simple_logging, startup_nosite, telco, unpack_sequence.
|
||||||
|
|
||||||
|
|
||||||
|
Alternative proposals
|
||||||
|
---------------------
|
||||||
|
|
||||||
|
``__getattribute_super__``
|
||||||
|
..........................
|
||||||
|
|
||||||
|
An earlier version of this PEP used the following static method on classes::
|
||||||
|
|
||||||
|
def __getattribute_super__(cls, name, object, owner): pass
|
||||||
|
|
||||||
|
This method performed name lookup as well as invoking descriptors and was necessarily
|
||||||
|
limited to working only with ``super.__getattribute__``.
|
||||||
|
|
||||||
|
|
||||||
|
Reuse ``tp_getattro``
|
||||||
|
.....................
|
||||||
|
|
||||||
|
It would be nice to avoid adding a new slot, thus keeping the API simpler and
|
||||||
|
easier to understand. A comment on `Issue 18181`_ asked about reusing the
|
||||||
|
``tp_getattro`` slot, that is super could call the ``tp_getattro`` slot of all
|
||||||
|
methods along the MRO.
|
||||||
|
|
||||||
|
That won't work because ``tp_getattro`` will look in the instance
|
||||||
|
``__dict__`` before it tries to resolve attributes using classes in the MRO.
|
||||||
|
This would mean that using ``tp_getattro`` instead of peeking the class
|
||||||
|
dictionaries changes the semantics of the `super class`_.
|
||||||
|
|
||||||
|
|
||||||
|
References
|
||||||
|
==========
|
||||||
|
|
||||||
|
* `Issue 18181`_ contains a prototype implementation
|
||||||
|
|
||||||
|
Copyright
|
||||||
|
=========
|
||||||
|
|
||||||
|
This document has been placed in the public domain.
|
||||||
|
|
||||||
|
.. _`Issue 18181`: http://bugs.python.org/issue18181
|
||||||
|
|
||||||
|
.. _`super class`: http://docs.python.org/3/library/functions.html#super
|
||||||
|
|
||||||
|
.. _`NotImplemented`: http://docs.python.org/3/library/constants.html#NotImplemented
|
||||||
|
|
||||||
|
.. _`PyObject_GenericGetAttr`: http://docs.python.org/3/c-api/object.html#PyObject_GenericGetAttr
|
||||||
|
|
||||||
|
.. _`type`: http://docs.python.org/3/library/functions.html#type
|
||||||
|
|
||||||
|
.. _`AttributeError`: http://docs.python.org/3/library/exceptions.html#AttributeError
|
||||||
|
|
||||||
|
.. _`PyObjC`: http://pyobjc.sourceforge.net/
|
||||||
|
|
||||||
|
.. _`classmethod`: http://docs.python.org/3/library/functions.html#classmethod
|
|
@ -0,0 +1,247 @@
|
||||||
|
PEP: 448
|
||||||
|
Title: Additional Unpacking Generalizations
|
||||||
|
Version: $Revision$
|
||||||
|
Last-Modified: $Date$
|
||||||
|
Author: Joshua Landau <joshua@landau.ws>
|
||||||
|
Discussions-To: python-ideas@python.org
|
||||||
|
Status: Draft
|
||||||
|
Type: Standards Track
|
||||||
|
Content-Type: text/x-rst
|
||||||
|
Created: 29-Jun-2013
|
||||||
|
Python-Version: 3.4
|
||||||
|
Post-History:
|
||||||
|
|
||||||
|
|
||||||
|
Abstract
|
||||||
|
========
|
||||||
|
|
||||||
|
This PEP proposes extended usages of the ``*`` iterable unpacking
|
||||||
|
operator to allow unpacking in more positions, an arbitrary number of
|
||||||
|
times, and in several additional circumstances.
|
||||||
|
|
||||||
|
Specifically:
|
||||||
|
|
||||||
|
Arbitrarily positioned unpacking operators::
|
||||||
|
|
||||||
|
>>> print(*[1], *[2], 3)
|
||||||
|
1 2 3
|
||||||
|
>>> dict(**{'x': 1}, y=3, **{'z': 2})
|
||||||
|
{'x': 1, 'y': 2, 'z': 3}
|
||||||
|
|
||||||
|
Function calls currently have the restriction that keyword arguments
|
||||||
|
must follow positional arguments and ``**`` unpackings must additionally
|
||||||
|
follow ``*`` unpackings. Because of the new levity for ``*`` and ``**``
|
||||||
|
unpackings, it may be advisable to lift some or all of these
|
||||||
|
restrictions.
|
||||||
|
|
||||||
|
As currently, if an argument is given multiple times - such as a
|
||||||
|
positional argument given both positionally and by keyword - a
|
||||||
|
TypeError is raised.
|
||||||
|
|
||||||
|
Unpacking is proposed to be allowed inside tuples, lists, sets,
|
||||||
|
dictionaries and comprehensions::
|
||||||
|
|
||||||
|
>>> *range(4), 4
|
||||||
|
(0, 1, 2, 3, 4)
|
||||||
|
>>> [*range(4), 4]
|
||||||
|
[0, 1, 2, 3, 4]
|
||||||
|
>>> {*range(4), 4}
|
||||||
|
{0, 1, 2, 3, 4}
|
||||||
|
>>> {'x': 1, **{'y': 2}}
|
||||||
|
{'x': 1, 'y': 2}
|
||||||
|
|
||||||
|
>>> ranges = [range(i) for i in range(5)]
|
||||||
|
>>> [*item for item in ranges]
|
||||||
|
[0, 0, 1, 0, 1, 2, 0, 1, 2, 3]
|
||||||
|
|
||||||
|
|
||||||
|
Rationale
|
||||||
|
=========
|
||||||
|
|
||||||
|
Current usage of the ``*`` iterable unpacking operator features
|
||||||
|
unnecessary restrictions that can harm readability.
|
||||||
|
|
||||||
|
Unpacking multiple times has an obvious rationale. When you want to
|
||||||
|
unpack several iterables into a function definition or follow an unpack
|
||||||
|
with more positional arguments, the most natural way would be to write::
|
||||||
|
|
||||||
|
function(**kw_arguments, **more_arguments)
|
||||||
|
|
||||||
|
function(*arguments, argument)
|
||||||
|
|
||||||
|
Simple examples where this is useful are ``print`` and ``str.format``.
|
||||||
|
Instead, you could be forced to write::
|
||||||
|
|
||||||
|
kwargs = dict(kw_arguments)
|
||||||
|
kwargs.update(more_arguments)
|
||||||
|
function(**kwargs)
|
||||||
|
|
||||||
|
args = list(arguments)
|
||||||
|
args.append(arg)
|
||||||
|
function(*args)
|
||||||
|
|
||||||
|
or, if you know to do so::
|
||||||
|
|
||||||
|
from collections import ChainMap
|
||||||
|
function(**ChainMap(more_arguments, arguments))
|
||||||
|
|
||||||
|
from itertools import chain
|
||||||
|
function(*chain(args, [arg]))
|
||||||
|
|
||||||
|
which add unnecessary line-noise and, with the first methods, causes
|
||||||
|
duplication of work.
|
||||||
|
|
||||||
|
|
||||||
|
There are two primary rationales for unpacking inside of containers.
|
||||||
|
Firstly there is a symmetry of assignment, where ``fst, *other, lst =
|
||||||
|
elems`` and ``elems = fst, *other, lst`` are approximate inverses,
|
||||||
|
ignoring the specifics of types. This, in effect, simplifies the
|
||||||
|
language by removing special cases.
|
||||||
|
|
||||||
|
Secondly, it vastly simplifies types of "addition" such as combining
|
||||||
|
dictionaries, and does so in an unambiguous and well-defined way::
|
||||||
|
|
||||||
|
combination = {**first_dictionary, "x": 1, "y": 2}
|
||||||
|
|
||||||
|
instead of::
|
||||||
|
|
||||||
|
combination = first_dictionary.copy()
|
||||||
|
combination.update({"x": 1, "y": 2})
|
||||||
|
|
||||||
|
which is especially important in contexts where expressions are
|
||||||
|
preferred. This is also useful as a more readable way of summing
|
||||||
|
iterables into a list, such as ``my_list + list(my_tuple) +
|
||||||
|
list(my_range)`` which is now equivalent to just ``[*my_list,
|
||||||
|
*my_tuple, *my_range]``.
|
||||||
|
|
||||||
|
|
||||||
|
The addition of unpacking to comprehensions is a logical extension.
|
||||||
|
It's usage will primarily be a neat replacement for ``[i for j in
|
||||||
|
2D_list for i in j]``, as the more readable ``[*l for l in 2D_list]``.
|
||||||
|
Other uses are possible, but expected to occur rarely.
|
||||||
|
|
||||||
|
|
||||||
|
Specification
|
||||||
|
=============
|
||||||
|
|
||||||
|
Function calls may accept an unbound number of ``*`` and ``**``
|
||||||
|
unpackings. There will be no restriction of the order of positional
|
||||||
|
arguments with relation to ``*`` unpackings nor any restriction of the
|
||||||
|
order of keyword arguments with relation to ``**`` unpackings.
|
||||||
|
|
||||||
|
Function calls currently have the restriction that keyword arguments
|
||||||
|
must follow positional arguments and ``**`` unpackings must additionally
|
||||||
|
follow ``*`` unpackings. Because of the new levity for ``*`` and ``**``
|
||||||
|
unpackings, it may be advisable to list some or all of these
|
||||||
|
restrictions.
|
||||||
|
|
||||||
|
As currently, if an argument is given multiple times - such as a
|
||||||
|
positional argument given both positionally and by keyword - a
|
||||||
|
TypeError is raised.
|
||||||
|
|
||||||
|
If the restrictions are kept, a function call will look like this::
|
||||||
|
|
||||||
|
function(
|
||||||
|
argument or *args, argument or *args, ...,
|
||||||
|
kwargument or *args, kwargument or *args, ...,
|
||||||
|
kwargument or **kwargs, kwargument or **kwargs, ...
|
||||||
|
)
|
||||||
|
|
||||||
|
If they are removed completely, a function call will look like this::
|
||||||
|
|
||||||
|
function(
|
||||||
|
argument or keyword_argument or *args or **kwargs,
|
||||||
|
argument or keyword_argument or *args or **kwargs,
|
||||||
|
...
|
||||||
|
)
|
||||||
|
|
||||||
|
|
||||||
|
Tuples, lists, sets and dictionaries will allow unpacking. This will
|
||||||
|
act as if the elements from unpacked items were inserted in order at
|
||||||
|
the site of unpacking, much as happens in unpacking in a function-call.
|
||||||
|
Dictionaries require ``**`` unpacking; all the others require ``*`` unpacking.
|
||||||
|
A dictionary's key remain in a right-to-left priority order, so
|
||||||
|
``{**{'a': 1}, 'a': 2, **{'a': 3}}`` evaluates to ``{'a': 3}``. There
|
||||||
|
is no restriction on the number or position of unpackings.
|
||||||
|
|
||||||
|
Comprehensions, by simple extension, will support unpacking. As before,
|
||||||
|
dictionaries require ``**`` unpacking, all the others require ``*``
|
||||||
|
unpacking and key priorities are unchanged.
|
||||||
|
|
||||||
|
Examples include::
|
||||||
|
|
||||||
|
{*[1, 2, 3], 4, 5, *{6, 7, 8}}
|
||||||
|
|
||||||
|
(*e for e in [[1], [3, 4, 5], [2]])
|
||||||
|
|
||||||
|
{**dictionary for dictionary in (globals(), locals())}
|
||||||
|
|
||||||
|
{**locals(), "override": None}
|
||||||
|
|
||||||
|
|
||||||
|
Disadvantages
|
||||||
|
=============
|
||||||
|
|
||||||
|
If the current restrictions for function call arguments (keyword
|
||||||
|
arguments must follow positional arguments and ``**`` unpackings must
|
||||||
|
additionally follow ``*`` unpackings) are kept, the allowable orders
|
||||||
|
for arguments in a function call is more complicated than before.
|
||||||
|
The simplest explanation for the rules may be "positional arguments
|
||||||
|
come first and keyword arguments follow, but ``*`` unpackings are
|
||||||
|
allowed after keyword arguments".
|
||||||
|
|
||||||
|
If the current restrictions are lifted, there are no obvious gains to
|
||||||
|
code as the only new orders that are allowed look silly: ``f(a, e=e,
|
||||||
|
d=d, b, c)`` being a simpler example.
|
||||||
|
|
||||||
|
|
||||||
|
Whilst ``*elements, = iterable`` causes ``elements`` to be a list,
|
||||||
|
``elements = *iterable,`` causes ``elements`` to be a tuple. The
|
||||||
|
reason for this may not be obvious at first glance and may confuse
|
||||||
|
people unfamiliar with the construct.
|
||||||
|
|
||||||
|
|
||||||
|
Implementation
|
||||||
|
==============
|
||||||
|
|
||||||
|
An implementation for an old version of Python 3 is found at Issue
|
||||||
|
2292 on bug tracker [1]_, although several changes should be made:
|
||||||
|
|
||||||
|
- It has yet to be updated to the most recent Python version
|
||||||
|
|
||||||
|
- It features a now redundant replacement for "yield from" which
|
||||||
|
should be removed
|
||||||
|
|
||||||
|
- It also loses support for calling function with keyword arguments before
|
||||||
|
positional arguments, which is an unnecessary backwards-incompatible change
|
||||||
|
|
||||||
|
- If the restrictions on the order of arguments in a function call are
|
||||||
|
partially or fully lifted, they would need to be included
|
||||||
|
|
||||||
|
|
||||||
|
References
|
||||||
|
==========
|
||||||
|
|
||||||
|
.. [1] Issue 2292, "Missing `*`-unpacking generalizations", Thomas Wouters
|
||||||
|
(http://bugs.python.org/issue2292)
|
||||||
|
|
||||||
|
.. [2] Discussion on Python-ideas list,
|
||||||
|
"list / array comprehensions extension", Alexander Heger
|
||||||
|
(http://mail.python.org/pipermail/python-ideas/2011-December/013097.html)
|
||||||
|
|
||||||
|
|
||||||
|
Copyright
|
||||||
|
=========
|
||||||
|
|
||||||
|
This document has been placed in the public domain.
|
||||||
|
|
||||||
|
|
||||||
|
|
||||||
|
..
|
||||||
|
Local Variables:
|
||||||
|
mode: indented-text
|
||||||
|
indent-tabs-mode: nil
|
||||||
|
sentence-end-double-space: t
|
||||||
|
fill-column: 70
|
||||||
|
coding: utf-8
|
||||||
|
End:
|
|
@ -0,0 +1,50 @@
|
||||||
|
import os, fcntl, sys, errno
|
||||||
|
|
||||||
|
def get_cloexec(fd):
|
||||||
|
try:
|
||||||
|
flags = fcntl.fcntl(fd, fcntl.F_GETFD)
|
||||||
|
return bool(flags & fcntl.FD_CLOEXEC)
|
||||||
|
except IOError as err:
|
||||||
|
if err.errno == errno.EBADF:
|
||||||
|
return '<invalid file descriptor>'
|
||||||
|
else:
|
||||||
|
return str(err)
|
||||||
|
|
||||||
|
def set_cloexec(fd):
|
||||||
|
flags = fcntl.fcntl(fd, fcntl.F_GETFD)
|
||||||
|
flags |= fcntl.FD_CLOEXEC
|
||||||
|
fcntl.fcntl(fd, fcntl.F_SETFD, flags)
|
||||||
|
|
||||||
|
def main():
|
||||||
|
f = open(__file__, "rb")
|
||||||
|
fd = f.fileno()
|
||||||
|
print("initial state: fd=%s, cloexec=%s" % (fd, get_cloexec(fd)))
|
||||||
|
|
||||||
|
|
||||||
|
pid = os.fork()
|
||||||
|
if not pid:
|
||||||
|
set_cloexec(fd)
|
||||||
|
print("child process after fork, set cloexec: cloexec=%s" % get_cloexec(fd))
|
||||||
|
child_argv = [sys.executable, __file__, str(fd),
|
||||||
|
'child process after exec']
|
||||||
|
os.execv(child_argv[0], child_argv)
|
||||||
|
|
||||||
|
os.waitpid(pid, 0)
|
||||||
|
print("parent process after fork: cloexec=%s" % get_cloexec(fd))
|
||||||
|
child_argv = [sys.executable, __file__, str(fd),
|
||||||
|
'parent process after exec']
|
||||||
|
os.execv(child_argv[0], child_argv)
|
||||||
|
|
||||||
|
def after_exec():
|
||||||
|
fd = int(sys.argv[1])
|
||||||
|
name = sys.argv[2]
|
||||||
|
print("%s: fd=%s, cloexec=%s"
|
||||||
|
% (name, fd, get_cloexec(fd)))
|
||||||
|
sys.exit()
|
||||||
|
|
||||||
|
if __name__ == "__main__":
|
||||||
|
if len(sys.argv) == 1:
|
||||||
|
main()
|
||||||
|
else:
|
||||||
|
after_exec()
|
||||||
|
|
118
pep-3150.txt
118
pep-3150.txt
|
@ -19,9 +19,11 @@ This PEP proposes the addition of an optional ``given`` clause to several
|
||||||
Python statements that do not currently have an associated code suite. This
|
Python statements that do not currently have an associated code suite. This
|
||||||
clause will create a statement local namespace for additional names that are
|
clause will create a statement local namespace for additional names that are
|
||||||
accessible in the associated statement, but do not become part of the
|
accessible in the associated statement, but do not become part of the
|
||||||
containing namespace. To permit a sane implementation strategy, forward
|
containing namespace.
|
||||||
references to names from the ``given`` clause will need to be marked
|
|
||||||
explicitly.
|
Adoption of a new symbol, ``?``, is proposed to denote a forward reference
|
||||||
|
to the namespace created by running the associated code suite. It will be
|
||||||
|
a reference to a ``types.SimpleNamespace`` object.
|
||||||
|
|
||||||
The primary motivation is to enable a more declarative style of programming,
|
The primary motivation is to enable a more declarative style of programming,
|
||||||
where the operation to be performed is presented to the reader first, and the
|
where the operation to be performed is presented to the reader first, and the
|
||||||
|
@ -72,12 +74,16 @@ The ``given`` clause would allow subexpressions to be referenced by
|
||||||
name in the header line, with the actual definitions following in
|
name in the header line, with the actual definitions following in
|
||||||
the indented clause. As a simple example::
|
the indented clause. As a simple example::
|
||||||
|
|
||||||
sorted_data = sorted(data, key=.sort_key) given:
|
sorted_data = sorted(data, key=?.sort_key) given:
|
||||||
def sort_key(item):
|
def sort_key(item):
|
||||||
return item.attr1, item.attr2
|
return item.attr1, item.attr2
|
||||||
|
|
||||||
The leading ``.`` on ``.sort_key`` indicates to the compiler that this
|
The new symbol ``?`` is used to refer to the given namespace. It would be a
|
||||||
is a forward reference to a name defined in the ``given`` clause.
|
``types.SimpleNamespace`` instance, so ``?.sort_key`` functions as
|
||||||
|
a forward reference to a name defined in the ``given`` clause.
|
||||||
|
|
||||||
|
A docstring would be permitted in the given clause, and would be attached
|
||||||
|
to the result namespace as its ``__doc__`` attribute.
|
||||||
|
|
||||||
The ``pass`` statement is included to provide a consistent way to skip
|
The ``pass`` statement is included to provide a consistent way to skip
|
||||||
inclusion of a meaningful expression in the header line. While this is not
|
inclusion of a meaningful expression in the header line. While this is not
|
||||||
|
@ -94,7 +100,7 @@ binding operations in the header line::
|
||||||
# Explicit early binding via given clause
|
# Explicit early binding via given clause
|
||||||
seq = []
|
seq = []
|
||||||
for i in range(10):
|
for i in range(10):
|
||||||
seq.append(.f) given i=i:
|
seq.append(.f) given i=i in:
|
||||||
def f():
|
def f():
|
||||||
return i
|
return i
|
||||||
assert [f() for f in seq] == list(range(10))
|
assert [f() for f in seq] == list(range(10))
|
||||||
|
@ -105,7 +111,7 @@ Semantics
|
||||||
|
|
||||||
The following statement::
|
The following statement::
|
||||||
|
|
||||||
op(.f, .g) given bound_a=a, bound_b=b:
|
op(?.f, ?.g) given bound_a=a, bound_b=b in:
|
||||||
def f():
|
def f():
|
||||||
return bound_a + bound_b
|
return bound_a + bound_b
|
||||||
def g():
|
def g():
|
||||||
|
@ -121,9 +127,10 @@ hidden compiler variable or simply an entry on the interpreter stack)::
|
||||||
return bound_a + bound_b
|
return bound_a + bound_b
|
||||||
def g():
|
def g():
|
||||||
return bound_a - bound_b
|
return bound_a - bound_b
|
||||||
return f, g
|
return types.SimpleNamespace(**locals())
|
||||||
__ref1, __ref2 = __scope(__arg1)
|
__ref = __scope(__arg1, __arg2)
|
||||||
op(__ref1, __ref2)
|
__ref.__doc__ = __scope.__doc__
|
||||||
|
op(__ref.f, __ref.g)
|
||||||
|
|
||||||
A ``given`` clause is essentially a nested function which is created and
|
A ``given`` clause is essentially a nested function which is created and
|
||||||
then immediately executed. Unless explicitly passed in, names are looked
|
then immediately executed. Unless explicitly passed in, names are looked
|
||||||
|
@ -158,7 +165,7 @@ New::
|
||||||
yield_stmt: yield_expr [given_clause]
|
yield_stmt: yield_expr [given_clause]
|
||||||
raise_stmt: 'raise' [test ['from' test]] [given_clause]
|
raise_stmt: 'raise' [test ['from' test]] [given_clause]
|
||||||
assert_stmt: 'assert' test [',' test] [given_clause]
|
assert_stmt: 'assert' test [',' test] [given_clause]
|
||||||
given_clause: "given" (NAME '=' test)* ":" suite
|
given_clause: "given" [(NAME '=' test)+ "in"]":" suite
|
||||||
|
|
||||||
(Note that ``expr_stmt`` in the grammar is a slight misnomer, as it covers
|
(Note that ``expr_stmt`` in the grammar is a slight misnomer, as it covers
|
||||||
assignment and augmented assignment in addition to simple expression
|
assignment and augmented assignment in addition to simple expression
|
||||||
|
@ -207,7 +214,7 @@ For reference, here are the current definitions at that level::
|
||||||
flow_stmt: break_stmt | continue_stmt | return_stmt | raise_stmt | yield_stmt
|
flow_stmt: break_stmt | continue_stmt | return_stmt | raise_stmt | yield_stmt
|
||||||
|
|
||||||
In addition to the above changes, the definition of ``atom`` would be changed
|
In addition to the above changes, the definition of ``atom`` would be changed
|
||||||
to also allow ``"." NAME``. The restriction of this usage to statements with
|
to also allow ``?``. The restriction of this usage to statements with
|
||||||
an associated ``given`` clause would be handled by a later stage of the
|
an associated ``given`` clause would be handled by a later stage of the
|
||||||
compilation process (likely AST construction, which already enforces
|
compilation process (likely AST construction, which already enforces
|
||||||
other restrictions where the grammar is overly permissive in order to
|
other restrictions where the grammar is overly permissive in order to
|
||||||
|
@ -277,13 +284,14 @@ without reading the body of the suite.
|
||||||
However, while they are the initial motivating use case, limiting this
|
However, while they are the initial motivating use case, limiting this
|
||||||
feature solely to simple assignments would be overly restrictive. Once the
|
feature solely to simple assignments would be overly restrictive. Once the
|
||||||
feature is defined at all, it would be quite arbitrary to prevent its use
|
feature is defined at all, it would be quite arbitrary to prevent its use
|
||||||
for augmented assignments, return statements, yield expressions and
|
for augmented assignments, return statements, yield expressions,
|
||||||
arbitrary expressions that may modify the application state.
|
comprehensions and arbitrary expressions that may modify the
|
||||||
|
application state.
|
||||||
|
|
||||||
The ``given`` clause may also function as a more readable
|
The ``given`` clause may also function as a more readable
|
||||||
alternative to some uses of lambda expressions and similar
|
alternative to some uses of lambda expressions and similar
|
||||||
constructs when passing one-off functions to operations
|
constructs when passing one-off functions to operations
|
||||||
like ``sorted()``.
|
like ``sorted()`` or in callback based event-driven programming.
|
||||||
|
|
||||||
In module and class level code, the ``given`` clause will serve as a
|
In module and class level code, the ``given`` clause will serve as a
|
||||||
clear and reliable replacement for usage of the ``del`` statement to keep
|
clear and reliable replacement for usage of the ``del`` statement to keep
|
||||||
|
@ -350,7 +358,7 @@ container comprehensions::
|
||||||
|
|
||||||
# would be equivalent to
|
# would be equivalent to
|
||||||
|
|
||||||
seq2 = .result given seq=seq:
|
seq2 = ?.result given seq=seq:
|
||||||
result = []
|
result = []
|
||||||
for y in seq:
|
for y in seq:
|
||||||
if p(y):
|
if p(y):
|
||||||
|
@ -367,7 +375,7 @@ Not that, unlike PEP 403, the current version of this PEP *cannot*
|
||||||
provide a precisely equivalent expansion for a generator expression. The
|
provide a precisely equivalent expansion for a generator expression. The
|
||||||
closest it can get is to define an additional level of scoping::
|
closest it can get is to define an additional level of scoping::
|
||||||
|
|
||||||
seq2 = .g(seq) given:
|
seq2 = ?.g(seq) given:
|
||||||
def g(seq):
|
def g(seq):
|
||||||
for y in seq:
|
for y in seq:
|
||||||
if p(y):
|
if p(y):
|
||||||
|
@ -375,6 +383,22 @@ closest it can get is to define an additional level of scoping::
|
||||||
if q(x):
|
if q(x):
|
||||||
yield x
|
yield x
|
||||||
|
|
||||||
|
This limitation could be remedied by permitting the given clause to be
|
||||||
|
a generator function, in which case ? would refer to a generator-iterator
|
||||||
|
object rather than a simple namespace::
|
||||||
|
|
||||||
|
seq2 = ? given seq=seq in:
|
||||||
|
for y in seq:
|
||||||
|
if p(y):
|
||||||
|
for x in y:
|
||||||
|
if q(x):
|
||||||
|
yield x
|
||||||
|
|
||||||
|
However, this would make the meaning of "?" quite ambiguous, even more so
|
||||||
|
than is already the case for the meaning of ``def`` statements (which will
|
||||||
|
usually have a docstring indicating whether or not a function definition is
|
||||||
|
actually a generator)
|
||||||
|
|
||||||
Explaining Decorator Clause Evaluation and Application
|
Explaining Decorator Clause Evaluation and Application
|
||||||
------------------------------------------------------
|
------------------------------------------------------
|
||||||
|
|
||||||
|
@ -477,13 +501,18 @@ what the language allows them to express.
|
||||||
I believe the proposal in this PEP would finally let Python get close to the
|
I believe the proposal in this PEP would finally let Python get close to the
|
||||||
"executable pseudocode" bar for the kind of thought expressed above::
|
"executable pseudocode" bar for the kind of thought expressed above::
|
||||||
|
|
||||||
sorted_list = sorted(original, key=.sort_key) given:
|
sorted_list = sorted(original, key=?.key) given:
|
||||||
def sort_key(item):
|
def key(item):
|
||||||
return item.attr1, item.attr2
|
return item.attr1, item.attr2
|
||||||
|
|
||||||
Everything is in the same order as it was in the user's original thought, the
|
Everything is in the same order as it was in the user's original thought, and
|
||||||
only addition they have to make is to give the sorting criteria a name so that
|
they don't even need to come up with a name for the sorting criteria: it is
|
||||||
the usage can be linked up to the subsequent definition.
|
possible to reuse the keyword argument name directly.
|
||||||
|
|
||||||
|
A possible enhancement to those proposal would be to provide a convenient
|
||||||
|
shorthand syntax to say "use the given clause contents as keyword
|
||||||
|
arguments". Even without dedicated syntax, that can be written simply as
|
||||||
|
``**vars(?)``.
|
||||||
|
|
||||||
|
|
||||||
Harmful to Introspection
|
Harmful to Introspection
|
||||||
|
@ -516,7 +545,7 @@ world code is genuinely enhanced.
|
||||||
This is more of a deficiency in the PEP rather than the idea, though. If
|
This is more of a deficiency in the PEP rather than the idea, though. If
|
||||||
it wasn't a real world problem, we wouldn't get so many complaints about
|
it wasn't a real world problem, we wouldn't get so many complaints about
|
||||||
the lack of multi-line lambda support and Ruby's block construct
|
the lack of multi-line lambda support and Ruby's block construct
|
||||||
probaly wouldn't be quite so popular.
|
probably wouldn't be quite so popular.
|
||||||
|
|
||||||
|
|
||||||
Open Questions
|
Open Questions
|
||||||
|
@ -525,9 +554,12 @@ Open Questions
|
||||||
Syntax for Forward References
|
Syntax for Forward References
|
||||||
-----------------------------
|
-----------------------------
|
||||||
|
|
||||||
The leading ``.`` arguably fails the "syntax shall not look like grit on
|
The ``?`` symbol is proposed for forward references to the given namespace
|
||||||
Uncle Tim's monitor" test. However, it does have the advantages of being
|
as it is short, currently unused and suggests "there's something missing
|
||||||
easy to type and already having an association with namespaces.
|
here that will be filled in later".
|
||||||
|
|
||||||
|
The proposal in the PEP doesn't neatly parallel any existing Python feature,
|
||||||
|
so reusing an already used symbol has been deliberately avoided.
|
||||||
|
|
||||||
|
|
||||||
Handling of ``nonlocal`` and ``global``
|
Handling of ``nonlocal`` and ``global``
|
||||||
|
@ -541,8 +573,8 @@ Alternatively, they could be defined as operating as if the anonymous
|
||||||
functions were defined as in the expansion above.
|
functions were defined as in the expansion above.
|
||||||
|
|
||||||
|
|
||||||
Detailed Semantics #3: Handling of ``break`` and ``continue``
|
Handling of ``break`` and ``continue``
|
||||||
-------------------------------------------------------------
|
--------------------------------------
|
||||||
|
|
||||||
``break`` and ``continue`` will operate as if the anonymous functions were
|
``break`` and ``continue`` will operate as if the anonymous functions were
|
||||||
defined as in the expansion above. They will be syntax errors if they occur
|
defined as in the expansion above. They will be syntax errors if they occur
|
||||||
|
@ -561,6 +593,25 @@ they appear within a ``def`` statement within that suite.
|
||||||
Examples
|
Examples
|
||||||
========
|
========
|
||||||
|
|
||||||
|
Defining callbacks for event driven programming::
|
||||||
|
|
||||||
|
# Current Python (definition before use)
|
||||||
|
def cb(sock):
|
||||||
|
# Do something with socket
|
||||||
|
def eb(exc):
|
||||||
|
logging.exception(
|
||||||
|
"Failed connecting to %s:%s", host, port)
|
||||||
|
loop.create_connection((host, port), cb, eb) given:
|
||||||
|
|
||||||
|
# Becomes:
|
||||||
|
loop.create_connection((host, port), ?.cb, ?.eb) given:
|
||||||
|
def cb(sock):
|
||||||
|
# Do something with socket
|
||||||
|
def eb(exc):
|
||||||
|
logging.exception(
|
||||||
|
"Failed connecting to %s:%s", host, port)
|
||||||
|
|
||||||
|
|
||||||
Defining "one-off" classes which typically only have a single instance::
|
Defining "one-off" classes which typically only have a single instance::
|
||||||
|
|
||||||
# Current Python (instantiation after definition)
|
# Current Python (instantiation after definition)
|
||||||
|
@ -579,7 +630,7 @@ Defining "one-off" classes which typically only have a single instance::
|
||||||
... # However many lines
|
... # However many lines
|
||||||
|
|
||||||
# Becomes:
|
# Becomes:
|
||||||
public_name = .MeaningfulClassName(*params) given:
|
public_name = ?.MeaningfulClassName(*params) given:
|
||||||
class MeaningfulClassName():
|
class MeaningfulClassName():
|
||||||
... # Should trawl the stdlib for an example of doing this
|
... # Should trawl the stdlib for an example of doing this
|
||||||
|
|
||||||
|
@ -593,7 +644,7 @@ Calculating attributes without polluting the local namespace (from os.py)::
|
||||||
del _createenviron
|
del _createenviron
|
||||||
|
|
||||||
# Becomes:
|
# Becomes:
|
||||||
environ = ._createenviron() given:
|
environ = ?._createenviron() given:
|
||||||
def _createenviron():
|
def _createenviron():
|
||||||
... # 27 line function
|
... # 27 line function
|
||||||
|
|
||||||
|
@ -606,7 +657,7 @@ Replacing default argument hack (from functools.lru_cache)::
|
||||||
return decorating_function
|
return decorating_function
|
||||||
|
|
||||||
# Becomes:
|
# Becomes:
|
||||||
return .decorating_function given:
|
return ?.decorating_function given:
|
||||||
# Cell variables rather than locals, but should give similar speedup
|
# Cell variables rather than locals, but should give similar speedup
|
||||||
tuple, sorted, len, KeyError = tuple, sorted, len, KeyError
|
tuple, sorted, len, KeyError = tuple, sorted, len, KeyError
|
||||||
def decorating_function(user_function):
|
def decorating_function(user_function):
|
||||||
|
@ -701,6 +752,9 @@ References
|
||||||
.. [9] Possible PEP 3150 style guidelines (#2):
|
.. [9] Possible PEP 3150 style guidelines (#2):
|
||||||
http://mail.python.org/pipermail/python-ideas/2011-October/012341.html
|
http://mail.python.org/pipermail/python-ideas/2011-October/012341.html
|
||||||
|
|
||||||
|
.. [10] Multi-line lambdas (again!)
|
||||||
|
http://mail.python.org/pipermail/python-ideas/2013-August/022526.html
|
||||||
|
|
||||||
Copyright
|
Copyright
|
||||||
=========
|
=========
|
||||||
|
|
||||||
|
|
30
pep-3156.txt
30
pep-3156.txt
|
@ -846,6 +846,12 @@ public API is as follows, indicating the differences with PEP 3148:
|
||||||
convention from the section "Callback Style" below) is always called
|
convention from the section "Callback Style" below) is always called
|
||||||
with a single argument, the Future object.
|
with a single argument, the Future object.
|
||||||
|
|
||||||
|
- ``remove_done_callback(fn)``. Remove the argument from the list of
|
||||||
|
callbacks. This method is not defined by PEP 3148. The argument
|
||||||
|
must be equal (using ``==``) to the argument passed to
|
||||||
|
``add_done_callback()``. Returns the number of times the callback
|
||||||
|
was removed.
|
||||||
|
|
||||||
- ``set_result(result)``. The Future must not be done (nor cancelled)
|
- ``set_result(result)``. The Future must not be done (nor cancelled)
|
||||||
already. This makes the Future done and schedules the callbacks.
|
already. This makes the Future done and schedules the callbacks.
|
||||||
Difference with PEP 3148: This is a public API.
|
Difference with PEP 3148: This is a public API.
|
||||||
|
@ -1302,25 +1308,25 @@ package are provided:
|
||||||
|
|
||||||
- ``FIRST_EXCEPTION``: Wait until at least one Future is done (not
|
- ``FIRST_EXCEPTION``: Wait until at least one Future is done (not
|
||||||
cancelled) with an exception set. (The exclusion of cancelled
|
cancelled) with an exception set. (The exclusion of cancelled
|
||||||
Futures from the filter is surprising, but PEP 3148 does it this
|
Futures from the condition is surprising, but PEP 3148 does it
|
||||||
way.)
|
this way.)
|
||||||
|
|
||||||
- ``tulip.as_completed(fs, timeout=None)``. Returns an iterator whose
|
- ``tulip.as_completed(fs, timeout=None)``. Returns an iterator whose
|
||||||
values are Futures; waiting for successive values waits until the
|
values are Futures or coroutines; waiting for successive values
|
||||||
next Future or coroutine from the set ``fs`` completes, and returns
|
waits until the next Future or coroutine from the set ``fs``
|
||||||
its result (or raises its exception). The optional argument
|
completes, and returns its result (or raises its exception). The
|
||||||
``timeout`` has the same meaning and default as it does for
|
optional argument ``timeout`` has the same meaning and default as it
|
||||||
``concurrent.futures.wait()``: when the timeout occurs, the next
|
does for ``concurrent.futures.wait()``: when the timeout occurs, the
|
||||||
Future returned by the iterator will raise ``TimeoutError`` when
|
next Future returned by the iterator will raise ``TimeoutError``
|
||||||
waited for. Example of use::
|
when waited for. Example of use::
|
||||||
|
|
||||||
for f in as_completed(fs):
|
for f in as_completed(fs):
|
||||||
result = yield from f # May raise an exception.
|
result = yield from f # May raise an exception.
|
||||||
# Use result.
|
# Use result.
|
||||||
|
|
||||||
Note: if you do not wait for the futures as they are produced by the
|
Note: if you do not wait for the values produced by the iterator,
|
||||||
iterator, your ``for`` loop may not make progress (since you are not
|
your ``for`` loop may not make progress (since you are not allowing
|
||||||
allowing other tasks to run).
|
other tasks to run).
|
||||||
|
|
||||||
Sleeping
|
Sleeping
|
||||||
--------
|
--------
|
||||||
|
|
|
@ -202,7 +202,7 @@ def fixfile(inpath, input_lines, outfile):
|
||||||
print >> outfile, '</td></tr></table>'
|
print >> outfile, '</td></tr></table>'
|
||||||
print >> outfile, '<div class="header">\n<table border="0">'
|
print >> outfile, '<div class="header">\n<table border="0">'
|
||||||
for k, v in header:
|
for k, v in header:
|
||||||
if k.lower() in ('author', 'discussions-to'):
|
if k.lower() in ('author', 'bdfl-delegate', 'discussions-to'):
|
||||||
mailtos = []
|
mailtos = []
|
||||||
for part in re.split(',\s*', v):
|
for part in re.split(',\s*', v):
|
||||||
if '@' in part:
|
if '@' in part:
|
||||||
|
|
Loading…
Reference in New Issue