Update from Lennart

This commit is contained in:
Brett Cannon 2013-01-25 12:32:59 -05:00
parent 7057ba616f
commit 039fc5d9da
1 changed files with 171 additions and 111 deletions

View File

@ -8,7 +8,7 @@ Status: Draft
Type: Standards Track
Content-Type: text/x-rst
Created: 11-Dec-2012
Post-History: 11-Dec-2012
Post-History: 11-Dec-2012, 28-Dec-2012
Abstract
@ -26,38 +26,41 @@ Concrete time zone support
--------------------------
The time zone support in Python has no concrete implementation in the
standard library, only a tzinfo baseclass, and since Python 3.2, one concrete
time zone: UTC. To properly support time zones you need to include a database
over all time zones, both current and historical, including daylight saving
changes. But such information changes frequently, so even if we include the
last information in a Python release, that information would be outdated just
a few months later.
standard library outside of a tzinfo baseclass that supports fixed offsets.
To properly support time zones you need to include a database over all time
zones, both current and historical, including daylight saving changes.
But such information changes frequently, so even if we include the last
information in a Python release, that information would be outdated just a
few months later.
Timezone support has therefore only been available through two third-party
Time zone support has therefore only been available through two third-party
modules, ``pytz`` and ``dateutil``, both who include and wrap the "zoneinfo"
database. This database, also called "tz" or "The Olsen database", is the
de-facto standard time zone database over time zones, and it is included in
most variants of Unix operating systems, including OS X.
most Unix and Unix-like operating systems, including OS X.
This gives us the opportunity to include only the code that supports the
zoneinfo data in the standard library, but by default use the operating
systems copy of the data, which typically will be kept updated by the
updating mechanism of the operating system or distribution.
This gives us the opportunity to include the code that supports the zoneinfo
data in the standard library, but by default use the operating system's copy
of the data, which typically will be kept updated by the updating mechanism
of the operating system or distribution.
For those who have an operating system that does not include the tz database,
for example Windows, a distribution containing the latest tz database should
also be available at the Python Package Index, so it can be easily installed
with the Python packaging tools such as ``easy_install`` or ``pip``. This
could also be done on Unices that are no longer receiving updates and
therefore have an outdated database.
For those who have an operating system that does not include the zoneinfo
database, for example Windows, the Python source distribution will include a
copy of the zoneinfo database, and a distribution containing the latest
zoneinfo database will also be available at the Python Package Index, so it
can be easily installed with the Python packaging tools such as
``easy_install`` or ``pip``. This could also be done on Unices that are no
longer receiving updates and therefore have an outdated database.
With such a mechanism Python would have full time zone support in the
standard library on most platforms, and a simple package installation would
provide time zone support on those platforms where the tz database isn't
included, such as Windows.
standard library on any platform, and a simple package installation would
provide an updated time zone database on those platforms where the zoneinfo
database isn't included, such as Windows, or on platforms where OS updates
are no longer provided.
The time zone support will be implemented by a new module called ``timezone``,
based on Stuart Bishop's ``pytz`` module.
The time zone support will be implemented by making the ``datetime`` module
into a package, and adding time zone support to ``datetime`` based on Stuart
Bishop's ``pytz`` module.
Getting the local time zone
@ -76,16 +79,23 @@ you don't know the name of the time zone. A function in ``datetime`` should
be provided to return the local time zone.
The support for this will be made by integrating Lennart Regebro's
``tzlocal`` module into the new ``timezone`` module.
``tzlocal`` module into the new ``datetime`` module.
For Windows it will look up the local Windows time zone name, and use a
mapping between Windows time zone names and zoneinfo time zone names provided
by the Unicode consortium to convert that to a zoneinfo time zone.
The mapping should be updated before each major or bugfix release, scripts
for doing so will be provided in the ``Tools/`` directory.
Ambiguous times
---------------
When changing over from daylight savings time (DST) the clock is turned back one
hour. This means that the times during that hour happens twice, once without
DST and then once with DST. Similarly, when changing to daylight savings
time, one hour goes missing.
When changing over from daylight savings time (DST) the clock is turned back
one hour. This means that the times during that hour happens twice, once
without DST and then once with DST. Similarly, when changing to daylight
savings time, one hour goes missing.
The current time zone API can not differentiate between the two ambiguous
times during a change from DST. For example, in Stockholm the time of
@ -96,10 +106,10 @@ The current time zone API can not disambiguate this and therefore it's
unclear which time should be returned::
# This could be either 00:00 or 01:00 UTC:
>>> dt = datetime(2012, 11, 28, 2, 0, tzinfo=timezone('Europe/Stockholm'))
>>> dt = datetime(2012, 10, 28, 2, 0, tzinfo=zoneinfo('Europe/Stockholm'))
# But we can not specify which:
>>> dt.astimezone(timezone('UTC'))
datetime.datetime(2012, 11, 28, 1, 0, tzinfo=<UTC>)
>>> dt.astimezone(zoneinfo('UTC'))
datetime.datetime(2012, 10, 28, 1, 0, tzinfo=<UTC>)
``pytz`` solved this problem by adding ``is_dst`` parameters to several
methods of the tzinfo objects to make it possible to disambiguate times when
@ -108,88 +118,131 @@ this is desired.
This PEP proposes to add these ``is_dst`` parameters to the relevant methods
of the ``datetime`` API, and therefore add this functionality directly to
``datetime``. This is likely the hardest part of this PEP as this involves
updating the
updating the C version of the ``datetime`` library with this functionality,
as this involved writing new code, and not just reorganizing existing
external libraries.
Implementation API
==================
The new ``timezone``-module
---------------------------
The zoneinfo database
---------------------
The public API of the new ``timezone``-module contains one new class, one new
function and one new exception.
The latest version of the zoneinfo database should exist in the
``Lib/tzdata`` directory of the Python source control system. This copy of
the database should be updated before every Python feature and bug-fix
release, but not for releases of Python versions that are in
security-fix-only-mode.
* New class: ``DstTzInfo``
Scripts to update the database will be provided in ``Tools/``, and the
release instructions will be updated to include this update.
This class provides a concrete implementation of the ``zoneinfo`` base
class that implements DST support.
New configure options ``--enable-internal-timezone-database`` and
``--disable-internal-timezone-database`` will be implemented to enable and
disable the installation of this database when installing from source. A
source install will default to installing them.
Binary installers for systems that have a system-provided zoneinfo database
may skip installing the included database since it would never be used for
these platforms. For other platforms, for example Windows, binary installers
must install the included database.
* New function :``get_timezone(name=None, db=None)``
Changes in the ``datetime``-module
----------------------------------
This function takes a name string that must be a string specifying a
valid zoneinfo timezone, i.e. "US/Eastern", "Europe/Warsaw" or "Etc/GMT+11".
If not given, the local timezone will be looked up. If an invalid zone name
is given, or the local timezone can not be retrieved, the function raises
`UnknownTimeZoneError`.
The public API of the new time zone support contains one new class, one new
function, one new exception and four new collections. In addition to this, several
methods on the datetime object gets a new ``is_dst`` parameter.
The function also takes an optional path to the location of the zoneinfo
database which should be used. If not specified, the function will check if
the `timezonedata` module is installed, and then use that location or
otherwise use the database in ``/usr/share/zoneinfo``.
New class ``DstTzInfo``
^^^^^^^^^^^^^^^^^^^^^^^^
If no database is found an ``UnknownTimeZoneError`` or subclass thereof will
be raised with a message explaining that no zoneinfo database can be found,
but that you can install one with the ``timezonedata`` package.
This class provides a concrete implementation of the ``zoneinfo`` base
class that implements DST support.
* New Exception: ``UnknownTimeZoneError``
New function ``zoneinfo(name=None, db_path=None)``
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
This exception is raised when giving a time zone specification that can't be
found::
This function takes a name string that must be a string specifying a
valid zoneinfo time zone, i.e. "US/Eastern", "Europe/Warsaw" or "Etc/GMT".
If not given, the local time zone will be looked up. If an invalid zone name
is given, or the local time zone can not be retrieved, the function raises
`UnknownTimeZoneError`.
The function also takes an optional path to the location of the zoneinfo
database which should be used. If not specified, the function will look for
databases in the following order:
1. Use the database in ``/usr/share/zoneinfo``, if it exists.
2. Check if the `tzdata-update` module is installed, and then use that
database.
3. Use the Python-provided database in ``Lib/tzdata``.
If no database is found an ``UnknownTimeZoneError`` or subclass thereof will
be raised with a message explaining that no zoneinfo database can be found,
but that you can install one with the ``tzdata-update`` package.
New parameter ``is_dst``
^^^^^^^^^^^^^^^^^^^^^^^^
A new ``is_dst`` parameter is added to several methods to handle time
ambiguity during DST changeovers.
* ``tzinfo.utcoffset(dt, is_dst=False)``
* ``tzinfo.dst(dt, is_dst=False)``
* ``tzinfo.tzname(dt, is_dst=False)``
* ``datetime.astimezone(tz, is_dst=False)``
The ``is_dst`` parameter can be ``False`` (default), ``True``, or ``None``.
``False`` will specify that the given datetime should be interpreted as not
happening during daylight savings time, i.e. that the time specified is after
the change from DST.
``True`` will specify that the given datetime should be interpreted as happening
during daylight savings time, i.e. that the time specified is before the change
from DST.
``None`` will raise an ``AmbiguousTimeError`` exception if the time specified
was during a DST change over. It will also raise a ``NonExistentTimeError``
if a time is specified during the "missing time" in a change to DST.
New exceptions
^^^^^^^^^^^^^^
* ``UnknownTimeZoneError``
This exception is a subclass of KeyError and raised when giving a time
zone specification that can't be found::
>>> datetime.Timezone('Europe/New_York')
Traceback (most recent call last):
...
UnknownTimeZoneError: There is no time zone called 'Europe/New_York'
* ``InvalidTimeError``
Changes in the ``datetime``-module
--------------------------------------
A new ``is_dst`` parameter is added to several of the `tzinfo` methods to
handle time ambiguity during DST changeovers.
* ``tzinfo.utcoffset(self, dt, is_dst=True)``
* ``tzinfo.dst(self, dt, is_dst=True)``
* ``tzinfo.tzname(self, dt, is_dst=True)``
The ``is_dst`` parameter can be ``True`` (default), ``False``, or ``None``.
``True`` will specify that the given datetime should be interpreted as
happening during daylight savings time, i.e. that the time specified is before
the change from DST.
``False`` will specify that the given datetime should be interpreted as not
happening during daylight savings time, i.e. that the time specified is after
the change from DST.
``None`` will raise an ``AmbiguousTimeError`` exception if the time specified
was during a DST change over. It will also raise a ``NonExistentTimeError``
if a time is specified during the "missing time" in a change to DST.
There are also two new exceptions:
This exception serves as a base for ``AmbiguousTimeError`` and
``NonExistentTimeError``, to enable you to trap these two separately. It
will subclass from ValueError, so that you can catch these errors together
with inputs like the 29th of February 2011.
* ``AmbiguousTimeError``
This exception is raised when giving a datetime specification that is
ambiguous while setting ``is_dst`` to None::
This exception is raised when giving a datetime specification that is ambiguous
while setting ``is_dst`` to None::
>>> datetime(2012, 11, 28, 2, 0, tzinfo=timezone('Europe/Stockholm'), is_dst=None)
>>>
>>> datetime(2012, 11, 28, 2, 0, tzinfo=zoneinfo('Europe/Stockholm'), is_dst=None)
>>>
Traceback (most recent call last):
...
AmbiguousTimeError: 2012-10-28 02:00:00 is ambiguous in time zone Europe/Stockholm
@ -197,17 +250,31 @@ There are also two new exceptions:
* ``NonExistentTimeError``
This exception is raised when giving a datetime specification that is
non-existent while setting ``is_dst`` to None::
This exception is raised when giving a datetime specification that is ambiguous
while setting ``is_dst`` to None::
>>> datetime(2012, 3, 25, 2, 0, tzinfo=timezone('Europe/Stockholm'), is_dst=None)
>>>
>>> datetime(2012, 3, 25, 2, 0, tzinfo=zoneinfo('Europe/Stockholm'), is_dst=None)
>>>
Traceback (most recent call last):
...
NonExistentTimeError: 2012-03-25 02:00:00 does not exist in time zone Europe/Stockholm
The ``timezonedata``-package
New collections
^^^^^^^^^^^^^^^
* ``all_timezones`` is the exhaustive list of the time zone names that can
be used, listed alphabethically.
* ``all_timezones_set`` is a set of the time zones in ``all_timezones``.
* ``common_timezones`` is a list of useful, current time zones, listed
alphabethically.
* ``common_timezones_set`` is a set of the time zones in ``common_timezones``.
The ``tzdata-update``-package
-----------------------------
The zoneinfo database will be packaged for easy installation with
@ -215,6 +282,10 @@ The zoneinfo database will be packaged for easy installation with
Python code, and will not contain any Python code except that which is needed
for installation.
It will be kept updated with the same tools as the internal database, but
released whenever the ``zoneinfo``-database is updated, and use the same
version schema.
Differences from the ``pytz`` API
=================================
@ -223,28 +294,15 @@ Differences from the ``pytz`` API
around that ``tzinfo`` doesn't have is_dst. When ``is_dst`` is
implemented directly in ``datetime.tzinfo`` they are no longer needed.
* The ``pytz`` method ``timezone()`` is instead called ``get_timezone()`` for
clarity.
* The ``timezone()`` function is called ``zoneinfo()`` to avoid clashing with
the ``timezone`` class introduced in Python 3.2.
* ``get_timezone()`` will return the local time zone if called without
arguments.
* ``zoneinfo()`` will return the local time zone if called without arguments.
* The class ``pytz.StaticTzInfo`` is there to provide the ``is_dst`` support
for static timezones. When ``is_dst`` support is included in
``datetime.tzinfo`` it is no longer needed.
* The class ``pytz.StaticTzInfo`` is there to provide the ``is_dst`` support for static
time zones. When ``is_dst`` support is included in ``datetime.tzinfo`` it is no longer needed.
Discussion
==========
Should the windows installer include the data package?
------------------------------------------------------
It has been suggested that the Windows installer should include the data
package. This would mean that an explicit installation would no longer be
needed on Windows. On the other hand, that would mean that many using Windows
would not be aware that the database quickly becomes outdated and would not
keep it updated.
* ``InvalidTimeError`` subclasses from ``ValueError``.
Resources
@ -256,6 +314,8 @@ Resources
* http://pypi.python.org/pypi/python-dateutil
* http://unicode.org/cldr/data/common/supplemental/windowsZones.xml
Copyright
=========