2014-07-24 07:37:56 -04:00
|
|
|
|
PEP: 475
|
|
|
|
|
Title: Retry system calls failing with EINTR
|
|
|
|
|
Version: $Revision$
|
|
|
|
|
Last-Modified: $Date$
|
|
|
|
|
Author: Charles-François Natali <cf.natali@gmail.com>, Victor Stinner <victor.stinner@gmail.com>
|
2014-10-29 06:42:14 -04:00
|
|
|
|
BDFL-Delegate: Antoine Pitrou <solipsis@pitrou.net>
|
2014-07-24 07:37:56 -04:00
|
|
|
|
Status: Draft
|
|
|
|
|
Type: Standards Track
|
|
|
|
|
Content-Type: text/x-rst
|
|
|
|
|
Created: 29-July-2014
|
|
|
|
|
Python-Version: 3.5
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Abstract
|
|
|
|
|
========
|
|
|
|
|
|
|
|
|
|
Retry system calls failing with the ``EINTR`` error and recompute
|
2014-07-29 17:16:04 -04:00
|
|
|
|
timeout if needed.
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Rationale
|
|
|
|
|
=========
|
|
|
|
|
|
|
|
|
|
Interrupted system calls
|
|
|
|
|
------------------------
|
|
|
|
|
|
|
|
|
|
On POSIX systems, signals are common. Your program must be prepared to
|
|
|
|
|
handle them. Examples of signals:
|
|
|
|
|
|
|
|
|
|
* The most common signal is ``SIGINT``, signal sent when CTRL+c is
|
|
|
|
|
pressed. By default, Python raises a ``KeyboardInterrupt`` exception
|
|
|
|
|
when this signal is received.
|
|
|
|
|
* When running subprocesses, the ``SIGCHLD`` signal is sent when a
|
|
|
|
|
child process exits.
|
|
|
|
|
* Resizing the terminal sends the ``SIGWINCH`` signal to the
|
|
|
|
|
applications running in the terminal.
|
|
|
|
|
* Putting the application in background (ex: press CTRL-z and then
|
|
|
|
|
type the ``bg`` command) sends the ``SIGCONT`` signal.
|
|
|
|
|
|
2014-07-29 16:10:53 -04:00
|
|
|
|
Writing a signal handler is difficult, only "async-signal safe"
|
|
|
|
|
functions can be called. For example, ``printf()`` and ``malloc()``
|
|
|
|
|
are not async-signal safe. When a signal is sent to a process calling
|
|
|
|
|
a system call, the system call can fail with the ``EINTR`` error to
|
|
|
|
|
give the program an opportunity to handle the signal without the
|
|
|
|
|
restriction on signal safe functions. Depending on the platform, on
|
|
|
|
|
the system call and the ``SA_RESTART`` flag, the system call may or
|
|
|
|
|
may not fail with ``EINTR``.
|
|
|
|
|
|
|
|
|
|
If the signal handler was set with the ``SA_RESTART`` flag set, the
|
|
|
|
|
kernel retries some the system call instead of failing with
|
2014-07-24 07:37:56 -04:00
|
|
|
|
``EINTR``. For example, ``read()`` is retried, whereas ``select()`` is
|
|
|
|
|
not retried. The Python function ``signal.signal()`` clears the
|
|
|
|
|
``SA_RESTART`` flag when setting the signal handler: all system calls
|
|
|
|
|
should fail with ``EINTR`` in Python.
|
|
|
|
|
|
|
|
|
|
The problem is that handling ``EINTR`` should be done for all system
|
|
|
|
|
calls. The problem is similar to handling errors in the C language
|
|
|
|
|
which does not have exceptions: you must check all function returns to
|
|
|
|
|
check for error, and usually duplicate the code checking for errors.
|
|
|
|
|
Python does not have this issue, it uses exceptions to notify errors.
|
|
|
|
|
|
|
|
|
|
|
2014-10-28 17:13:38 -04:00
|
|
|
|
Status in Python 3.4
|
|
|
|
|
--------------------
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
2014-10-28 17:13:38 -04:00
|
|
|
|
In Python 3.4, the code to handle the ``InterruptedError``
|
2014-07-24 07:37:56 -04:00
|
|
|
|
exception (``EINTR`` error) is duplicated on case by case. Only a few
|
|
|
|
|
Python modules handle this exception, and fixes usually took several
|
|
|
|
|
years to cover a whole module. Example of code retrying
|
|
|
|
|
``file.read()`` on ``InterruptedError``::
|
|
|
|
|
|
|
|
|
|
while True:
|
|
|
|
|
try:
|
2014-07-28 06:50:14 -04:00
|
|
|
|
data = file.read(size)
|
2014-07-24 07:37:56 -04:00
|
|
|
|
break
|
|
|
|
|
except InterruptedError:
|
|
|
|
|
continue
|
|
|
|
|
|
|
|
|
|
List of Python modules of the standard library which handle
|
|
|
|
|
``InterruptedError``:
|
|
|
|
|
|
|
|
|
|
* ``asyncio``
|
|
|
|
|
* ``asyncore``
|
|
|
|
|
* ``io``, ``_pyio``
|
|
|
|
|
* ``multiprocessing``
|
|
|
|
|
* ``selectors``
|
|
|
|
|
* ``socket``
|
|
|
|
|
* ``socketserver``
|
|
|
|
|
* ``subprocess``
|
|
|
|
|
|
2014-07-29 16:10:53 -04:00
|
|
|
|
Other programming languages like Perl, Java and Go already retry
|
|
|
|
|
system calls failing with ``EINTR``.
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
2014-07-29 16:10:53 -04:00
|
|
|
|
|
|
|
|
|
Use Case 1: Don't Bother With Signals
|
|
|
|
|
-------------------------------------
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
|
|
|
|
In most cases, you don't want to be interrupted by signals and you
|
|
|
|
|
don't expect to get ``InterruptedError`` exceptions. For example, do
|
|
|
|
|
you really want to write such complex code for an "Hello World"
|
|
|
|
|
example?
|
|
|
|
|
|
|
|
|
|
::
|
|
|
|
|
|
|
|
|
|
while True:
|
|
|
|
|
try:
|
|
|
|
|
print("Hello World")
|
2014-07-24 16:33:23 -04:00
|
|
|
|
break
|
2014-07-24 07:37:56 -04:00
|
|
|
|
except InterruptedError:
|
2014-07-24 16:33:23 -04:00
|
|
|
|
continue
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
|
|
|
|
``InterruptedError`` can happen in unexpected places. For example,
|
|
|
|
|
``os.close()`` and ``FileIO.close()`` can raises ``InterruptedError``:
|
|
|
|
|
see the article `close() and EINTR
|
|
|
|
|
<http://alobbs.com/post/54503240599/close-and-eintr>`_.
|
|
|
|
|
|
|
|
|
|
The `Python issues related to EINTR`_ section below gives examples of
|
|
|
|
|
bugs caused by "EINTR".
|
|
|
|
|
|
|
|
|
|
The expectation is that Python hides the ``InterruptedError``: retry
|
|
|
|
|
system calls failing with the ``EINTR`` error.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Use Case 2: Be notified of signals as soon as possible
|
|
|
|
|
------------------------------------------------------
|
|
|
|
|
|
|
|
|
|
Sometimes, you expect some signals and you want to handle them as soon
|
|
|
|
|
as possible. For example, you may want to quit immediatly a program
|
|
|
|
|
using the ``CTRL+c`` keyboard shortcut.
|
|
|
|
|
|
|
|
|
|
Some signals are not interesting and should not interrupt the the
|
|
|
|
|
application. There are two options to only interrupt an application
|
|
|
|
|
on some signals:
|
|
|
|
|
|
|
|
|
|
* Raise an exception in the signal handler, like ``KeyboardInterrupt`` for
|
|
|
|
|
``SIGINT``
|
|
|
|
|
* Use a I/O multiplexing function like ``select()`` with the Python
|
|
|
|
|
signal "wakeup" file descriptor: see the function
|
2014-08-27 17:17:15 -04:00
|
|
|
|
``signal.set_wakeup_fd()``.
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Proposition
|
|
|
|
|
===========
|
|
|
|
|
|
|
|
|
|
If a system call fails with ``EINTR``, Python must call signal
|
|
|
|
|
handlers: call ``PyErr_CheckSignals()``. If a signal handler raises
|
|
|
|
|
an exception, the Python function fails with the exception.
|
|
|
|
|
Otherwise, the system call is retried. If the system call takes a
|
|
|
|
|
timeout parameter, the timeout is recomputed.
|
|
|
|
|
|
|
|
|
|
Modified functions
|
|
|
|
|
------------------
|
|
|
|
|
|
|
|
|
|
Example of functions that need to be modified:
|
|
|
|
|
|
|
|
|
|
* ``os.read()``, ``io.FileIO.read()``, ``io.FileIO.readinto()``
|
|
|
|
|
* ``os.write()``, ``io.FileIO.write()``
|
|
|
|
|
* ``os.waitpid()``
|
|
|
|
|
* ``socket.accept()``
|
|
|
|
|
* ``socket.connect()``
|
|
|
|
|
* ``socket.recv()``, ``socket.recv_into()``
|
|
|
|
|
* ``socket.recv_from()``
|
|
|
|
|
* ``socket.send()``
|
|
|
|
|
* ``socket.sendto()``
|
|
|
|
|
* ``time.sleep()``
|
|
|
|
|
* ``select.select()``
|
|
|
|
|
* ``select.poll()``
|
|
|
|
|
* ``select.epoll.poll()``
|
|
|
|
|
* ``select.devpoll.poll()``
|
|
|
|
|
* ``select.kqueue.control()``
|
|
|
|
|
* ``selectors.SelectSelector.select()`` and other selector classes
|
|
|
|
|
|
|
|
|
|
Note: The ``selector`` module already retries on ``InterruptedError``, but it
|
|
|
|
|
doesn't recompute the timeout yet.
|
|
|
|
|
|
|
|
|
|
|
2014-10-28 17:00:32 -04:00
|
|
|
|
InterruptedError
|
|
|
|
|
----------------
|
|
|
|
|
|
|
|
|
|
Since interrupted system calls are automatically retried, the
|
|
|
|
|
``InterruptedError`` exception should not occur anymore. The code handling
|
|
|
|
|
``InterruptedError`` can be removed from in the standard library to simply the
|
|
|
|
|
code.
|
|
|
|
|
|
|
|
|
|
|
2014-07-24 07:37:56 -04:00
|
|
|
|
Backward Compatibility
|
|
|
|
|
======================
|
|
|
|
|
|
|
|
|
|
Applications relying on the fact that system calls are interrupted
|
|
|
|
|
with ``InterruptedError`` will hang. The authors of this PEP don't
|
|
|
|
|
think that such application exist.
|
|
|
|
|
|
2014-07-29 16:10:53 -04:00
|
|
|
|
If such applications exist, they are not portable and are subject to
|
|
|
|
|
race conditions (deadlock if the signal comes before the system call).
|
|
|
|
|
These applications must be fixed to handle signals differently, to
|
|
|
|
|
have a reliable behaviour on all platforms and all Python versions.
|
2014-08-27 17:17:15 -04:00
|
|
|
|
For example, use a signal handler which raises an exception, or use a
|
2014-07-29 16:10:53 -04:00
|
|
|
|
wakeup file descriptor.
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
|
|
|
|
For applications using event loops, ``signal.set_wakeup_fd()`` is the
|
|
|
|
|
recommanded option to handle signals. The signal handler writes signal
|
|
|
|
|
numbers into the file descriptor and the event loop is awaken to read
|
2014-08-27 17:17:15 -04:00
|
|
|
|
them. The event loop can handle these signals without the restriction
|
|
|
|
|
of signal handlers.
|
2014-07-24 07:37:56 -04:00
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Appendix
|
|
|
|
|
========
|
|
|
|
|
|
|
|
|
|
Wakeup file descriptor
|
|
|
|
|
----------------------
|
|
|
|
|
|
|
|
|
|
Since Python 3.3, ``signal.set_wakeup_fd()`` writes the signal number
|
|
|
|
|
into the file descriptor, whereas it only wrote a null byte before.
|
|
|
|
|
It becomes possible to handle different signals using the wakeup file
|
|
|
|
|
descriptor.
|
|
|
|
|
|
|
|
|
|
Linux has a ``signalfd()`` which provides more information on each
|
|
|
|
|
signal. For example, it's possible to know the pid and uid who sent
|
|
|
|
|
the signal. This function is not exposed in Python yet (see the
|
|
|
|
|
`issue 12304 <http://bugs.python.org/issue12304>`_).
|
|
|
|
|
|
|
|
|
|
On Unix, the ``asyncio`` module uses the wakeup file descriptor to
|
|
|
|
|
wake up its event loop.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Multithreading
|
|
|
|
|
--------------
|
|
|
|
|
|
|
|
|
|
A C signal handler can be called from any thread, but the Python
|
|
|
|
|
signal handler should only be called in the main thread.
|
|
|
|
|
|
|
|
|
|
Python has a ``PyErr_SetInterrupt()`` function which calls the
|
|
|
|
|
``SIGINT`` signal handler to interrupt the Python main thread.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Signals on Windows
|
|
|
|
|
------------------
|
|
|
|
|
|
|
|
|
|
Control events
|
|
|
|
|
^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
|
|
Windows uses "control events":
|
|
|
|
|
|
|
|
|
|
* ``CTRL_BREAK_EVENT``: Break (``SIGBREAK``)
|
|
|
|
|
* ``CTRL_CLOSE_EVENT``: Close event
|
|
|
|
|
* ``CTRL_C_EVENT``: CTRL+C (``SIGINT``)
|
|
|
|
|
* ``CTRL_LOGOFF_EVENT``: Logoff
|
|
|
|
|
* ``CTRL_SHUTDOWN_EVENT``: Shutdown
|
|
|
|
|
|
|
|
|
|
The `SetConsoleCtrlHandler() function
|
|
|
|
|
<http://msdn.microsoft.com/en-us/library/windows/desktop/ms686016%28v=vs.85%29.aspx>`_
|
|
|
|
|
can be used to install a control handler.
|
|
|
|
|
|
|
|
|
|
The ``CTRL_C_EVENT`` and ``CTRL_BREAK_EVENT`` events can be sent to a
|
|
|
|
|
process using the `GenerateConsoleCtrlEvent() function
|
|
|
|
|
<http://msdn.microsoft.com/en-us/library/windows/desktop/ms683155%28v=vs.85%29.aspx>`_.
|
|
|
|
|
This function is exposed in Python as ``os.kill()``.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Signals
|
|
|
|
|
^^^^^^^
|
|
|
|
|
|
|
|
|
|
The following signals are supported on Windows:
|
|
|
|
|
|
|
|
|
|
* ``SIGABRT``
|
|
|
|
|
* ``SIGBREAK`` (``CTRL_BREAK_EVENT``): signal only available on Windows
|
|
|
|
|
* ``SIGFPE``
|
|
|
|
|
* ``SIGILL``
|
|
|
|
|
* ``SIGINT`` (``CTRL_C_EVENT``)
|
|
|
|
|
* ``SIGSEGV``
|
|
|
|
|
* ``SIGTERM``
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
SIGINT
|
|
|
|
|
^^^^^^
|
|
|
|
|
|
|
|
|
|
The default Python signal handler for ``SIGINT`` sets a Windows event
|
|
|
|
|
object: ``sigint_event``.
|
|
|
|
|
|
|
|
|
|
``time.sleep()`` is implemented with ``WaitForSingleObjectEx()``, it
|
|
|
|
|
waits for the ``sigint_event`` object using ``time.sleep()`` parameter
|
|
|
|
|
as the timeout. So the sleep can be interrupted by ``SIGINT``.
|
|
|
|
|
|
|
|
|
|
``_winapi.WaitForMultipleObjects()`` automatically adds
|
|
|
|
|
``sigint_event`` to the list of watched handles, so it can also be
|
|
|
|
|
interrupted.
|
|
|
|
|
|
|
|
|
|
``PyOS_StdioReadline()`` also used ``sigint_event`` when ``fgets()``
|
|
|
|
|
failed to check if Ctrl-C or Ctrl-Z was pressed.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Links
|
|
|
|
|
-----
|
|
|
|
|
|
|
|
|
|
Misc
|
|
|
|
|
^^^^
|
|
|
|
|
|
|
|
|
|
* `glibc manual: Primitives Interrupted by Signals
|
|
|
|
|
<http://www.gnu.org/software/libc/manual/html_node/Interrupted-Primitives.html>`_
|
|
|
|
|
* `Bug #119097 for perl5: print returning EINTR in 5.14
|
|
|
|
|
<https://rt.perl.org/Public/Bug/Display.html?id=119097>`_.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Python issues related to EINTR
|
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
|
|
The main issue is: `handle EINTR in the stdlib
|
|
|
|
|
<http://bugs.python.org/issue18885>`_.
|
|
|
|
|
|
|
|
|
|
Open issues:
|
|
|
|
|
|
|
|
|
|
* `Add a new signal.set_wakeup_socket() function
|
|
|
|
|
<http://bugs.python.org/issue22018>`_
|
|
|
|
|
* `signal.set_wakeup_fd(fd): set the fd to non-blocking mode
|
|
|
|
|
<http://bugs.python.org/issue22042>`_
|
|
|
|
|
* `Use a monotonic clock to compute timeouts
|
|
|
|
|
<http://bugs.python.org/issue22043>`_
|
|
|
|
|
* `sys.stdout.write on OS X is not EINTR safe
|
|
|
|
|
<http://bugs.python.org/issue22007>`_
|
|
|
|
|
* `platform.uname() not EINTR safe
|
|
|
|
|
<http://bugs.python.org/issue21772>`_
|
|
|
|
|
* `asyncore does not handle EINTR in recv, send, connect, accept,
|
|
|
|
|
<http://bugs.python.org/issue11266>`_
|
|
|
|
|
* `socket.create_connection() doesn't handle EINTR properly
|
|
|
|
|
<http://bugs.python.org/issue20611>`_
|
|
|
|
|
|
|
|
|
|
Closed issues:
|
|
|
|
|
|
|
|
|
|
* `Interrupted system calls are not retried
|
|
|
|
|
<http://bugs.python.org/issue9867>`_
|
|
|
|
|
* `Solaris: EINTR exception in select/socket calls in telnetlib
|
|
|
|
|
<http://bugs.python.org/issue1049450>`_
|
|
|
|
|
* `subprocess: Popen.communicate() doesn't handle EINTR in some cases
|
|
|
|
|
<http://bugs.python.org/issue12493>`_
|
|
|
|
|
* `multiprocessing.util._eintr_retry doen't recalculate timeouts
|
|
|
|
|
<http://bugs.python.org/issue12338>`_
|
|
|
|
|
* `file readline, readlines & readall methods can lose data on EINTR
|
|
|
|
|
<http://bugs.python.org/issue12268>`_
|
|
|
|
|
* `multiprocessing BaseManager serve_client() does not check EINTR on recv
|
|
|
|
|
<http://bugs.python.org/issue17097>`_
|
|
|
|
|
* `selectors behaviour on EINTR undocumented
|
|
|
|
|
<http://bugs.python.org/issue19849>`_
|
|
|
|
|
* `asyncio: limit EINTR occurrences with SA_RESTART
|
|
|
|
|
<http://bugs.python.org/issue19850>`_
|
|
|
|
|
* `smtplib.py socket.create_connection() also doesn't handle EINTR properly
|
|
|
|
|
<http://bugs.python.org/issue21602>`_
|
|
|
|
|
* `Faulty RESTART/EINTR handling in Parser/myreadline.c
|
|
|
|
|
<http://bugs.python.org/issue11650>`_
|
|
|
|
|
* `test_httpservers intermittent failure, test_post and EINTR
|
|
|
|
|
<http://bugs.python.org/issue3771>`_
|
|
|
|
|
* `os.spawnv(P_WAIT, ...) on Linux doesn't handle EINTR
|
|
|
|
|
<http://bugs.python.org/issue686667>`_
|
|
|
|
|
* `asyncore fails when EINTR happens in pol
|
|
|
|
|
<http://bugs.python.org/issue517554>`_
|
|
|
|
|
* `file.write and file.read don't handle EINTR
|
|
|
|
|
<http://bugs.python.org/issue10956>`_
|
|
|
|
|
* `socket.readline() interface doesn't handle EINTR properly
|
|
|
|
|
<http://bugs.python.org/issue1628205>`_
|
|
|
|
|
* `subprocess is not EINTR-safe
|
|
|
|
|
<http://bugs.python.org/issue1068268>`_
|
|
|
|
|
* `SocketServer doesn't handle syscall interruption
|
|
|
|
|
<http://bugs.python.org/issue7978>`_
|
|
|
|
|
* `subprocess deadlock when read() is interrupted
|
|
|
|
|
<http://bugs.python.org/issue17367>`_
|
|
|
|
|
* `time.sleep(1): call PyErr_CheckSignals() if the sleep was interrupted
|
|
|
|
|
<http://bugs.python.org/issue12462>`_
|
|
|
|
|
* `siginterrupt with flag=False is reset when signal received
|
|
|
|
|
<http://bugs.python.org/issue8354>`_
|
|
|
|
|
* `need siginterrupt() on Linux - impossible to do timeouts
|
|
|
|
|
<http://bugs.python.org/issue1089358>`_
|
|
|
|
|
* `[Windows] Can not interrupt time.sleep()
|
|
|
|
|
<http://bugs.python.org/issue581232>`_
|
|
|
|
|
|
|
|
|
|
Python issues related to signals
|
|
|
|
|
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
|
|
|
|
|
|
|
|
|
|
Open issues:
|
|
|
|
|
|
2014-07-28 09:02:38 -04:00
|
|
|
|
* `signal.default_int_handler should set signal number on the raised
|
|
|
|
|
exception <http://bugs.python.org/issue17182>`_
|
2014-07-24 07:37:56 -04:00
|
|
|
|
* `expose signalfd(2) in the signal module
|
|
|
|
|
<http://bugs.python.org/issue12304>`_
|
|
|
|
|
* `missing return in win32_kill?
|
|
|
|
|
<http://bugs.python.org/issue14484>`_
|
|
|
|
|
* `Interrupts are lost during readline PyOS_InputHook processing
|
|
|
|
|
<http://bugs.python.org/issue3180>`_
|
|
|
|
|
* `cannot catch KeyboardInterrupt when using curses getkey()
|
|
|
|
|
<http://bugs.python.org/issue1687125>`_
|
|
|
|
|
* `Deferred KeyboardInterrupt in interactive mode
|
|
|
|
|
<http://bugs.python.org/issue16151>`_
|
|
|
|
|
|
|
|
|
|
Closed issues:
|
|
|
|
|
|
|
|
|
|
* `sys.interrupt_main()
|
|
|
|
|
<http://bugs.python.org/issue753733>`_
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Copyright
|
|
|
|
|
=========
|
|
|
|
|
|
|
|
|
|
This document has been placed in the public domain.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
..
|
|
|
|
|
Local Variables:
|
|
|
|
|
mode: indented-text
|
|
|
|
|
indent-tabs-mode: nil
|
|
|
|
|
sentence-end-double-space: t
|
|
|
|
|
fill-column: 70
|
|
|
|
|
coding: utf-8
|
|
|
|
|
End:
|