Changes to pep-3118.txt
This commit is contained in:
parent
ac9a2701b5
commit
5f33b4b632
515
pep-3118.txt
515
pep-3118.txt
|
@ -1,8 +1,9 @@
|
||||||
|
|
||||||
PEP: 3118
|
PEP: 3118
|
||||||
Title: Revising the buffer protocol
|
Title: Revising the buffer protocol
|
||||||
Version: $Revision$
|
Version: $Revision$
|
||||||
Last-Modified: $Date$
|
Last-Modified: $Date$
|
||||||
Author: Travis Oliphant <oliphant@ee.byu.edu>, Carl Banks <pythondev@aerojockey.com>
|
Authors: Travis Oliphant <oliphant@ee.byu.edu>, Carl Banks <pythondev@aerojockey.com>
|
||||||
Status: Draft
|
Status: Draft
|
||||||
Type: Standards Track
|
Type: Standards Track
|
||||||
Content-Type: text/x-rst
|
Content-Type: text/x-rst
|
||||||
|
@ -13,8 +14,8 @@ Abstract
|
||||||
========
|
========
|
||||||
|
|
||||||
This PEP proposes re-designing the buffer interface (PyBufferProcs
|
This PEP proposes re-designing the buffer interface (PyBufferProcs
|
||||||
function pointers) to improve the way Python allows memory sharing
|
function pointers) to improve the way Python allows memory sharing in
|
||||||
in Python 3.0
|
Python 3.0
|
||||||
|
|
||||||
In particular, it is proposed that the character buffer portion
|
In particular, it is proposed that the character buffer portion
|
||||||
of the API be elminated and the multiple-segment portion be
|
of the API be elminated and the multiple-segment portion be
|
||||||
|
@ -73,7 +74,7 @@ has issues:
|
||||||
|
|
||||||
NumPy uses the notion of constant striding in each dimension as its
|
NumPy uses the notion of constant striding in each dimension as its
|
||||||
basic concept of an array. With this concept, a simple sub-region
|
basic concept of an array. With this concept, a simple sub-region
|
||||||
of a larger array can be described without copying the data. T
|
of a larger array can be described without copying the data.
|
||||||
Thus, stride information is the additional information that must be
|
Thus, stride information is the additional information that must be
|
||||||
shared.
|
shared.
|
||||||
|
|
||||||
|
@ -151,7 +152,7 @@ Change the PyBufferProcs structure to
|
||||||
|
|
||||||
::
|
::
|
||||||
|
|
||||||
typedef int (*getbufferproc)(PyObject *obj, struct bufferinfo *view, int flags)
|
typedef int (*getbufferproc)(PyObject *obj, PyBuffer *view, int flags)
|
||||||
|
|
||||||
This function returns 0 on success and -1 on failure (and raises an
|
This function returns 0 on success and -1 on failure (and raises an
|
||||||
error). The first variable is the "exporting" object. The second
|
error). The first variable is the "exporting" object. The second
|
||||||
|
@ -160,54 +161,137 @@ then no information is returned but a lock on the memory is still
|
||||||
obtained. In this case, the corresponding releasebuffer should also
|
obtained. In this case, the corresponding releasebuffer should also
|
||||||
be called with NULL.
|
be called with NULL.
|
||||||
|
|
||||||
The third argument indicates what kind of buffer the exporter is allowed to return. It tells the
|
The third argument indicates what kind of buffer the consumer is
|
||||||
exporter what elements the bufferinfo structure the consumer is going to make use of. This
|
prepared to deal with and therefore what kind of buffer the exporter
|
||||||
allows the exporter to simplify and/or raise an error if it can't support the operation.
|
is allowed to return. The new buffer interface allows for much more
|
||||||
|
complicated memory sharing possibilities. Some consumers may not be
|
||||||
|
able to handle all the complexibity but may want to see if the
|
||||||
|
exporter will let them take a simpler view to its memory.
|
||||||
|
|
||||||
It also allows the caller to make a request for a simple "view" and
|
In addition, some exporters may not be able to share memory in every
|
||||||
receive it or have an error raised if it's not possible.
|
possible way and may need to raise errors to signal to some consumers
|
||||||
|
that something is just not possible. These errors should be
|
||||||
|
PyErr_BufferError unless there is another error that is actually
|
||||||
|
causing the problem. The exporter can use flags information to
|
||||||
|
simplify how much of the PyBuffer structure is filled in with
|
||||||
|
non-default values and/or raise an error if the object can't support a
|
||||||
|
simpler view of its memory.
|
||||||
|
|
||||||
All of the following assume that at least buf, len, and readonly will be
|
The exporter should always fill in all elements of the buffer
|
||||||
utilized by the caller.
|
structure (with defaults if nothing else is requested).
|
||||||
|
|
||||||
Py_BUF_SIMPLE
|
Some flags are useful for requesting a specific kind of memory
|
||||||
The returned buffer will only be assumed to be readable (the object
|
segment, while others indicate to the exporter what kind of
|
||||||
may or may not have writeable memory). Only the buf, len, and
|
information the consumer can deal with. If certain information is not
|
||||||
readonly variables may be accessed. The format will be
|
asked for by the consumer, but the exporter cannot share its memory
|
||||||
assumed to be unsigned bytes. This is a "stand-alone" flag constant.
|
without that information, then a PyErr_BufferError should be raised.
|
||||||
It never needs to be \|'d to the others.
|
|
||||||
|
|
||||||
Py_BUF_WRITEABLE
|
|
||||||
The returned buffer must be writeable. If it cannot be, then raise an error.
|
|
||||||
|
|
||||||
Py_BUF_READONLY
|
PyBUF_SIMPLE
|
||||||
The returned buffer must be readonly and the underlying object should make
|
|
||||||
its memory readonly if that is possible.
|
|
||||||
|
|
||||||
Py_BUF_FORMAT
|
This is the default flag state (0). The returned buffer may or may
|
||||||
The consumer will be using the format string information so make sure that
|
not have writeable memory. The format will be assumed to be
|
||||||
member is filled correctly.
|
unsigned bytes . This is a "stand-alone" flag constant. It never
|
||||||
|
needs to be |'d to the others. The exporter will raise an error if
|
||||||
|
it cannot provide such a contiguous buffer of bytes.
|
||||||
|
|
||||||
Py_BUF_SHAPE
|
PyBUF_REQ_WRITEABLE
|
||||||
The consumer can (and might) make use of using the ndims and shape members of the structure
|
|
||||||
so make sure they are filled in correctly.
|
|
||||||
|
|
||||||
Py_BUF_STRIDES (implies SHAPE)
|
The returned buffer must be writeable. If it is not writeable,
|
||||||
The consumer can (and might) make use of the strides member of the structure (as well
|
then raise an error.
|
||||||
as ndims and shape)
|
|
||||||
|
|
||||||
Py_BUF_OFFSETS (implies STRIDES)
|
PyBUF_REQ_LOCKDATA
|
||||||
The consumer can (and might) make use of the suboffsets member (as well as
|
|
||||||
ndims, shape, and strides)
|
|
||||||
|
|
||||||
Thus, the consumer simply wanting an contiguous chunk of bytes from
|
The returned buffer must be readonly. If the object is already
|
||||||
the object would use Py_BUF_SIMPLE, while a consumer that understands
|
read-only or it can make its memory read-only (and there are no
|
||||||
how to make use of the most complicated cases would use
|
other views on the object) then it should do so and return the
|
||||||
Py_BUF_OFFSETS.
|
buffer information. If the object does not have read-only memory
|
||||||
|
(or cannot make it read-only), then an error should be raised.
|
||||||
|
|
||||||
|
PyBUF_REQ_FORMAT
|
||||||
|
|
||||||
|
The returned buffer must have true format information if this flag
|
||||||
|
is provided. This would be used when the consumer is going to be
|
||||||
|
checking for what 'kind' of data is actually stored. An exporter
|
||||||
|
should always be able to provide this information if requested. If
|
||||||
|
format is not explicitly requested then the format must be returned
|
||||||
|
as NULL (which means "B")
|
||||||
|
|
||||||
|
PyBUF_REQ_ALIGNED (implies PyBUF_REQ_FORMAT)
|
||||||
|
|
||||||
|
The returned buffer must have all (primitive data-type entries)
|
||||||
|
aligned as the compiler would align them
|
||||||
|
|
||||||
|
PyBUF_ALW_ND
|
||||||
|
|
||||||
|
The returned buffer must provide shape information. The memory will
|
||||||
|
be assumed C-style contiguous (last dimension varies the fastest).
|
||||||
|
The exporter may raise an error if it cannot provide this kind of
|
||||||
|
contiguous buffer.
|
||||||
|
|
||||||
|
PyBUF_ALW_STRIDES (implies PyBUF_ALW_ND)
|
||||||
|
|
||||||
|
The returned buffer must provide strides information. This would be
|
||||||
|
used when the consumer can handle strided, discontiguous arrays.
|
||||||
|
Handling strides automatically assumes you can handle shape.
|
||||||
|
The exporter may raise an error if cannot provide a strided-only
|
||||||
|
representation of the data (i.e. without the suboffsets).
|
||||||
|
|
||||||
|
PyBUF_REQ_C_CONTIGUOUS
|
||||||
|
PyBUF_REQ_F_CONTIGUOUS
|
||||||
|
PyBUF_REQ_ANY_CONTIGUOUS
|
||||||
|
|
||||||
|
These flags indicate that the returned buffer must be respectively,
|
||||||
|
C-contiguous (last dimension varies the fastest), Fortran
|
||||||
|
contiguous (first dimension varies the fastest) or either one.
|
||||||
|
All of these flags imply PyBUF_ALW_STRIDES and guarantee that the
|
||||||
|
strides buffer info structure will be filled in correctly.
|
||||||
|
|
||||||
|
|
||||||
|
PyBUF_ALW_INDIRECT (implies PyBUF_ALW_STRIDES)
|
||||||
|
|
||||||
|
The returned buffer must have suboffsets information. This would
|
||||||
|
be used when the consumer can handle indirect array referencing
|
||||||
|
implied by these suboffsets.
|
||||||
|
|
||||||
|
|
||||||
|
Specialized combinations of flags for specific kinds of memory_sharing.
|
||||||
|
|
||||||
|
Multi-dimensional (but contiguous)
|
||||||
|
|
||||||
|
PyBUF_CONTIG (PyBUF_ALW_ND | PyBUF_REQ_WRITEABLE | PyBUF_REQ_ALIGNED)
|
||||||
|
PyBUF_CONTIG_RO (PyBUF_ALW_ND | PyBUF_REQ_ALIGNED)
|
||||||
|
PyBUF_CONTIG_LCK (PyBUF_ALW_ND | PyBUF_REQ_LOCKDATA | PyBUF_REQ_ALIGNED)
|
||||||
|
|
||||||
|
Multi-dimensional using strides but aligned
|
||||||
|
|
||||||
|
PyBUF_STRIDED (PyBUF_ALW_STRIDES | PyBUF_REQ_WRITEABLE | PyBUF_REQ_ALIGNED)
|
||||||
|
PyBUF_STRIDED_RO (PyBUF_ALW_STRIDES | PyBUF_REQ_ALIGNED)
|
||||||
|
PyBUF_STRIDED_LCK (PyBUF_ALW_STRIDES | PyBUF_REQ_LOCKDATA | PyBUF_REQ_ALIGNED)
|
||||||
|
|
||||||
|
Multi-dimensional using strides and not necessarily aligned
|
||||||
|
|
||||||
|
PyBUF_RECORDS (PyBUF_ALW_STRIDES | PyBUF_REQ_WRITEABLE | PyBUF_REQ_FORMAT)
|
||||||
|
PyBUF_RECORDS_RO (PyBUF_ALW_STRIDES | PyBUF_REQ_FORMAT)
|
||||||
|
PyBUF_RECORDS_LCK (PyBUF_ALW_STRIDES | PyBUF_REQ_LOCKDATA | PyBUF_REQ_FORMAT)
|
||||||
|
|
||||||
|
Multi-dimensional using sub-offsets
|
||||||
|
|
||||||
|
PyBUF_FULL (PyBUF_ALW_INDIRECT | PyBUF_REQ_WRITEABLE | PyBUF_REQ_FORMAT)
|
||||||
|
PyBUF_FULL_RO (PyBUF_ALW_INDIRECT | PyBUF_REQ_FORMAT)
|
||||||
|
PyBUF_FULL_LCK (PyBUF_ALW_INDIRECT | PyBUF_REQ_LOCKDATA | PyBUF_REQ_FORMAT)
|
||||||
|
|
||||||
|
Thus, the consumer simply wanting a contiguous chunk of bytes from
|
||||||
|
the object would use PyBUF_SIMPLE, while a consumer that understands
|
||||||
|
how to make use of the most complicated cases could use PyBUF_FULL
|
||||||
|
|
||||||
|
The format information is only guaranteed to be non-NULL if
|
||||||
|
PyBUF_REQ_FORMAT is in the flag argument, otherwise it is expected the
|
||||||
|
consumer will assume unsigned bytes.
|
||||||
|
|
||||||
There is a C-API that simple exporting objects can use to fill-in the
|
There is a C-API that simple exporting objects can use to fill-in the
|
||||||
buffer info structure correctly according to the provided flags if a
|
buffer info structure correctly according to the provided flags if a
|
||||||
contiguous chunk of memory is all that can be exported.
|
contiguous chunk of "unsigned bytes" is all that can be exported.
|
||||||
|
|
||||||
|
|
||||||
The bufferinfo structure is::
|
The bufferinfo structure is::
|
||||||
|
@ -216,21 +300,23 @@ The bufferinfo structure is::
|
||||||
void *buf;
|
void *buf;
|
||||||
Py_ssize_t len;
|
Py_ssize_t len;
|
||||||
int readonly;
|
int readonly;
|
||||||
char *format;
|
const char *format;
|
||||||
int ndims;
|
int ndim;
|
||||||
Py_ssize_t *shape;
|
Py_ssize_t *shape;
|
||||||
Py_ssize_t *strides;
|
Py_ssize_t *strides;
|
||||||
Py_ssize_t *suboffsets;
|
Py_ssize_t *suboffsets;
|
||||||
|
int itemsize;
|
||||||
void *internal;
|
void *internal;
|
||||||
};
|
} PyBuffer;
|
||||||
|
|
||||||
Before calling this function, the bufferinfo structure can be filled with
|
Before calling this function, the bufferinfo structure can be filled
|
||||||
whatever. Upon return from getbufferproc, the bufferinfo structure is filled in
|
with whatever. Upon return from getbufferproc, the bufferinfo
|
||||||
with relevant information about the buffer. This same bufferinfo
|
structure is filled in with relevant information about the buffer.
|
||||||
structure must be passed to bf_releasebuffer (if available) when the
|
This same bufferinfo structure must be passed to bf_releasebuffer (if
|
||||||
consumer is done with the memory. The caller is responsible for
|
available) when the consumer is done with the memory. The caller is
|
||||||
keeping a reference to obj until releasebuffer is called (i.e. this
|
responsible for keeping a reference to obj until releasebuffer is
|
||||||
call does not alter the reference count of obj).
|
called (i.e. the call to bf_getbuffer does not alter the reference
|
||||||
|
count of obj).
|
||||||
|
|
||||||
The members of the bufferinfo structure are:
|
The members of the bufferinfo structure are:
|
||||||
|
|
||||||
|
@ -243,32 +329,38 @@ len
|
||||||
bytes per item of memory.
|
bytes per item of memory.
|
||||||
|
|
||||||
readonly
|
readonly
|
||||||
an integer variable to hold whether or not the memory is
|
an integer variable to hold whether or not the memory is readonly.
|
||||||
readonly. 1 means the memory is readonly, zero means the
|
1 means the memory is readonly, zero means the memory is
|
||||||
memory is writeable.
|
writeable, and -1 means the memory was "locked" (changed from
|
||||||
|
writeable to readonly) when this PyBuffer structure was filled-in
|
||||||
|
and therefore should be unlocked when this PyBuffer structure is
|
||||||
|
"released" (this is supported only by some exporters).
|
||||||
|
|
||||||
format
|
format
|
||||||
a NULL-terminated format-string (following the struct-style syntax
|
a NULL-terminated format-string (following the struct-style syntax
|
||||||
including extensions) indicating what is in each element of
|
including extensions) indicating what is in each element of
|
||||||
memory. The number of elements is len / itemsize, where itemsize
|
memory. The number of elements is len / itemsize, where itemsize
|
||||||
is the number of bytes implied by the format. For standard
|
is the number of bytes implied by the format. This can be NULL which
|
||||||
unsigned bytes use a format string of "B".
|
implies standard unsigned bytes ("B").
|
||||||
|
|
||||||
ndims
|
ndims
|
||||||
a variable storing the number of dimensions the memory represents.
|
a variable storing the number of dimensions the memory represents.
|
||||||
Must be >=0.
|
Must be >=0. A value of 0 means that shape and strides and suboffsets
|
||||||
|
must be NULL (i.e. the memory represents a scalar).
|
||||||
|
|
||||||
shape
|
shape
|
||||||
an array of ``Py_ssize_t`` of length ``ndims`` indicating the
|
an array of ``Py_ssize_t`` of length ``ndims`` indicating the
|
||||||
shape of the memory as an N-D array. Note that ``((*shape)[0] *
|
shape of the memory as an N-D array. Note that ``((*shape)[0] *
|
||||||
... * (*shape)[ndims-1])*itemsize = len``.
|
... * (*shape)[ndims-1])*itemsize = len``. If ndims is 0 (indicating
|
||||||
|
a scalar), then this must be NULL.
|
||||||
|
|
||||||
strides
|
strides
|
||||||
address of a ``Py_ssize_t*`` variable that will be filled with a
|
address of a ``Py_ssize_t*`` variable that will be filled with a
|
||||||
pointer to an array of ``Py_ssize_t`` of length ``*ndims``
|
pointer to an array of ``Py_ssize_t`` of length ``ndims`` (or NULL
|
||||||
indicating the number of bytes to skip to get to the next element
|
if ndims is 0). indicating the number of bytes to skip to get to
|
||||||
in each dimension. For C-style contiguous arrays (where the
|
the next element in each dimension. If this is not requested by
|
||||||
last-dimension varies the fastest) this must be filled in.
|
the caller (PyBUF_ALW_STRIDES is not set), then this should be set
|
||||||
|
to NULL which indicates a C-style contiguous array.
|
||||||
|
|
||||||
suboffsets
|
suboffsets
|
||||||
address of a ``Py_ssize_t *`` variable that will be filled with a
|
address of a ``Py_ssize_t *`` variable that will be filled with a
|
||||||
|
@ -279,14 +371,14 @@ suboffsets
|
||||||
suboffset value that it negative indicates that no de-referencing
|
suboffset value that it negative indicates that no de-referencing
|
||||||
should occur (striding in a contiguous memory block). If all
|
should occur (striding in a contiguous memory block). If all
|
||||||
suboffsets are negative (i.e. no de-referencing is needed, then
|
suboffsets are negative (i.e. no de-referencing is needed, then
|
||||||
this must be NULL.
|
this must be NULL (the default value).
|
||||||
|
|
||||||
For clarity, here is a function that returns a pointer to the
|
For clarity, here is a function that returns a pointer to the
|
||||||
element in an N-D array pointed to by an N-dimesional index when
|
element in an N-D array pointed to by an N-dimesional index when
|
||||||
there are both strides and suboffsets.::
|
there are both non-NULL strides and suboffsets.::
|
||||||
|
|
||||||
void* get_item_pointer(int ndim, void* buf, Py_ssize_t* strides,
|
void* get_item_pointer(int ndim, void* buf, Py_ssize_t* strides,
|
||||||
Py_ssize_t* suboffsets, Py_ssize_t *indices) {
|
Py_ssize_t* suboffsets, Py_ssize_t *indices) {
|
||||||
char* pointer = (char*)buf;
|
char* pointer = (char*)buf;
|
||||||
int i;
|
int i;
|
||||||
for (i = 0; i < ndim; i++) {
|
for (i = 0; i < ndim; i++) {
|
||||||
|
@ -304,28 +396,36 @@ suboffsets
|
||||||
the location of the starting pointer directly (i.e. buf would
|
the location of the starting pointer directly (i.e. buf would
|
||||||
be modified).
|
be modified).
|
||||||
|
|
||||||
|
itemsize
|
||||||
|
This is a storage for the itemsize of each element of the shared
|
||||||
|
memory. It is strictly un-necessary as it can be obtained using
|
||||||
|
PyBuffer_SizeFromFormat, however an exporter may know this
|
||||||
|
information without parsing the format string and it is necessary
|
||||||
|
to know the itemsize for proper interpretation of striding.
|
||||||
|
Therefore, storing it is more convenient and faster.
|
||||||
|
|
||||||
internal
|
internal
|
||||||
This is for use internally by the exporting object. For example,
|
This is for use internally by the exporting object. For example,
|
||||||
this might be re-cast as an integer by the exporter and used to
|
this might be re-cast as an integer by the exporter and used to
|
||||||
store flags about whether or not the shape, strides, and suboffsets
|
store flags about whether or not the shape, strides, and suboffsets
|
||||||
arrays must be freed when the buffer is released. The consumer
|
arrays must be freed when the buffer is released. The consumer
|
||||||
should never touch this value.
|
should never alter this value.
|
||||||
|
|
||||||
|
|
||||||
The exporter is responsible for making sure the memory pointed to by
|
The exporter is responsible for making sure that any memory pointed to
|
||||||
buf, format, shape, strides, and suboffsets is valid until
|
by buf, format, shape, strides, and suboffsets is valid until
|
||||||
releasebuffer is called. If the exporter wants to be able to change
|
releasebuffer is called. If the exporter wants to be able to change
|
||||||
shape, strides, and/or suboffsets before releasebuffer is called then
|
an object's shape, strides, and/or suboffsets before releasebuffer is
|
||||||
it should allocate those arrays when getbuffer is called (pointing to
|
called then it should allocate those arrays when getbuffer is called
|
||||||
them in the buffer-info structure provided) and free them when
|
(pointing to them in the buffer-info structure provided) and free them
|
||||||
releasebuffer is called.
|
when releasebuffer is called.
|
||||||
|
|
||||||
|
|
||||||
The same bufferinfo struct should be used in the release-buffer
|
The same bufferinfo struct should be used in the release-buffer
|
||||||
interface call. The caller is responsible for the memory of the
|
interface call. The caller is responsible for the memory of the
|
||||||
bufferinfo structure itself.
|
bufferinfo structure itself.
|
||||||
|
|
||||||
``typedef int (*releasebufferproc)(PyObject *obj, struct bufferinfo *view)``
|
``typedef int (*releasebufferproc)(PyObject *obj, PyBuffer *view)``
|
||||||
Callers of getbufferproc must make sure that this function is
|
Callers of getbufferproc must make sure that this function is
|
||||||
called when memory previously acquired from the object is no
|
called when memory previously acquired from the object is no
|
||||||
longer needed. The exporter of the interface must make sure that
|
longer needed. The exporter of the interface must make sure that
|
||||||
|
@ -348,7 +448,8 @@ each object.
|
||||||
|
|
||||||
All that is specifically required by the exporter, however, is to
|
All that is specifically required by the exporter, however, is to
|
||||||
ensure that any memory shared through the bufferinfo structure remains
|
ensure that any memory shared through the bufferinfo structure remains
|
||||||
valid until releasebuffer is called on the bufferinfo structure.
|
valid until releasebuffer is called on the bufferinfo structure
|
||||||
|
exporting that memory.
|
||||||
|
|
||||||
|
|
||||||
New C-API calls are proposed
|
New C-API calls are proposed
|
||||||
|
@ -362,7 +463,8 @@ Return 1 if the getbuffer function is available otherwise 0.
|
||||||
|
|
||||||
::
|
::
|
||||||
|
|
||||||
int PyObject_GetBuffer(PyObject *obj, struct bufferinfo *view, int flags)
|
int PyObject_GetBuffer(PyObject *obj, PyBuffer *view,
|
||||||
|
int flags)
|
||||||
|
|
||||||
This is a C-API version of the getbuffer function call. It checks to
|
This is a C-API version of the getbuffer function call. It checks to
|
||||||
make sure object has the required function pointer and issues the
|
make sure object has the required function pointer and issues the
|
||||||
|
@ -370,61 +472,85 @@ call. Returns -1 and raises an error on failure and returns 0 on
|
||||||
success.
|
success.
|
||||||
|
|
||||||
::
|
::
|
||||||
|
int PyObject_ReleaseBuffer(PyObject *obj, PyBuffer *view)
|
||||||
|
|
||||||
int PyObject_ReleaseBuffer(PyObject *obj, struct bufferinfo *view)
|
This is a C-API version of the releasebuffer function call. It checks
|
||||||
|
to make sure the object has the required function pointer and issues
|
||||||
This is a C-API version of the releasebuffer function call. It checks to
|
the call. Returns 0 on success and -1 (with an error raised) on
|
||||||
make sure the object has the required function pointer and issues the call. Returns 0
|
failure. This function always succeeds if there is no releasebuffer
|
||||||
on success and -1 (with an error raised) on failure. This function always
|
function for the object.
|
||||||
succeeds if there is no releasebuffer function for the object.
|
|
||||||
|
|
||||||
::
|
::
|
||||||
|
|
||||||
PyObject *PyObject_GetMemoryView(PyObject *obj)
|
PyObject *PyObject_GetMemoryView(PyObject *obj)
|
||||||
|
|
||||||
Return a memory-view object from an object that defines the buffer interface.
|
Return a memory-view object from an object that defines the buffer interface.
|
||||||
If make_ro is non-zero then request that the memory is made read-only until
|
|
||||||
release buffer is called.
|
|
||||||
|
|
||||||
A memory-view object is an extended buffer object that should replace
|
A memory-view object is an extended buffer object that could replace
|
||||||
the buffer object in Python 3K. It's C-structure is::
|
the buffer object (but doesn't have to as that could be kept as a
|
||||||
|
simple 1-d memory-view object). It's C-structure is
|
||||||
|
|
||||||
|
::
|
||||||
|
|
||||||
typedef struct {
|
typedef struct {
|
||||||
PyObject_HEAD
|
PyObject_HEAD
|
||||||
PyObject *base;
|
PyObject *base;
|
||||||
struct bufferinfo view;
|
int ndims;
|
||||||
int itemsize;
|
Py_ssize_t *starts; /* slice starts */
|
||||||
int flags;
|
Py_ssize_t *stops; /* slice stops */
|
||||||
|
Py_ssize_t *steps; /* slice steps */
|
||||||
} PyMemoryViewObject;
|
} PyMemoryViewObject;
|
||||||
|
|
||||||
This is very similar to the current buffer object except offset has
|
This is functionally similar to the current buffer object except only
|
||||||
been removed because ptr can just be modified by offset and a single
|
a reference to base is kept. The actual memory for base must be
|
||||||
offset is not sufficient for the sub-offsets. Also the hash has been
|
re-grabbed using the buffer-protocol, whenever it is actually
|
||||||
removed because using the buffer object as a hash even if it is
|
needed (after which it is immediately released).
|
||||||
read-only is rarely useful.
|
|
||||||
|
|
||||||
Also, the format, ndims, shape, strides, and suboffsets have been
|
The getbuffer and releasebuffer for this object use the underlying
|
||||||
added. These additions will allow multi-dimensional slicing of the
|
base object (adjusted as needed using the slice information). If the
|
||||||
memory-view object which can be added at some point. This object
|
number of dimensions of the base object (or the strides or the size)
|
||||||
always owns it's own shape, strides, and suboffsets arrays and it's
|
has changed when a new view is requested, then the getbuffer will
|
||||||
own format string, but always borrows the memory from the object
|
trigger an error. Methods on the MemoryView object will allow
|
||||||
pointed to by base.
|
manual locking and unlocking of the underlying buffer.
|
||||||
|
|
||||||
The itemsize is a convenience and specifies the number of bytes
|
This memory-view object will support mult-dimensional slicing and be
|
||||||
indicated by the format string if positive.
|
the first object provided with Python to do so. Slices of the
|
||||||
|
memory-view object are other memory-view objects. When an "element"
|
||||||
|
from the memory-view is returned it is always a bytes object whose
|
||||||
|
format should be interpreted by the format attribute of the memoryview
|
||||||
|
object. The struct module can be used to "decode" the bytes in Python
|
||||||
|
if desired.
|
||||||
|
|
||||||
This object never reallocates ptr, shape, strides, subboffsets or
|
The Python name will be
|
||||||
format and therefore does not need to keep track of how many views it
|
|
||||||
has exported.
|
__builtin__.memory
|
||||||
|
|
||||||
|
Methods:
|
||||||
|
|
||||||
|
lock
|
||||||
|
unlock
|
||||||
|
__getitem__ (will support multi-dimensional slicing)
|
||||||
|
__setitem__ (will support multi-dimensional slicing)
|
||||||
|
tobytes (obtain a bytes-object of everything in the memory).
|
||||||
|
tolist (obtain a "nested" list of the memory. Everything
|
||||||
|
is interpreted into standard Python objects
|
||||||
|
as the struct module unpack would do).
|
||||||
|
|
||||||
|
Attributes (taken from the memory of the base object)
|
||||||
|
|
||||||
|
format
|
||||||
|
itemsize
|
||||||
|
shape
|
||||||
|
strides
|
||||||
|
suboffsets
|
||||||
|
size
|
||||||
|
readonly
|
||||||
|
ndim
|
||||||
|
|
||||||
It exports a view using the base object. It releases a view by
|
|
||||||
releasing the view on the base object. Because, it will never
|
|
||||||
re-allocate memory, it does not need to keep track of how many it has
|
|
||||||
exported but simple reference counting will suffice.
|
|
||||||
|
|
||||||
::
|
::
|
||||||
|
|
||||||
int PyObject_SizeFromFormat(char *)
|
int PyBuffer_SizeFromFormat(const char *)
|
||||||
|
|
||||||
Return the implied itemsize of the data-format area from a struct-style
|
Return the implied itemsize of the data-format area from a struct-style
|
||||||
description.
|
description.
|
||||||
|
@ -432,65 +558,82 @@ description.
|
||||||
::
|
::
|
||||||
|
|
||||||
int PyObject_GetContiguous(PyObject *obj, void **buf, Py_ssize_t *len,
|
int PyObject_GetContiguous(PyObject *obj, void **buf, Py_ssize_t *len,
|
||||||
int fortran)
|
char **format, char fortran)
|
||||||
|
|
||||||
Return a contiguous chunk of memory representing the buffer. If a
|
Return a contiguous chunk of memory representing the buffer. If a
|
||||||
copy is made then return 1. If no copy was needed return 0. If an
|
copy is made then return 1. If no copy was needed return 0. If an
|
||||||
error occurred in probing the buffer interface, then return -1. The
|
error occurred in probing the buffer interface, then return -1. The
|
||||||
contiguous chunk of memory is pointed to by ``*buf`` and the length of
|
contiguous chunk of memory is pointed to by ``*buf`` and the length of
|
||||||
that memory is ``*len``. If the object is multi-dimensional, then if
|
that memory is ``*len``. If the object is multi-dimensional, then if
|
||||||
fortran is 1, the first dimension of the underlying array will vary
|
fortran is 'F', the first dimension of the underlying array will vary
|
||||||
the fastest in the buffer. If fortran is 0, then the last dimension
|
the fastest in the buffer. If fortran is 'C', then the last dimension
|
||||||
will vary the fastest (C-style contiguous). If fortran is -1, then it
|
will vary the fastest (C-style contiguous). If fortran is 'A', then it
|
||||||
does not matter and you will get whatever the object decides is more
|
does not matter and you will get whatever the object decides is more
|
||||||
efficient.
|
efficient. If a copy is made, then the memory must be freed by calling
|
||||||
|
``PyMem_Free``.
|
||||||
|
|
||||||
::
|
::
|
||||||
|
|
||||||
int PyObject_CopyToObject(PyObject *obj, void *buf, Py_ssize_t len,
|
int PyObject_CopyToObject(PyObject *obj, void *buf, Py_ssize_t len,
|
||||||
int fortran)
|
char fortran)
|
||||||
|
|
||||||
Copy ``len`` bytes of data pointed to by the contiguous chunk of
|
Copy ``len`` bytes of data pointed to by the contiguous chunk of
|
||||||
memory pointed to by ``buf`` into the buffer exported by obj. Return
|
memory pointed to by ``buf`` into the buffer exported by obj. Return
|
||||||
0 on success and return -1 and raise an error on failure. If the
|
0 on success and return -1 and raise an error on failure. If the
|
||||||
object does not have a writeable buffer, then an error is raised. If
|
object does not have a writeable buffer, then an error is raised. If
|
||||||
fortran is 1, then if the object is multi-dimensional, then the data
|
fortran is 'F', then if the object is multi-dimensional, then the data
|
||||||
will be copied into the array in Fortran-style (first dimension varies
|
will be copied into the array in Fortran-style (first dimension varies
|
||||||
the fastest). If fortran is 0, then the data will be copied into the
|
the fastest). If fortran is 'C', then the data will be copied into the
|
||||||
array in C-style (last dimension varies the fastest). If fortran is -1, then
|
array in C-style (last dimension varies the fastest). If fortran is 'A', then
|
||||||
it does not matter and the copy will be made in whatever way is more
|
it does not matter and the copy will be made in whatever way is more
|
||||||
efficient.
|
efficient.
|
||||||
|
|
||||||
The last two C-API calls allow a standard way of getting data in and
|
|
||||||
|
These last three C-API calls allow a standard way of getting data in and
|
||||||
out of Python objects into contiguous memory areas no matter how it is
|
out of Python objects into contiguous memory areas no matter how it is
|
||||||
actually stored. These calls use the extended buffer interface to perform
|
actually stored. These calls use the extended buffer interface to perform
|
||||||
their work.
|
their work.
|
||||||
|
|
||||||
::
|
::
|
||||||
|
|
||||||
int PyObject_IsContiguous(struct bufferinfo *view, int fortran);
|
int PyBuffer_IsContiguous(PyBuffer *view, char fortran);
|
||||||
|
|
||||||
Return 1 if the memory defined by the view object is C-style (fortran = 0)
|
Return 1 if the memory defined by the view object is C-style (fortran
|
||||||
or Fortran-style (fortran = 1) contiguous. Return 0 otherwise.
|
= 'C') or Fortran-style (fortran = 'F') contiguous or either one
|
||||||
|
(fortran = 'A'). Return 0 otherwise.
|
||||||
|
|
||||||
|
::
|
||||||
|
int PyBuffer_IsAligned(PyBuffer *view);
|
||||||
|
|
||||||
|
Return 1 if the memory at all elements of the array implied by the
|
||||||
|
view object is aligned
|
||||||
|
|
||||||
::
|
::
|
||||||
|
|
||||||
void PyObject_FillContiguousStrides(int *ndims, Py_ssize_t *shape,
|
void PyBuffer_FillContiguousStrides(int *ndims, Py_ssize_t *shape,
|
||||||
int itemsize,
|
int itemsize,
|
||||||
Py_ssize_t *strides, int fortran)
|
Py_ssize_t *strides, char fortran)
|
||||||
|
|
||||||
Fill the strides array with byte-strides of a contiguous (C-style if
|
Fill the strides array with byte-strides of a contiguous (C-style if
|
||||||
fortran is 0 or Fortran-style if fortran is 1) array of the given
|
fortran is 0 or Fortran-style if fortran is 1) array of the given
|
||||||
shape with the given number of bytes per element.
|
shape with the given number of bytes per element.
|
||||||
|
|
||||||
::
|
::
|
||||||
|
int PyBuffer_FillInfo(PyBuffer *view, void *buf,
|
||||||
|
Py_ssize_t len, int readonly, int infoflags)
|
||||||
|
|
||||||
int PyObject_FillBufferInfo(struct bufferinfo *view, void *buf, Py_ssize_t len,
|
Fills in a buffer-info structure correctly for an exporter that can
|
||||||
int readonly, int infoflags)
|
only share a contiguous chunk of memory of "unsigned bytes" of the
|
||||||
|
given length. Returns 0 on success and -1 (with raising an error) on
|
||||||
|
error.
|
||||||
|
|
||||||
Fills in a buffer-info structure correctly for an exporter that can only share
|
::
|
||||||
a contiguous chunk of memory of "unsigned bytes" of the given length. Returns 0 on success
|
PyErr_BufferError
|
||||||
and -1 (with raising an error) on error
|
|
||||||
|
A new error object for returning buffer errors which arise because an
|
||||||
|
exporter cannot provide the kind of buffer that a consumer expects.
|
||||||
|
This will also be raised when a consumer requests a buffer from an
|
||||||
|
object that does not provide the protocol.
|
||||||
|
|
||||||
|
|
||||||
Additions to the struct string-syntax
|
Additions to the struct string-syntax
|
||||||
|
@ -521,22 +664,29 @@ Character Description
|
||||||
':name:' optional name of the preceeding element
|
':name:' optional name of the preceeding element
|
||||||
'X{}' pointer to a function (optional function
|
'X{}' pointer to a function (optional function
|
||||||
signature inside {})
|
signature inside {})
|
||||||
' \n\t' ignored (allow better readability) -- this may already be true
|
' \n\t' ignored (allow better readability)
|
||||||
|
-- this may already be true
|
||||||
================ ===========
|
================ ===========
|
||||||
|
|
||||||
The struct module will be changed to understand these as well and
|
The struct module will be changed to understand these as well and
|
||||||
return appropriate Python objects on unpacking. Un-packing a
|
return appropriate Python objects on unpacking. Unpacking a
|
||||||
long-double will return a decimal object or a ctypes long-double.
|
long-double will return a decimal object or a ctypes long-double.
|
||||||
Unpacking 'u' or 'w' will return Python unicode. Unpacking a
|
Unpacking 'u' or 'w' will return Python unicode. Unpacking a
|
||||||
multi-dimensional array will return a list of lists. Un-packing a
|
multi-dimensional array will return a list (of lists if >1d).
|
||||||
pointer will return a ctypes pointer object. Un-packing a bit will
|
Unpacking a pointer will return a ctypes pointer object. Unpacking a
|
||||||
return a Python Bool. Spaces in the struct-string syntax will be
|
function pointer will return a ctypes call-object (perhaps). Unpacking
|
||||||
ignored. Unpacking a named-object will return a Python class with
|
a bit will return a Python Bool. White-space in the struct-string
|
||||||
attributes having those names.
|
syntax will be ignored if it isn't already. Unpacking a named-object
|
||||||
|
will return some kind of named-tuple-like object that acts like a
|
||||||
|
tuple but whose entries can also be accessed by name. Unpacking a
|
||||||
|
nested structure will return a nested tuple.
|
||||||
|
|
||||||
Endian-specification ('=','>','<') is also allowed inside the
|
Endian-specification ('!', '@','=','>','<', '^') is also allowed
|
||||||
string so that it can change if needed. The previously-specified
|
inside the string so that it can change if needed. The
|
||||||
endian string is in force until changed. The default endian is '='.
|
previously-specified endian string is in force until changed. The
|
||||||
|
default endian is '@' which means native data-types and alignment. If
|
||||||
|
un-aligned, native data-types are requested, then the endian
|
||||||
|
specification is '^'.
|
||||||
|
|
||||||
According to the struct-module, a number can preceed a character
|
According to the struct-module, a number can preceed a character
|
||||||
code to specify how many of that type there are. The
|
code to specify how many of that type there are. The
|
||||||
|
@ -551,16 +701,21 @@ Examples of Data-Format Descriptions
|
||||||
====================================
|
====================================
|
||||||
|
|
||||||
Here are some examples of C-structures and how they would be
|
Here are some examples of C-structures and how they would be
|
||||||
represented using the struct-style syntax:
|
represented using the struct-style syntax.
|
||||||
|
|
||||||
|
<named> is the constructor for a named-tuple (not-specified yet).
|
||||||
|
|
||||||
float
|
float
|
||||||
'f'
|
'f' <--> Python float
|
||||||
complex double
|
complex double
|
||||||
'Zd'
|
'Zd' <--> Python complex
|
||||||
RGB Pixel data
|
RGB Pixel data
|
||||||
'BBB' or 'B:r: B:g: B:b:'
|
'BBB' <--> (int, int, int)
|
||||||
|
'B:r: B:g: B:b:' <--> <named>((int, int, int), ('r','g','b'))
|
||||||
|
|
||||||
Mixed endian (weird but possible)
|
Mixed endian (weird but possible)
|
||||||
'>i:big: <i:little:'
|
'>i:big: <i:little:' <--> <named>((int, int), ('big', 'little'))
|
||||||
|
|
||||||
Nested structure
|
Nested structure
|
||||||
::
|
::
|
||||||
|
|
||||||
|
@ -586,7 +741,9 @@ Nested array
|
||||||
int ival;
|
int ival;
|
||||||
double data[16*4];
|
double data[16*4];
|
||||||
}
|
}
|
||||||
'i:ival: (16,4)d:data:'
|
"""i:ival:
|
||||||
|
(16,4)d:data:
|
||||||
|
"""
|
||||||
|
|
||||||
|
|
||||||
Code to be affected
|
Code to be affected
|
||||||
|
@ -662,10 +819,12 @@ Examples
|
||||||
=========
|
=========
|
||||||
|
|
||||||
Ex. 1
|
Ex. 1
|
||||||
----------
|
-----------
|
||||||
|
|
||||||
This example shows how an image object that uses contiguous lines might expose its buffer.::
|
This example shows how an image object that uses contiguous lines might expose its buffer.::
|
||||||
|
|
||||||
|
::
|
||||||
|
|
||||||
struct rgba {
|
struct rgba {
|
||||||
unsigned char r, g, b, a;
|
unsigned char r, g, b, a;
|
||||||
};
|
};
|
||||||
|
@ -688,7 +847,9 @@ In order to access, say, the red value of the pixel at x=30, y=50, you'd use "li
|
||||||
|
|
||||||
So what does ImageObject's getbuffer do? Leaving error checking out::
|
So what does ImageObject's getbuffer do? Leaving error checking out::
|
||||||
|
|
||||||
int Image_getbuffer(PyObject *self, struct bufferinfo *view, int flags) {
|
::
|
||||||
|
|
||||||
|
int Image_getbuffer(PyObject *self, PyBuffer *view, int flags) {
|
||||||
|
|
||||||
static Py_ssize_t suboffsets[2] = { -1, 0 };
|
static Py_ssize_t suboffsets[2] = { -1, 0 };
|
||||||
|
|
||||||
|
@ -710,7 +871,7 @@ So what does ImageObject's getbuffer do? Leaving error checking out::
|
||||||
}
|
}
|
||||||
|
|
||||||
|
|
||||||
int Image_releasebuffer(PyObject *self, struct bufferinfo *view) {
|
int Image_releasebuffer(PyObject *self, PyBuffer *view) {
|
||||||
self->view_count--;
|
self->view_count--;
|
||||||
return 0;
|
return 0;
|
||||||
}
|
}
|
||||||
|
@ -721,33 +882,36 @@ Ex. 2
|
||||||
|
|
||||||
This example shows how an object that wants to expose a contiguous
|
This example shows how an object that wants to expose a contiguous
|
||||||
chunk of memory (which will never be re-allocated while the object is
|
chunk of memory (which will never be re-allocated while the object is
|
||||||
alive) would do that.::
|
alive) would do that.
|
||||||
|
|
||||||
int myobject_getbuffer(PyObject *self, struct bufferinfo *view, int flags) {
|
::
|
||||||
|
|
||||||
void *buf;
|
int myobject_getbuffer(PyObject *self, PyBuffer *view, int flags) {
|
||||||
Py_ssize_t len;
|
|
||||||
int readonly=0;
|
|
||||||
|
|
||||||
buf = /* Point to buffer */
|
void *buf;
|
||||||
len = /* Set to size of buffer */
|
Py_ssize_t len;
|
||||||
readonly = /* Set to 1 if readonly */
|
int readonly=0;
|
||||||
|
|
||||||
return PyObject_FillBufferInfo(view, buf, len, readonly, flags);
|
buf = /* Point to buffer */
|
||||||
|
len = /* Set to size of buffer */
|
||||||
|
readonly = /* Set to 1 if readonly */
|
||||||
|
|
||||||
|
return PyObject_FillBufferInfo(view, buf, len, readonly, flags);
|
||||||
}
|
}
|
||||||
|
|
||||||
/* No releasebuffer is necessary because the memory will never
|
/* No releasebuffer is necessary because the memory will never
|
||||||
be re-allocated so the locking mechanism is not needed
|
be re-allocated so the locking mechanism is not needed
|
||||||
*/
|
*/
|
||||||
|
|
||||||
Ex. 3
|
Ex. 3
|
||||||
-----------
|
-----------
|
||||||
|
|
||||||
A consumer that wants to only get a simple contiguous chunk of bytes
|
A consumer that wants to only get a simple contiguous chunk of bytes
|
||||||
from a Python object, obj would do the following::
|
from a Python object, obj would do the following:
|
||||||
|
|
||||||
|
::
|
||||||
|
|
||||||
struct bufferinfo view;
|
PyBuffer view;
|
||||||
int ret;
|
int ret;
|
||||||
|
|
||||||
if (PyObject_GetBuffer(obj, &view, Py_BUF_SIMPLE) < 0) {
|
if (PyObject_GetBuffer(obj, &view, Py_BUF_SIMPLE) < 0) {
|
||||||
|
@ -767,6 +931,35 @@ from a Python object, obj would do the following::
|
||||||
}
|
}
|
||||||
|
|
||||||
|
|
||||||
|
Ex. 4
|
||||||
|
-----------
|
||||||
|
|
||||||
|
A consumer that wants to be able to use any object's memory but is
|
||||||
|
writing an algorithm that only handle contiguous memory could do the following:
|
||||||
|
|
||||||
|
::
|
||||||
|
|
||||||
|
void *buf;
|
||||||
|
Py_ssize_t len;
|
||||||
|
char *format;
|
||||||
|
|
||||||
|
if (PyObject_GetContiguous(obj, &buf, &len, &format, 0) < 0) {
|
||||||
|
/* error return */
|
||||||
|
}
|
||||||
|
|
||||||
|
/* process memory pointed to by buffer if format is correct */
|
||||||
|
|
||||||
|
/* Optional:
|
||||||
|
|
||||||
|
if, after processing, we want to copy data from buffer back
|
||||||
|
into the the object
|
||||||
|
|
||||||
|
we could do
|
||||||
|
*/
|
||||||
|
|
||||||
|
if (PyObject_CopyToObject(obj, buf, len, 0) < 0) {
|
||||||
|
/* error return */
|
||||||
|
}
|
||||||
|
|
||||||
|
|
||||||
Copyright
|
Copyright
|
||||||
|
|
Loading…
Reference in New Issue