Memory (management)#

Devices#

classDevice:publicstd::enable_shared_from_this<Device >,publicarrow::util::EqualityComparable<Device >#

EXPERIMENTAL: Abstract interface for hardware devices.

This object represents a device with access to some memory spaces. When handling a Buffer or raw memory address, it allows deciding in which context the raw memory address should be interpreted (e.g. CPU-accessible memory, or embedded memory on some particular GPU).

Subclassed by arrow::CPUDevice, arrow::cuda::CudaDevice

Public Functions

virtualconstchar*type_name()const=0#

A shorthand for this device’s type.

The returned value is different for each device class, but is the same for all instances of a given class. It can be used as a replacement for RTTI.

virtualstd::stringToString()const=0#

A human-readable description of the device.

The returned value should be detailed enough to distinguish between different instances, where necessary.

virtualboolEquals(constDevice &)const=0#

Whether this instance points to the same device as another one.

inlinevirtualint64_tdevice_id()const#

A device ID to identify this device if there are multiple of this type.

If there is no "device_id" equivalent (such as for the main CPU device on non-numa systems) returns -1.

inlineboolis_cpu()const#

Whether this device is the main CPU device.

This shorthand method is very useful when deciding whether a memory address is CPU-accessible.

virtualstd::shared_ptr<MemoryManager >default_memory_manager()=0#

Return a MemoryManager instance tied to this device.

The returned instance uses default parameters for this device type’s MemoryManager implementation. Some devices also allow constructing MemoryManager instances with non-default parameters.

virtualDeviceAllocationTypedevice_type()const=0#

Return the DeviceAllocationType of this device.

inlinevirtualResult <std::shared_ptr<Stream >>MakeStream(unsignedintflags)#

Create a new device stream.

This should create the appropriate stream type for the device, derived from Device::Stream to allow for stream ordered events and memory allocations.

inlinevirtualResult <std::shared_ptr<Stream >>WrapStream(void*device_stream, Stream ::release_fn_trelease_fn)#

Wrap an existing device stream alongside a release function.

Parameters:
  • device_stream – a pointer to the stream to wrap

  • release_fn – a function to call during destruction, nullptr or a no-op function can be passed to indicate ownership is maintained externally

classStream#

EXPERIMENTAL: An opaque wrapper for Device-specific streams.

In essence this is just a wrapper around a void* to represent the standard concept of a stream/queue on a device. Derived classes should be trivially constructible from it’s device-specific counterparts.

Subclassed by arrow::cuda::CudaDevice::Stream

Public Functions

virtualStatus WaitEvent(constSyncEvent &)=0#

Make the stream wait on the provided event.

Tells the stream that it should wait until the synchronization event is completed without blocking the CPU.

virtualStatus Synchronize()const=0#

Blocks the current thread until a stream’s remaining tasks are completed.

classSyncEvent#

EXPERIMENTAL: An object that provides event/stream sync primitives.

Subclassed by arrow::cuda::CudaDevice::SyncEvent

Public Functions

virtualStatus Wait()=0#

Block until sync event is completed.

virtualStatus Record(constStream &)=0#

Record the wrapped event on the stream so it triggers the event when the stream gets to that point in its queue.

classCPUDevice:publicarrow::Device #

Public Functions

virtualconstchar*type_name()constoverride#

A shorthand for this device’s type.

The returned value is different for each device class, but is the same for all instances of a given class. It can be used as a replacement for RTTI.

virtualstd::stringToString()constoverride#

A human-readable description of the device.

The returned value should be detailed enough to distinguish between different instances, where necessary.

virtualboolEquals(constDevice &)constoverride#

Whether this instance points to the same device as another one.

inlinevirtualDeviceAllocationTypedevice_type()constoverride#

Return the DeviceAllocationType of this device.

virtualstd::shared_ptr<MemoryManager >default_memory_manager()override#

Return a MemoryManager instance tied to this device.

The returned instance uses default parameters for this device type’s MemoryManager implementation. Some devices also allow constructing MemoryManager instances with non-default parameters.

Public Static Functions

staticstd::shared_ptr<Device >Instance()#

Return the global CPUDevice instance.

staticstd::shared_ptr<MemoryManager >memory_manager(MemoryPool *pool)#

Create a MemoryManager.

The returned MemoryManager will use the given MemoryPool for allocations.

std::shared_ptr<MemoryManager >arrow::default_cpu_memory_manager()#

Return the default CPU MemoryManager instance.

The returned singleton instance uses the default MemoryPool. This function is a faster spelling of CPUDevice::Instance() ->default_memory_manager().

Memory Managers#

classMemoryManager:publicstd::enable_shared_from_this<MemoryManager >#

EXPERIMENTAL: An object that provides memory management primitives.

A MemoryManager is always tied to a particular Device instance. It can also have additional parameters (such as a MemoryPool to allocate CPU memory).

Subclassed by arrow::CPUMemoryManager, arrow::cuda::CudaMemoryManager

Public Functions

inlineconststd::shared_ptr<Device >&device()const#

The device this MemoryManager is tied to.

inlineboolis_cpu()const#

Whether this MemoryManager is tied to the main CPU device.

This shorthand method is very useful when deciding whether a memory address is CPU-accessible.

virtualResult <std::shared_ptr<io::RandomAccessFile >>GetBufferReader(std::shared_ptr<Buffer >buf)=0#

Create a RandomAccessFile to read a particular buffer.

The given buffer must be tied to this MemoryManager.

See also the Buffer::GetReader shorthand.

virtualResult <std::shared_ptr<io::OutputStream >>GetBufferWriter(std::shared_ptr<Buffer >buf)=0#

Create a OutputStream to write to a particular buffer.

The given buffer must be mutable and tied to this MemoryManager. The returned stream object writes into the buffer’s underlying memory (but it won’t resize it).

See also the Buffer::GetWriter shorthand.

virtualResult <std::unique_ptr<Buffer >>AllocateBuffer(int64_tsize)=0#

Allocate a (mutable) Buffer.

The buffer will be allocated in the device’s memory.

virtualResult <std::shared_ptr<Device ::SyncEvent >>MakeDeviceSyncEvent()#

Create a new SyncEvent.

This version should construct the appropriate event for the device and provide the unique_ptr with the correct deleter for the event type. If the device does not require or work with any synchronization, it is allowed for it to return a nullptr.

virtualResult <std::shared_ptr<Device ::SyncEvent >>WrapDeviceSyncEvent(void*sync_event, Device ::SyncEvent ::release_fn_trelease_sync_event)#

Wrap an event into a SyncEvent.

Parameters:
  • sync_event – passed in sync_event (should be a pointer to the appropriate type)

  • release_sync_event – destructor to free sync_event. nullptr may be passed to indicate that no destruction/freeing is necessary

Public Static Functions

staticResult <std::shared_ptr<Buffer >>CopyBuffer(conststd::shared_ptr<Buffer >&source, conststd::shared_ptr<MemoryManager >&to)#

Copy a Buffer to a destination MemoryManager.

See also the Buffer::Copy shorthand.

staticResult <std::unique_ptr<Buffer >>CopyNonOwned(constBuffer &source, conststd::shared_ptr<MemoryManager >&to)#

Copy a non-owned Buffer to a destination MemoryManager.

This is useful for cases where the source memory area is externally managed (its lifetime not tied to the source Buffer), otherwise please use CopyBuffer().

staticResult <std::shared_ptr<Buffer >>ViewBuffer(conststd::shared_ptr<Buffer >&source, conststd::shared_ptr<MemoryManager >&to)#

Make a no-copy Buffer view in a destination MemoryManager.

See also the Buffer::View shorthand.

staticStatus CopyBufferSliceToCPU(conststd::shared_ptr<Buffer >&buf, int64_toffset, int64_tlength, uint8_t*out_data)#

Copy a slice of a buffer into a CPU pointer.

classCPUMemoryManager:publicarrow::MemoryManager #

Public Functions

virtualResult <std::shared_ptr<io::RandomAccessFile >>GetBufferReader(std::shared_ptr<Buffer >buf)override#

Create a RandomAccessFile to read a particular buffer.

The given buffer must be tied to this MemoryManager.

See also the Buffer::GetReader shorthand.

virtualResult <std::shared_ptr<io::OutputStream >>GetBufferWriter(std::shared_ptr<Buffer >buf)override#

Create a OutputStream to write to a particular buffer.

The given buffer must be mutable and tied to this MemoryManager. The returned stream object writes into the buffer’s underlying memory (but it won’t resize it).

See also the Buffer::GetWriter shorthand.

virtualResult <std::unique_ptr<Buffer >>AllocateBuffer(int64_tsize)override#

Allocate a (mutable) Buffer.

The buffer will be allocated in the device’s memory.

inlineMemoryPool *pool()const#

Return the MemoryPool associated with this MemoryManager.

Buffers#

classBuffer#

Object containing a pointer to a piece of contiguous memory with a particular size.

Buffers have two related notions of length: size and capacity. Size is the number of bytes that might have valid data. Capacity is the number of bytes that were allocated for the buffer in total.

The Buffer base class does not own its memory, but subclasses often do.

The following invariant is always true: Size <= Capacity

Subclassed by arrow::MutableBuffer, arrow::cuda::CudaBuffer

Public Functions

inlineBuffer(constuint8_t*data, int64_tsize)#

Construct from buffer and size without copying memory.

Note

The passed memory must be kept alive through some other means

Parameters:
  • data[in] a memory buffer

  • size[in] buffer size

inlineexplicitBuffer(std::string_viewdata)#

Construct from string_view without copying memory.

Note

The memory viewed by data must not be deallocated in the lifetime of the Buffer; temporary rvalue strings must be stored in an lvalue somewhere

Parameters:

data[in] a string_view object

inlineBuffer(std::shared_ptr<Buffer >parent, constint64_toffset, constint64_tsize)#

An offset into data that is owned by another buffer, but we want to be able to retain a valid pointer to it even after other shared_ptr’s to the parent buffer have been destroyed.

This method makes no assertions about alignment or padding of the buffer but in general we expected buffers to be aligned and padded to 64 bytes. In the future we might add utility methods to help determine if a buffer satisfies this contract.

std::stringToHexString()#

Construct a new std::string with a hexadecimal representation of the buffer.

Returns:

std::string

boolEquals(constBuffer &other, int64_tnbytes)const#

Return true if both buffers are the same size and contain the same bytes up to the number of compared bytes.

boolEquals(constBuffer &other)const#

Return true if both buffers are the same size and contain the same bytes.

Result <std::shared_ptr<Buffer >>CopySlice(constint64_tstart, constint64_tnbytes, MemoryPool *pool=default_memory_pool ())const#

Copy a section of the buffer into a new Buffer.

inlinevoidZeroPadding()#

Zero bytes in padding, i.e. bytes between size_ and capacity_.

std::stringToString()const#

Copy buffer contents into a new std::string.

Note

Can throw std::bad_alloc if buffer is large

Returns:

std::string

inlineexplicitoperatorstd::string_view()const#

View buffer contents as a std::string_view.

Returns:

std::string_view

inlineconstuint8_t*data()const#

Return a pointer to the buffer’s data.

The buffer has to be a CPU buffer (is_cpu() is true). Otherwise, an assertion may be thrown or a null pointer may be returned.

To get the buffer’s data address regardless of its device, call address() .

template<typenameT>
inlineconstT *data_as()const#

Return a pointer to the buffer’s data cast to a specific type.

The buffer has to be a CPU buffer (is_cpu() is true). Otherwise, an assertion may be thrown or a null pointer may be returned.

template<typenameT>
inlineutil::span<constT >span_as()const#

Return the buffer’s data as a span.

inlineuint8_t*mutable_data()#

Return a writable pointer to the buffer’s data.

The buffer has to be a mutable CPU buffer (is_cpu() and is_mutable() are true). Otherwise, an assertion may be thrown or a null pointer may be returned.

To get the buffer’s mutable data address regardless of its device, call mutable_address() .

template<typenameT>
inlineT *mutable_data_as()#

Return a writable pointer to the buffer’s data cast to a specific type.

The buffer has to be a mutable CPU buffer (is_cpu() and is_mutable() are true). Otherwise, an assertion may be thrown or a null pointer may be returned.

template<typenameT>
inlineutil::span<T >mutable_span_as()#

Return the buffer’s mutable data as a span.

inlineuintptr_taddress()const#

Return the device address of the buffer’s data.

inlineuintptr_tmutable_address()const#

Return a writable device address to the buffer’s data.

The buffer has to be a mutable buffer (is_mutable() is true). Otherwise, an assertion may be thrown or 0 may be returned.

inlineint64_tsize()const#

Return the buffer’s size in bytes.

inlineint64_tcapacity()const#

Return the buffer’s capacity (number of allocated bytes)

inlineboolis_cpu()const#

Whether the buffer is directly CPU-accessible.

If this function returns true, you can read directly from the buffer’s data() pointer. Otherwise, you’ll have to View() or Copy() it.

inlineboolis_mutable()const#

Whether the buffer is mutable.

If this function returns true, you are allowed to modify buffer contents using the pointer returned by mutable_data() or mutable_address() .

Public Static Functions

staticstd::shared_ptr<Buffer >FromString(std::stringdata)#

Construct an immutable buffer that takes ownership of the contents of an std::string (without copying it).

Parameters:

data[in] a string to own

Returns:

a new Buffer instance

template<typenameT>
staticinlinestd::shared_ptr<Buffer >FromVector(std::vector<T >vec)#

Construct an immutable buffer that takes ownership of the contents of an std::vector (without copying it).

Only vectors of TrivialType objects (integers, floating point numbers, ...) can be wrapped by this function.

Parameters:

vec[in] a vector to own

Returns:

a new Buffer instance

template<typenameT,typenameSizeType=int64_t>
staticinlinestd::shared_ptr<Buffer >Wrap(constT *data, SizeType length)#

Create buffer referencing typed memory with some length without copying.

Parameters:
  • data[in] the typed memory as C array

  • length[in] the number of values in the array

Returns:

a new shared_ptr<Buffer>

template<typenameT>
staticinlinestd::shared_ptr<Buffer >Wrap(conststd::vector<T >&data)#

Create buffer referencing std::vector with some length without copying.

Parameters:

data[in] the vector to be referenced. If this vector is changed, the buffer may become invalid

Returns:

a new shared_ptr<Buffer>

staticResult <std::shared_ptr<io::RandomAccessFile >>GetReader(std::shared_ptr<Buffer >)#

Get a RandomAccessFile for reading a buffer.

The returned file object reads from this buffer’s underlying memory.

staticResult <std::shared_ptr<io::OutputStream >>GetWriter(std::shared_ptr<Buffer >)#

Get a OutputStream for writing to a buffer.

The buffer must be mutable. The returned stream object writes into the buffer’s underlying memory (but it won’t resize it).

staticResult <std::shared_ptr<Buffer >>Copy(std::shared_ptr<Buffer >source, conststd::shared_ptr<MemoryManager >&to)#

Copy buffer.

The buffer contents will be copied into a new buffer allocated by the given MemoryManager. This function supports cross-device copies.

staticResult <std::unique_ptr<Buffer >>CopyNonOwned(constBuffer &source, conststd::shared_ptr<MemoryManager >&to)#

Copy a non-owned buffer.

This is useful for cases where the source memory area is externally managed (its lifetime not tied to the source Buffer), otherwise please use Copy().

staticResult <std::shared_ptr<Buffer >>View(std::shared_ptr<Buffer >source, conststd::shared_ptr<MemoryManager >&to)#

View buffer.

Return a Buffer that reflects this buffer, seen potentially from another device, without making an explicit copy of the contents. The underlying mechanism is typically implemented by the kernel or device driver, and may involve lazy caching of parts of the buffer contents on the destination device’s memory.

If a non-copy view is unsupported for the buffer on the given device, nullptr is returned. An error can be returned if some low-level operation fails (such as an out-of-memory condition).

staticResult <std::shared_ptr<Buffer >>ViewOrCopy(std::shared_ptr<Buffer >source, conststd::shared_ptr<MemoryManager >&to)#

View or copy buffer.

Try to view buffer contents on the given MemoryManager’s device, but fall back to copying if a no-copy view isn’t supported.

classMutableBuffer:publicarrow::Buffer #

A Buffer whose contents can be mutated.

May or may not own its data.

Subclassed by arrow::ResizableBuffer, arrow::cuda::CudaHostBuffer

Public Static Functions

template<typenameT,typenameSizeType=int64_t>
staticinlinestd::shared_ptr<Buffer >Wrap(T *data, SizeType length)#

Create buffer referencing typed memory with some length.

Parameters:
  • data[in] the typed memory as C array

  • length[in] the number of values in the array

Returns:

a new shared_ptr<Buffer>

classResizableBuffer:publicarrow::MutableBuffer #

A mutable buffer that can be resized.

Public Functions

virtualStatus Resize(constint64_tnew_size, boolshrink_to_fit)=0#

Change buffer reported size to indicated size, allocating memory if necessary.

This will ensure that the capacity of the buffer is a multiple of 64 bytes as defined in Layout.md. Consider using ZeroPadding afterwards, to conform to the Arrow layout specification.

Parameters:
  • new_size – The new size for the buffer.

  • shrink_to_fit – Whether to shrink the capacity if new size < current size

virtualStatus Reserve(constint64_tnew_capacity)=0#

Ensure that buffer has enough memory allocated to fit the indicated capacity (and meets the 64 byte padding requirement in Layout.md).

It does not change buffer’s reported size and doesn’t zero the padding.

Memory Pools#

MemoryPool *arrow::default_memory_pool()#

Return the process-wide default memory pool.

Status arrow::jemalloc_memory_pool(MemoryPool **out)#

Return a process-wide memory pool based on jemalloc.

May return NotImplemented if jemalloc is not available.

Status arrow::mimalloc_memory_pool(MemoryPool **out)#

Return a process-wide memory pool based on mimalloc.

May return NotImplemented if mimalloc is not available.

MemoryPool *arrow::system_memory_pool()#

Return a process-wide memory pool based on the system allocator.

classMemoryPool#

Base class for memory allocation on the CPU.

Besides tracking the number of allocated bytes, the allocator also should take care of the required 64-byte alignment.

Subclassed by arrow::LoggingMemoryPool, arrow::ProxyMemoryPool, arrow::stl::STLMemoryPool< Allocator >

Public Functions

inlineStatus Allocate(int64_tsize, uint8_t**out)#

Allocate a new memory region of at least size bytes.

The allocated region shall be 64-byte aligned.

virtualStatus Allocate(int64_tsize, int64_talignment, uint8_t**out)=0#

Allocate a new memory region of at least size bytes aligned to alignment.

virtualStatus Reallocate(int64_told_size, int64_tnew_size, int64_talignment, uint8_t**ptr)=0#

Resize an already allocated memory section.

As by default most default allocators on a platform don’t support aligned reallocation, this function can involve a copy of the underlying data.

virtualvoidFree(uint8_t*buffer, int64_tsize, int64_talignment)=0#

Free an allocated region.

Parameters:
  • buffer – Pointer to the start of the allocated memory region

  • size – Allocated size located at buffer. An allocator implementation may use this for tracking the amount of allocated bytes as well as for faster deallocation if supported by its backend.

  • alignment – The alignment of the allocation. Defaults to 64 bytes.

inlinevirtualvoidReleaseUnused()#

Return unused memory to the OS.

Only applies to allocators that hold onto unused memory. This will be best effort, a memory pool may not implement this feature or may be unable to fulfill the request due to fragmentation.

inlinevirtualvoidPrintStats()#

Print statistics.

Print allocation statistics on stderr. The output format is implementation-specific. Not all memory pools implement this method.

virtualint64_tbytes_allocated()const=0#

The number of bytes that were allocated and not yet free’d through this allocator.

virtualint64_tmax_memory()const#

Return peak memory allocation in this memory pool.

Returns:

Maximum bytes allocated. If not known (or not implemented), returns -1

virtualint64_ttotal_bytes_allocated()const=0#

The number of bytes that were allocated.

virtualint64_tnum_allocations()const=0#

The number of allocations or reallocations that were requested.

virtualstd::stringbackend_name()const=0#

The name of the backend used by this MemoryPool (e.g. "system" or "jemalloc").

Public Static Functions

staticstd::unique_ptr<MemoryPool >CreateDefault()#

EXPERIMENTAL. Create a new instance of the default MemoryPool.

classLoggingMemoryPool:publicarrow::MemoryPool #

Public Functions

virtualStatus Allocate(int64_tsize, int64_talignment, uint8_t**out)override#

Allocate a new memory region of at least size bytes aligned to alignment.

virtualStatus Reallocate(int64_told_size, int64_tnew_size, int64_talignment, uint8_t**ptr)override#

Resize an already allocated memory section.

As by default most default allocators on a platform don’t support aligned reallocation, this function can involve a copy of the underlying data.

virtualvoidFree(uint8_t*buffer, int64_tsize, int64_talignment)override#

Free an allocated region.

Parameters:
  • buffer – Pointer to the start of the allocated memory region

  • size – Allocated size located at buffer. An allocator implementation may use this for tracking the amount of allocated bytes as well as for faster deallocation if supported by its backend.

  • alignment – The alignment of the allocation. Defaults to 64 bytes.

virtualvoidReleaseUnused()override#

Return unused memory to the OS.

Only applies to allocators that hold onto unused memory. This will be best effort, a memory pool may not implement this feature or may be unable to fulfill the request due to fragmentation.

virtualvoidPrintStats()override#

Print statistics.

Print allocation statistics on stderr. The output format is implementation-specific. Not all memory pools implement this method.

virtualint64_tbytes_allocated()constoverride#

The number of bytes that were allocated and not yet free’d through this allocator.

virtualint64_tmax_memory()constoverride#

Return peak memory allocation in this memory pool.

Returns:

Maximum bytes allocated. If not known (or not implemented), returns -1

virtualint64_ttotal_bytes_allocated()constoverride#

The number of bytes that were allocated.

virtualint64_tnum_allocations()constoverride#

The number of allocations or reallocations that were requested.

virtualstd::stringbackend_name()constoverride#

The name of the backend used by this MemoryPool (e.g. "system" or "jemalloc").

inlineStatus Allocate(int64_tsize, uint8_t**out)#

Allocate a new memory region of at least size bytes.

The allocated region shall be 64-byte aligned.

virtualStatus Allocate(int64_tsize, int64_talignment, uint8_t**out)=0

Allocate a new memory region of at least size bytes aligned to alignment.

virtualvoidFree(uint8_t*buffer, int64_tsize, int64_talignment)=0

Free an allocated region.

Parameters:
  • buffer – Pointer to the start of the allocated memory region

  • size – Allocated size located at buffer. An allocator implementation may use this for tracking the amount of allocated bytes as well as for faster deallocation if supported by its backend.

  • alignment – The alignment of the allocation. Defaults to 64 bytes.

virtualStatus Reallocate(int64_told_size, int64_tnew_size, int64_talignment, uint8_t**ptr)=0

Resize an already allocated memory section.

As by default most default allocators on a platform don’t support aligned reallocation, this function can involve a copy of the underlying data.

classProxyMemoryPool:publicarrow::MemoryPool #

Derived class for memory allocation.

Tracks the number of bytes and maximum memory allocated through its direct calls. Actual allocation is delegated to MemoryPool class.

Public Functions

virtualStatus Allocate(int64_tsize, int64_talignment, uint8_t**out)override#

Allocate a new memory region of at least size bytes aligned to alignment.

virtualStatus Reallocate(int64_told_size, int64_tnew_size, int64_talignment, uint8_t**ptr)override#

Resize an already allocated memory section.

As by default most default allocators on a platform don’t support aligned reallocation, this function can involve a copy of the underlying data.

virtualvoidFree(uint8_t*buffer, int64_tsize, int64_talignment)override#

Free an allocated region.

Parameters:
  • buffer – Pointer to the start of the allocated memory region

  • size – Allocated size located at buffer. An allocator implementation may use this for tracking the amount of allocated bytes as well as for faster deallocation if supported by its backend.

  • alignment – The alignment of the allocation. Defaults to 64 bytes.

virtualvoidReleaseUnused()override#

Return unused memory to the OS.

Only applies to allocators that hold onto unused memory. This will be best effort, a memory pool may not implement this feature or may be unable to fulfill the request due to fragmentation.

virtualvoidPrintStats()override#

Print statistics.

Print allocation statistics on stderr. The output format is implementation-specific. Not all memory pools implement this method.

virtualint64_tbytes_allocated()constoverride#

The number of bytes that were allocated and not yet free’d through this allocator.

virtualint64_tmax_memory()constoverride#

Return peak memory allocation in this memory pool.

Returns:

Maximum bytes allocated. If not known (or not implemented), returns -1

virtualint64_ttotal_bytes_allocated()constoverride#

The number of bytes that were allocated.

virtualint64_tnum_allocations()constoverride#

The number of allocations or reallocations that were requested.

virtualstd::stringbackend_name()constoverride#

The name of the backend used by this MemoryPool (e.g. "system" or "jemalloc").

inlineStatus Allocate(int64_tsize, uint8_t**out)#

Allocate a new memory region of at least size bytes.

The allocated region shall be 64-byte aligned.

virtualStatus Allocate(int64_tsize, int64_talignment, uint8_t**out)=0

Allocate a new memory region of at least size bytes aligned to alignment.

virtualvoidFree(uint8_t*buffer, int64_tsize, int64_talignment)=0

Free an allocated region.

Parameters:
  • buffer – Pointer to the start of the allocated memory region

  • size – Allocated size located at buffer. An allocator implementation may use this for tracking the amount of allocated bytes as well as for faster deallocation if supported by its backend.

  • alignment – The alignment of the allocation. Defaults to 64 bytes.

virtualStatus Reallocate(int64_told_size, int64_tnew_size, int64_talignment, uint8_t**ptr)=0

Resize an already allocated memory section.

As by default most default allocators on a platform don’t support aligned reallocation, this function can involve a copy of the underlying data.

std::vector<std::string>arrow::SupportedMemoryBackendNames()#

Return the names of the backends supported by this Arrow build.

Allocation Functions#

These functions allocate a buffer from a particular memory pool.

Result<std::unique_ptr<Buffer>>AllocateBuffer(constint64_tsize, MemoryPool*pool=NULLPTR)#

Allocate a fixed size mutable buffer from a memory pool, zero its padding.

Parameters:
  • size[in] size of buffer to allocate

  • pool[in] a memory pool

Result<std::unique_ptr<Buffer>>AllocateBuffer(constint64_tsize, int64_talignment, MemoryPool*pool=NULLPTR)#
Result<std::unique_ptr<ResizableBuffer>>AllocateResizableBuffer(constint64_tsize, MemoryPool*pool=NULLPTR)#

Allocate a resizeable buffer from a memory pool, zero its padding.

Parameters:
  • size[in] size of buffer to allocate

  • pool[in] a memory pool

Result<std::unique_ptr<ResizableBuffer>>AllocateResizableBuffer(constint64_tsize, constint64_talignment, MemoryPool*pool=NULLPTR)#
Result<std::shared_ptr<Buffer>>AllocateBitmap(int64_tlength, MemoryPool*pool=NULLPTR)#

Allocate a bitmap buffer from a memory pool no guarantee on values is provided.

Parameters:
  • length[in] size in bits of bitmap to allocate

  • pool[in] memory pool to allocate memory from

Result<std::shared_ptr<Buffer>>AllocateEmptyBitmap(int64_tlength, MemoryPool*pool=NULLPTR)#

Allocate a zero-initialized bitmap buffer from a memory pool.

Parameters:
  • length[in] size in bits of bitmap to allocate

  • pool[in] memory pool to allocate memory from

Result<std::shared_ptr<Buffer>>AllocateEmptyBitmap(int64_tlength, int64_talignment, MemoryPool*pool=NULLPTR)#
Result<std::shared_ptr<Buffer>>ConcatenateBuffers(constBufferVector&buffers, MemoryPool*pool=NULLPTR)#

Concatenate multiple buffers into a single buffer.

Parameters:
  • buffers[in] to be concatenated

  • pool[in] memory pool to allocate the new buffer from

Slicing#

staticinlinestd::shared_ptr<Buffer>SliceBuffer(std::shared_ptr<Buffer>buffer, constint64_toffset, constint64_tlength)#

Construct a view on a buffer at the given offset and length.

This function cannot fail and does not check for errors (except in debug builds)

staticinlinestd::shared_ptr<Buffer>SliceBuffer(std::shared_ptr<Buffer>buffer, constint64_toffset)#

Construct a view on a buffer at the given offset, up to the buffer’s end.

This function cannot fail and does not check for errors (except in debug builds)

Result<std::shared_ptr<Buffer>>SliceBufferSafe(std::shared_ptr<Buffer>buffer, int64_toffset)#

Input-checking version of SliceBuffer.

An Invalid Status is returned if the requested slice falls out of bounds.

Result<std::shared_ptr<Buffer>>SliceBufferSafe(std::shared_ptr<Buffer>buffer, int64_toffset, int64_tlength)#

Input-checking version of SliceBuffer.

An Invalid Status is returned if the requested slice falls out of bounds. Note that unlike SliceBuffer, length isn’t clamped to the available buffer size.

std::shared_ptr<Buffer>SliceMutableBuffer(std::shared_ptr<Buffer>buffer, constint64_toffset, constint64_tlength)#

Like SliceBuffer, but construct a mutable buffer slice.

If the parent buffer is not mutable, behavior is undefined (it may abort in debug builds).

staticinlinestd::shared_ptr<Buffer>SliceMutableBuffer(std::shared_ptr<Buffer>buffer, constint64_toffset)#

Like SliceBuffer, but construct a mutable buffer slice.

If the parent buffer is not mutable, behavior is undefined (it may abort in debug builds).

Result<std::shared_ptr<Buffer>>SliceMutableBufferSafe(std::shared_ptr<Buffer>buffer, int64_toffset)#

Input-checking version of SliceMutableBuffer.

An Invalid Status is returned if the requested slice falls out of bounds.

Result<std::shared_ptr<Buffer>>SliceMutableBufferSafe(std::shared_ptr<Buffer>buffer, int64_toffset, int64_tlength)#

Input-checking version of SliceMutableBuffer.

An Invalid Status is returned if the requested slice falls out of bounds. Note that unlike SliceBuffer, length isn’t clamped to the available buffer size.

Buffer Builders#

classBufferBuilder#

A class for incrementally building a contiguous chunk of in-memory data.

Public Functions

inlineexplicitBufferBuilder(std::shared_ptr<ResizableBuffer >buffer, MemoryPool *pool=default_memory_pool (), int64_talignment=kDefaultBufferAlignment)#

Constructs new Builder that will start using the provided buffer until Finish/Reset are called.

The buffer is not resized.

inlineStatus Resize(constint64_tnew_capacity, boolshrink_to_fit=true)#

Resize the buffer to the nearest multiple of 64 bytes.

Parameters:
  • new_capacity – the new capacity of the of the builder. Will be rounded up to a multiple of 64 bytes for padding

  • shrink_to_fit – if new capacity is smaller than the existing, reallocate internal buffer. Set to false to avoid reallocations when shrinking the builder.

Returns:

Status

inlineStatus Reserve(constint64_tadditional_bytes)#

Ensure that builder can accommodate the additional number of bytes without the need to perform allocations.

Parameters:

additional_bytes[in] number of additional bytes to make space for

Returns:

Status

inlineStatus Append(constvoid*data, constint64_tlength)#

Append the given data to the buffer.

The buffer is automatically expanded if necessary.

inlineStatus Append(std::string_viewv)#

Append the given data to the buffer.

The buffer is automatically expanded if necessary.

inlineStatus Append(constint64_tnum_copies, uint8_tvalue)#

Append copies of a value to the buffer.

The buffer is automatically expanded if necessary.

inlineStatus Finish(std::shared_ptr<Buffer >*out, boolshrink_to_fit=true)#

Return result of builder as a Buffer object.

The builder is reset and can be reused afterwards.

Parameters:
  • out[out] the finalized Buffer object

  • shrink_to_fit – if the buffer size is smaller than its capacity, reallocate to fit more tightly in memory. Set to false to avoid a reallocation, at the expense of potentially more memory consumption.

Returns:

Status

inlineResult <std::shared_ptr<Buffer >>FinishWithLength(int64_tfinal_length, boolshrink_to_fit=true)#

Like Finish, but override the final buffer size.

This is useful after writing data directly into the builder memory without calling the Append methods (basically, when using BufferBuilder mostly for memory allocation).

inlinevoidRewind(int64_tposition)#

Set size to a smaller value without modifying builder contents.

For reusable BufferBuilder classes

Parameters:

position[in] must be non-negative and less than or equal to the current length()

Public Static Functions

staticinlineint64_tGrowByFactor(int64_tcurrent_capacity, int64_tnew_capacity)#

Return a capacity expanded by the desired growth factor.

template<typenameT,typenameEnable=void>
classTypedBufferBuilder#

STL Integration#

template<classT>
classallocator#

A STL allocator delegating allocations to a Arrow MemoryPool.

Public Functions

inlineallocator()noexcept#

Construct an allocator from the default MemoryPool.

inlineexplicitallocator(MemoryPool *pool)noexcept#

Construct an allocator from the given MemoryPool.

template<classU>
structrebind#
template<typenameAllocator=std::allocator<uint8_t>>
classSTLMemoryPool:publicarrow::MemoryPool #

A MemoryPool implementation delegating allocations to a STL allocator.

Note that STL allocators don’t provide a resizing operation, and therefore any buffer resizes will do a full reallocation and copy.

Public Functions

inlineexplicitSTLMemoryPool(constAllocator &alloc)#

Construct a memory pool from the given allocator.

inlinevirtualStatus Allocate(int64_tsize, int64_t, uint8_t**out)override#

Allocate a new memory region of at least size bytes aligned to alignment.

inlinevirtualStatus Reallocate(int64_told_size, int64_tnew_size, int64_t, uint8_t**ptr)override#

Resize an already allocated memory section.

As by default most default allocators on a platform don’t support aligned reallocation, this function can involve a copy of the underlying data.

inlinevirtualvoidFree(uint8_t*buffer, int64_tsize, int64_t)override#

Free an allocated region.

Parameters:
  • buffer – Pointer to the start of the allocated memory region

  • size – Allocated size located at buffer. An allocator implementation may use this for tracking the amount of allocated bytes as well as for faster deallocation if supported by its backend.

  • alignment – The alignment of the allocation. Defaults to 64 bytes.

inlinevirtualint64_tbytes_allocated()constoverride#

The number of bytes that were allocated and not yet free’d through this allocator.

inlinevirtualint64_tmax_memory()constoverride#

Return peak memory allocation in this memory pool.

Returns:

Maximum bytes allocated. If not known (or not implemented), returns -1

inlinevirtualint64_ttotal_bytes_allocated()constoverride#

The number of bytes that were allocated.

inlinevirtualint64_tnum_allocations()constoverride#

The number of allocations or reallocations that were requested.

inlinevirtualstd::stringbackend_name()constoverride#

The name of the backend used by this MemoryPool (e.g. "system" or "jemalloc").

inlineStatus Allocate(int64_tsize, uint8_t**out)#

Allocate a new memory region of at least size bytes.

The allocated region shall be 64-byte aligned.

virtualStatus Allocate(int64_tsize, int64_talignment, uint8_t**out)=0

Allocate a new memory region of at least size bytes aligned to alignment.

virtualvoidFree(uint8_t*buffer, int64_tsize, int64_talignment)=0

Free an allocated region.

Parameters:
  • buffer – Pointer to the start of the allocated memory region

  • size – Allocated size located at buffer. An allocator implementation may use this for tracking the amount of allocated bytes as well as for faster deallocation if supported by its backend.

  • alignment – The alignment of the allocation. Defaults to 64 bytes.

virtualStatus Reallocate(int64_told_size, int64_tnew_size, int64_talignment, uint8_t**ptr)=0

Resize an already allocated memory section.

As by default most default allocators on a platform don’t support aligned reallocation, this function can involve a copy of the underlying data.