Composite Column Types¶
Sets of columns can be associated with a single user-defined datatype, which in modern use is normally a Python dataclass. The ORM provides a single attribute which represents the group of columns using the class you provide.
A simple example represents pairs of Integer
columns as a
Point
object, with attributes .x
and .y
. Using a
dataclass, these attributes are defined with the corresponding int
Python type:
importdataclasses @dataclasses.dataclass classPoint: x: int y: int
Non-dataclass forms are also accepted, but require additional methods to be implemented. For an example using a non-dataclass class, see the section Using Legacy Non-Dataclasses.
Added in version 2.0: The composite()
construct fully supports
Python dataclasses including the ability to derive mapped column datatypes
from the composite class.
We will create a mapping to a table vertices
, which represents two points
as x1/y1
and x2/y2
. The Point
class is associated with
the mapped columns using the composite()
construct.
The example below illustrates the most modern form of composite()
as
used with a fully
Annotated Declarative Table
configuration. mapped_column()
constructs representing each column
are passed directly to composite()
, indicating zero or more aspects
of the columns to be generated, in this case the names; the
composite()
construct derives the column types (in this case
int
, corresponding to Integer
) from the dataclass directly:
fromsqlalchemy.ormimport DeclarativeBase, Mapped fromsqlalchemy.ormimport composite, mapped_column classBase(DeclarativeBase): pass classVertex(Base): __tablename__ = "vertices" id: Mapped[int] = mapped_column(primary_key=True) start: Mapped[Point] = composite(mapped_column("x1"), mapped_column("y1")) end: Mapped[Point] = composite(mapped_column("x2"), mapped_column("y2")) def__repr__(self): return f"Vertex(start={self.start}, end={self.end})"
Tip
In the example above the columns that represent the composites
(x1
, y1
, etc.) are also accessible on the class but are not
correctly understood by type checkers.
If accessing the single columns is important they can be explicitly declared,
as shown in Map columns directly, pass attribute names to composite.
The above mapping would correspond to a CREATE TABLE statement as:
>>> fromsqlalchemy.schemaimport CreateTable >>> print(CreateTable(Vertex.__table__))CREATETABLEvertices( idINTEGERNOTNULL, x1INTEGERNOTNULL, y1INTEGERNOTNULL, x2INTEGERNOTNULL, y2INTEGERNOTNULL, PRIMARYKEY(id) )
Working with Mapped Composite Column Types¶
With a mapping as illustrated in the top section, we can work with the
Vertex
class, where the .start
and .end
attributes will
transparently refer to the columns referenced by the Point
class, as
well as with instances of the Vertex
class, where the .start
and
.end
attributes will refer to instances of the Point
class. The x1
,
y1
, x2
, and y2
columns are handled transparently:
Persisting Point objects
We can create a
Vertex
object, assignPoint
objects as members, and they will be persisted as expected:>>> v = Vertex(start=Point(3, 4), end=Point(5, 6)) >>> session.add(v) >>> session.commit()
BEGIN(implicit) INSERTINTOvertices(x1,y1,x2,y2)VALUES(?,?,?,?) [generatedin...](3,4,5,6) COMMITSelecting Point objects as columns
composite()
will allow theVertex.start
andVertex.end
attributes to behave like a single SQL expression to as much an extent as possible when using the ORMSession
(including the legacyQuery
object) to selectPoint
objects:>>> stmt = select(Vertex.start, Vertex.end) >>> session.execute(stmt).all()
SELECTvertices.x1,vertices.y1,vertices.x2,vertices.y2 FROMvertices [...]()[(Point(x=3, y=4), Point(x=5, y=6))]Comparing Point objects in SQL expressions
The
Vertex.start
andVertex.end
attributes may be used in WHERE criteria and similar, using ad-hocPoint
objects for comparisons:>>> stmt = select(Vertex).where(Vertex.start == Point(3, 4)).where(Vertex.end < Point(7, 8)) >>> session.scalars(stmt).all()
SELECTvertices.id,vertices.x1,vertices.y1,vertices.x2,vertices.y2 FROMvertices WHEREvertices.x1=?ANDvertices.y1=?ANDvertices.x2<?ANDvertices.y2<? [...](3,4,7,8)[Vertex(Point(x=3, y=4), Point(x=5, y=6))]Added in version 2.0:
composite()
constructs now support "ordering" comparisons such as<
,>=
, and similar, in addition to the already-present support for==
,!=
.Tip
The "ordering" comparison above using the "less than" operator (
<
) as well as the "equality" comparison using==
, when used to generate SQL expressions, are implemented by theComparator
class, and don’t make use of the comparison methods on the composite class itself, e.g. the__lt__()
or__eq__()
methods. From this it follows that thePoint
dataclass above also need not implement the dataclassesorder=True
parameter for the above SQL operations to work. The section Redefining Comparison Operations for Composites contains background on how to customize the comparison operations.Updating Point objects on Vertex Instances
By default, the
Point
object must be replaced by a new object for changes to be detected:>>> v1 = session.scalars(select(Vertex)).one()
SELECTvertices.id,vertices.x1,vertices.y1,vertices.x2,vertices.y2 FROMvertices [...]()>>> v1.end = Point(x=10, y=14) >>> session.commit()UPDATEverticesSETx2=?,y2=?WHEREvertices.id=? [...](10,14,1) COMMITIn order to allow in place changes on the composite object, the Mutation Tracking extension must be used. See the section Establishing Mutability on Composites for examples.
Returning None for a Composite¶
The composite attribute by default always returns an object when accessed,
regardless of the values of its columns. In the example below, a new
Vertex
is created with no parameters; all column attributes x1
, y1
,
x2
, and y2
start out as None
. A Point
object with None
values will be returned on access:
>>> v1 = Vertex() >>> v1.start Point(x=None, y=None) >>> v1.end Point(x=None, y=None)
This behavior is consistent with persistent objects and individual attribute queries as well:
>>> start = session.scalars( ... select(Point.start).where(Point.x1 == None, Point.y1 == None) ... ).first() >>> start Point(x=None, y=None)
To support an optional Point
field, we can make use
of the composite.return_none_on
parameter, which allows
the behavior to be customized with a lambda; this parameter is set automatically if we
declare our composite fields as optional:
classVertex(Base): __tablename__ = "vertices" id: Mapped[int] = mapped_column(primary_key=True) start: Mapped[Point | None] = composite(mapped_column("x1"), mapped_column("y1")) end: Mapped[Point | None] = composite(mapped_column("x2"), mapped_column("y2"))
Above, the composite.return_none_on
parameter is set equivalently as:
composite( mapped_column("x1"), mapped_column("y1"), return_none_on=lambda *args: all(arg is None for arg in args), )
With the above setting, a value of None
is returned if the columns themselves
are both None
:
>>> v1 = Vertex() >>> v1.start None >>> start = session.scalars( ... select(Point.start).where(Point.x1 == None, Point.y1 == None) ... ).first() >>> start None
Changed in version 2.1: - added the composite.return_none_on
parameter with
ORM Annotated Declarative support.
Other mapping forms for composites¶
The composite()
construct may be passed the relevant columns
using a mapped_column()
construct, a Column
,
or the string name of an existing mapped column. The following examples
illustrate an equivalent mapping as that of the main section above.
Map columns directly, then pass to composite¶
Here we pass the existing mapped_column()
instances to the
composite()
construct, as in the non-annotated example below
where we also pass the Point
class as the first argument to
composite()
:
fromsqlalchemyimport Integer fromsqlalchemy.ormimport mapped_column, composite classVertex(Base): __tablename__ = "vertices" id = mapped_column(Integer, primary_key=True) x1 = mapped_column(Integer) y1 = mapped_column(Integer) x2 = mapped_column(Integer) y2 = mapped_column(Integer) start = composite(Point, x1, y1) end = composite(Point, x2, y2)
Map columns directly, pass attribute names to composite¶
We can write the same example above using more annotated forms where we have
the option to pass attribute names to composite()
instead of
full column constructs:
fromsqlalchemy.ormimport mapped_column, composite, Mapped classVertex(Base): __tablename__ = "vertices" id: Mapped[int] = mapped_column(primary_key=True) x1: Mapped[int] y1: Mapped[int] x2: Mapped[int] y2: Mapped[int] start: Mapped[Point] = composite("x1", "y1") end: Mapped[Point] = composite("x2", "y2")
Imperative mapping and imperative table¶
When using imperative table or
fully imperative mappings, we have access
to Column
objects directly. These may be passed to
composite()
as well, as in the imperative example below:
mapper_registry.map_imperatively( Vertex, vertices_table, properties={ "start": composite(Point, vertices_table.c.x1, vertices_table.c.y1), "end": composite(Point, vertices_table.c.x2, vertices_table.c.y2), }, )
Using Legacy Non-Dataclasses¶
If not using a dataclass, the requirements for the custom datatype class are
that it have a constructor
which accepts positional arguments corresponding to its column format, and
also provides a method __composite_values__()
which returns the state of
the object as a list or tuple, in order of its column-based attributes. It
also should supply adequate __eq__()
and __ne__()
methods which test
the equality of two instances.
To illustrate the equivalent Point
class from the main section
not using a dataclass:
classPoint: def__init__(self, x, y): self.x = x self.y = y def__composite_values__(self): return self.x, self.y def__repr__(self): return f"Point(x={self.x!r}, y={self.y!r})" def__eq__(self, other): return isinstance(other, Point) and other.x == self.x and other.y == self.y def__ne__(self, other): return not self.__eq__(other)
Usage with composite()
then proceeds where the columns to be
associated with the Point
class must also be declared with explicit
types, using one of the forms at Other mapping forms for composites.
Tracking In-Place Mutations on Composites¶
In-place changes to an existing composite value are
not tracked automatically. Instead, the composite class needs to provide
events to its parent object explicitly. This task is largely automated
via the usage of the MutableComposite
mixin, which uses events
to associate each user-defined composite object with all parent associations.
Please see the example in Establishing Mutability on Composites.
Redefining Comparison Operations for Composites¶
The "equals" comparison operation by default produces an AND of all
corresponding columns equated to one another. This can be changed using
the comparator_factory
argument to composite()
, where we
specify a custom Comparator
class
to define existing or new operations.
Below we illustrate the "greater than" operator, implementing
the same expression that the base "greater than" does:
importdataclasses fromsqlalchemy.ormimport composite fromsqlalchemy.ormimport CompositeProperty fromsqlalchemy.ormimport DeclarativeBase fromsqlalchemy.ormimport Mapped fromsqlalchemy.ormimport mapped_column fromsqlalchemy.sqlimport and_ @dataclasses.dataclass classPoint: x: int y: int classPointComparator(CompositeProperty.Comparator): def__gt__(self, other): """redefine the 'greater than' operation""" return and_( *[ a > b for a, b in zip( self.__clause_element__().clauses, dataclasses.astuple(other), ) ] ) classBase(DeclarativeBase): pass classVertex(Base): __tablename__ = "vertices" id: Mapped[int] = mapped_column(primary_key=True) start: Mapped[Point] = composite( mapped_column("x1"), mapped_column("y1"), comparator_factory=PointComparator ) end: Mapped[Point] = composite( mapped_column("x2"), mapped_column("y2"), comparator_factory=PointComparator )
Since Point
is a dataclass, we may make use of
dataclasses.astuple()
to get a tuple form of Point
instances.
The custom comparator then returns the appropriate SQL expression:
>>> print(Vertex.start > Point(5, 6))vertices.x1>:x1_1ANDvertices.y1>:y1_1
Nesting Composites¶
Composite objects can be defined to work in simple nested schemes, by redefining behaviors within the composite class to work as desired, then mapping the composite class to the full length of individual columns normally. This requires that additional methods to move between the "nested" and "flat" forms are defined.
Below we reorganize the Vertex
class to itself be a composite object which
refers to Point
objects. Vertex
and Point
can be dataclasses,
however we will add a custom construction method to Vertex
that can be used
to create new Vertex
objects given four column values, which will will
arbitrarily name _generate()
and define as a classmethod so that we can
make new Vertex
objects by passing values to the Vertex._generate()
method.
We will also implement the __composite_values__()
method, which is a fixed
name recognized by the composite()
construct (introduced previously
at Using Legacy Non-Dataclasses) that indicates a standard way of
receiving the object as a flat tuple of column values, which in this case will
supersede the usual dataclass-oriented methodology.
With our custom _generate()
constructor and
__composite_values__()
serializer method, we can now move between
a flat tuple of columns and Vertex
objects that contain Point
instances. The Vertex._generate
method is passed as the
first argument to the composite()
construct as the source of new
Vertex
instances, and the __composite_values__()
method will be
used implicitly by composite()
.
For the purposes of the example, the Vertex
composite is then mapped to a
class called HasVertex
, which is where the Table
containing the
four source columns ultimately resides:
from__future__import annotations importdataclasses fromtypingimport Any fromtypingimport Tuple fromsqlalchemy.ormimport composite fromsqlalchemy.ormimport DeclarativeBase fromsqlalchemy.ormimport Mapped fromsqlalchemy.ormimport mapped_column @dataclasses.dataclass classPoint: x: int y: int @dataclasses.dataclass classVertex: start: Point end: Point @classmethod def_generate(cls, x1: int, y1: int, x2: int, y2: int) -> Vertex: """generate a Vertex from a row""" return Vertex(Point(x1, y1), Point(x2, y2)) def__composite_values__(self) -> Tuple[Any, ...]: """generate a row from a Vertex""" return dataclasses.astuple(self.start) + dataclasses.astuple(self.end) classBase(DeclarativeBase): pass classHasVertex(Base): __tablename__ = "has_vertex" id: Mapped[int] = mapped_column(primary_key=True) x1: Mapped[int] y1: Mapped[int] x2: Mapped[int] y2: Mapped[int] vertex: Mapped[Vertex] = composite(Vertex._generate, "x1", "y1", "x2", "y2")
The above mapping can then be used in terms of HasVertex
, Vertex
, and
Point
:
hv = HasVertex(vertex=Vertex(Point(1, 2), Point(3, 4))) session.add(hv) session.commit() stmt = select(HasVertex).where(HasVertex.vertex == Vertex(Point(1, 2), Point(3, 4))) hv = session.scalars(stmt).first() print(hv.vertex.start) print(hv.vertex.end)
Composite API¶
Object Name | Description |
---|---|
composite([_class_or_attr], *attrs, [group, deferred, raiseload, return_none_on, comparator_factory, active_history, init, repr, default, default_factory, compare, kw_only, hash, info, doc, dataclass_metadata], **__kw) |
Return a composite column-based property for use with a Mapper. |
- function sqlalchemy.orm.composite(_class_or_attr:None|Type[_CC]|Callable[...,_CC]|_CompositeAttrType[Any]=None, /, *attrs:_CompositeAttrType[Any], group:str|None=None, deferred:bool=False, raiseload:bool=False, return_none_on:_NoArg|None|Callable[...,bool]=_NoArg.NO_ARG, comparator_factory:Type[Composite.Comparator[_T]]|None=None, active_history:bool=False, init:_NoArg|bool=_NoArg.NO_ARG, repr:_NoArg|bool=_NoArg.NO_ARG, default:Any|None=_NoArg.NO_ARG, default_factory:_NoArg|Callable[[],_T]=_NoArg.NO_ARG, compare:_NoArg|bool=_NoArg.NO_ARG, kw_only:_NoArg|bool=_NoArg.NO_ARG, hash:_NoArg|bool|None=_NoArg.NO_ARG, info:_InfoType|None=None, doc:str|None=None, dataclass_metadata:_NoArg|Mapping[Any,Any]|None=_NoArg.NO_ARG, **__kw:Any) → Composite [Any]¶
Return a composite column-based property for use with a Mapper.
See the mapping documentation section Composite Column Types for a full usage example.
The
MapperProperty
returned bycomposite()
is theComposite
.- Parameters:
class_¶ – The "composite type" class, or any classmethod or callable which will produce a new instance of the composite object given the column values in order.
*attrs¶ –
List of elements to be mapped, which may include:
Column
objectsmapped_column()
constructsstring names of other attributes on the mapped class, which may be any other SQL or object-mapped attribute. This can for example allow a composite that refers to a many-to-one relationship
active_history=False¶ – When
True
, indicates that the "previous" value for a scalar attribute should be loaded when replaced, if not already loaded. See the same flag oncolumn_property()
.return_none_on=None¶ –
A callable that will be evaluated when the composite object is to be constructed, which upon returning the boolean value
True
will instead bypass the construction and cause the resulting value to be None. This typically may be assigned a lambda that will evaluate to True when all the columns within the composite are themselves None, e.g.:composite( MyComposite, return_none_on=lambda *cols: all(x is None for x in cols) )
The above lambda for
composite.return_none_on
is used automatically when using ORM Annotated Declarative along with an optional value within theMapped
annotation.Added in version 2.1.
group¶ – A group name for this property when marked as deferred.
deferred¶ – When True, the column property is "deferred", meaning that it does not load immediately, and is instead loaded when the attribute is first accessed on an instance. See also
deferred()
.comparator_factory¶ – a class which extends
Comparator
which provides custom SQL clause generation for comparison operations.doc¶ – optional string that will be applied as the doc on the class-bound descriptor.
info¶ – Optional data dictionary which will be populated into the
MapperProperty.info
attribute of this object.init¶ – Specific to Declarative Dataclass Mapping, specifies if the mapped attribute should be part of the
__init__()
method as generated by the dataclass process.repr¶ – Specific to Declarative Dataclass Mapping, specifies if the mapped attribute should be part of the
__repr__()
method as generated by the dataclass process.default_factory¶ – Specific to Declarative Dataclass Mapping, specifies a default-value generation function that will take place as part of the
__init__()
method as generated by the dataclass process.compare¶ –
Specific to Declarative Dataclass Mapping, indicates if this field should be included in comparison operations when generating the
__eq__()
and__ne__()
methods for the mapped class.Added in version 2.0.0b4.
kw_only¶ – Specific to Declarative Dataclass Mapping, indicates if this field should be marked as keyword-only when generating the
__init__()
.hash¶ –
Specific to Declarative Dataclass Mapping, controls if this field is included when generating the
__hash__()
method for the mapped class.Added in version 2.0.36.
dataclass_metadata¶ –
Specific to Declarative Dataclass Mapping, supplies metadata to be attached to the generated dataclass field.
Added in version 2.0.42.