TEI: Guidelines for Electronic Text Encoding and Interchange

P5 Version 4.10.2. Last updated on 4th September 2025, revision bcfa98f42

19 Feature Structures

Table of contents

A feature structure is a general purpose data structure which identifies and groups together individual features, each of which associates a name with one or more values. Because of the generality of feature structures, they can be used to represent many different kinds of information, but they are of particular usefulness in the representation of linguistic analyses, especially where such analyses are partial, or underspecified. Feature structures represent the interrelations among various pieces of information, and their instantiation in markup provides a metalanguage for the generic representation of analyses and interpretations. Moreover, this instantiation allows feature values to be of specific types, and for restrictions to be placed on the values for particular features, by means of feature system declarations.⁸⁴

TEI: Organization of this Chapter⚓︎ 19.1 Organization of this Chapter

This chapter is organized as follows. Following this introduction, section 19.2 Elementary Feature Structures and the Binary Feature Value introduces the elements fs and f, used to represent feature structures and features respectively, together with the elementary binary feature value. Section 19.3 Other Atomic Feature Values introduces elements for representing other kinds of atomic feature values such as symbolic, numeric, and string values. Section 19.4 Feature Libraries and Feature-Value Libraries introduces the notion of predefined libraries or groups of features or feature values along with methods for referencing their components. Section 19.5 Feature Structures as Complex Feature Values introduces complex values, in particular feature-structures as values, thus enabling feature structures to be recursively defined. Section 19.7 Collections as Complex Feature Values discusses other complex values, in particular values which are collections, organized as sets, bags, and lists. Section 19.8 Feature Value Expressions discusses how the operations of alternation, negation, and collection of feature values may be represented. Section 19.9 Default Values discusses ways of representing underspecified, default, or uncertain values. Section 19.10 Linking Text and Analysis discusses how analyses may be linked to other parts of an encoded text. Section 19.11 Feature System Declaration describes the feature system declaration, a construct which provides for the validation of typed feature structures. Formal definitions for all the elements introduced in this chapter are provided in section 19.12 Formal Definition and Implementation.

TEI: Elementary Feature Structures and the Binary Feature Value⚓︎ 19.2 Elementary Feature Structures and the Binary Feature Value

The fundamental elements used to represent a feature structure analysis are f (for feature), which represents a feature-value pair, and fs (for feature structure), which represents a structure made up of such feature-value pairs. The fs element has an optional type attribute which may be used to represent typed feature structures, and may contain any number of f elements. An f element has a required name attribute and an associated value. The value may be simple: that is, a single binary, numeric, symbolic (i.e. taken from a restricted set of legal values), or string value, or a collection of such values, organized in various ways, for example, as a list; or it may be complex, that is, it may itself be a feature structure, thus providing a degree of recursion. Values may be under-specified or defaulted in various ways. These possibilities are all described in more detail in this and the following sections.

Feature and feature-value representations (including feature structure representations) may be embedded directly at any point in an XML document, or they may be collected together in special-purpose feature or feature-value libraries. The components of such libraries may then be referenced from other feature or feature-value representations, using the feats or fVal attribute as appropriate.

We begin by considering the simple case of a feature structure which contains binary-valued features only. The following three XML elements are needed to represent such a feature structure:

fs (feature structure) represents a feature structure, that is, a collection of feature-value pairs organized as a structural unit.
type specifies the type of the feature structure.
feats (features) references the feature-value specifications making up this feature structure.
f (feature) represents a feature value specification, that is, the association of a name with a value of any of several different types.
name a single word which follows the rules defining a legal XML name (see https://www.w3.org/TR/REC-xml/#dt-name), providing a name for the feature.
fVal (feature value) references any element which can be used to represent the value of a feature.
binary (binary value) represents the value part of a feature-value specification which can contain either of exactly two possible values.

The attributes feats and the fVal are not discussed in this section: they provide an alternative way of indicating the content of an element, as further discussed in section 19.4 Feature Libraries and Feature-Value Libraries.

An fs element containing f elements with binary values can be straightforwardly used to encode the matrices of feature-value specifications for phonetic segments, such as the following for the English segment [s].

+--- ---+ | consonantal + | | vocalic - | | voiced - | | anterior + | | coronal + | | continuant + | | strident + | +--- ---+⚓

This representation may be encoded in XML as follows:

TEI: Guidelines for Electronic Text Encoding and Interchange

19 Feature Structures

TEI: Organization of this Chapter⚓︎ 19.1 Organization of this Chapter

TEI: Elementary Feature Structures and the Binary Feature Value⚓︎ 19.2 Elementary Feature Structures and the Binary Feature Value

TEI: Other Atomic Feature Values⚓︎ 19.3 Other Atomic Feature Values

TEI: Feature Libraries and Feature-Value Libraries⚓︎ 19.4 Feature Libraries and Feature-Value Libraries

TEI: Feature Structures as Complex Feature Values⚓︎ 19.5 Feature Structures as Complex Feature Values

TEI: Re-entrant Feature Structures⚓︎ 19.6 Re-entrant Feature Structures

TEI: Collections as Complex Feature Values⚓︎ 19.7 Collections as Complex Feature Values

TEI: Feature Value Expressions⚓︎ 19.8 Feature Value Expressions

TEI: Alternation⚓︎ 19.8.1 Alternation

TEI: Negation⚓︎ 19.8.2 Negation

TEI: Collection of Values⚓︎ 19.8.3 Collection of Values

TEI: Default Values⚓︎ 19.9 Default Values

TEI: Linking Text and Analysis⚓︎ 19.10 Linking Text and Analysis

TEI: Feature System Declaration⚓︎ 19.11 Feature System Declaration

TEI: Linking a TEI Text to Feature System Declarations⚓︎ 19.11.1 Linking a TEI Text to Feature System Declarations

TEI: The Overall Structure of a Feature System Declaration⚓︎ 19.11.2 The Overall Structure of a Feature System Declaration

TEI: Feature Declarations⚓︎ 19.11.3 Feature Declarations

TEI: Feature Structure Constraints⚓︎ 19.11.4 Feature Structure Constraints

TEI: A Complete Example⚓︎ 19.11.5 A Complete Example

TEI: Formal Definition and Implementation⚓︎ 19.12 Formal Definition and Implementation