LearnDistribution
LearnDistribution [{example1,example2,…}]
generates a LearnedDistribution […] that attempts to represent an underlying distribution for the examples given.
Details and Options
- LearnDistribution can be used on many types of data, including numerical, nominal and image data.
- Each examplei can be a single data element, a list of data elements or an association of data elements. Examples can also be given as a Dataset or a Tabular object.
- LearnDistribution effectively assumes that each of the examplei is independently drawn from an underlying distribution, which LearnDistribution attempts to infer.
- LearnDistribution[examples] yields a LearnedDistribution[…] on which the following functions can be used:
-
PDF[dist,…] compute the probability or probability density of an example
RandomVariate[dist] generate a random example from the learned distribution
SynthesizeMissingValues[dist,…] fill in the missing values of an example
RarerProbability[dist,…] compute the probability of generating a sample with a lower PDF than a given example
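The documentation above is for Wolfram Language; as an illustration of what RarerProbability measures, here is a minimal Python sketch (all names illustrative, not Wolfram's implementation) for a learned 1-D normal distribution: the fraction of generated samples whose PDF is lower than the PDF of the query example. Typical examples score near 1, outliers near 0.

```python
import numpy as np

rng = np.random.default_rng(1234)
data = rng.normal(loc=5.0, scale=2.0, size=1000)

# "Train": fit a normal distribution to the data (roughly analogous to
# LearnDistribution with Method -> "Multinormal" on 1-D input).
mu, sigma = data.mean(), data.std()

def pdf(x):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

def rarer_probability(x, n=100_000):
    # Probability that a sample from the learned distribution has a
    # lower PDF value than the given example.
    samples = rng.normal(mu, sigma, size=n)
    return np.mean(pdf(samples) < pdf(x))

print(rarer_probability(mu))              # near the mode: close to 1
print(rarer_probability(mu + 4 * sigma))  # far in the tail: close to 0
```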
- The following options can be given:
-
FeatureExtractor Identity how to extract features from which to learn
FeatureTypes Automatic feature types to assume for the examples
Method Automatic which modeling algorithm to use
PerformanceGoal Automatic aspects of performance to try to optimize
RandomSeeding 1234 what seeding of pseudorandom generators should be done internally
TimeGoal Automatic how long to spend training the distribution
TrainingProgressReporting Automatic how to report progress during training
- Possible settings for PerformanceGoal include:
-
"DirectTraining" train directly on the full dataset, without model searching
"Memory" minimize storage requirements of the distribution
"Quality" maximize the modeling quality of the distribution
"Speed" maximize speed for PDF queries
"SamplingSpeed" maximize speed for generating random samples
"TrainingSpeed" minimize time spent producing the distribution
Automatic automatic tradeoff among speed, quality and memory
{goal1,goal2,…} automatically combine goal1, goal2, etc.
- Possible settings for Method include:
-
"ContingencyTable" discretize data and store each possible probability
"DecisionTree" use a decision tree to compute probabilities
"GaussianMixture" use a mixture of Gaussian (normal) distributions
"KernelDensityEstimation" use a kernel mixture distribution
"Multinormal" use a multivariate normal (Gaussian) distribution
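To make the "KernelDensityEstimation" setting concrete, here is a minimal Python sketch of a 1-D Gaussian kernel mixture (the general technique the setting names, not Wolfram's implementation; the bandwidth rule below is one common default and is an assumption):

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(0.0, 1.0, size=200)

# Silverman's rule-of-thumb bandwidth -- a common default choice;
# Wolfram's actual bandwidth selection may differ.
h = 1.06 * data.std() * len(data) ** (-1 / 5)

def kde_pdf(x):
    # Mixture of Gaussian kernels, one centered on each training point.
    z = (x - data) / h
    return np.mean(np.exp(-0.5 * z**2) / (h * np.sqrt(2 * np.pi)))

print(kde_pdf(0.0))   # roughly the true N(0,1) density at 0 (about 0.4)
print(kde_pdf(10.0))  # essentially zero far from the data
```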
- The following settings for TrainingProgressReporting can be used:
-
"Panel" show a dynamically updating graphical panel
"Print" periodically report information using Print
"ProgressIndicator" show a simple ProgressIndicator
"SimplePanel" dynamically updating panel without learning curves
None do not report any information
- Possible settings for RandomSeeding include:
-
Automatic automatically reseed every time the function is called
Inherited use externally seeded random numbers
seed use an explicit integer or string as a seed
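The seeding behavior can be mimicked in Python (illustrative sketch only): a fixed integer seed makes training and sampling reproducible across calls, while a fresh seed per call behaves like Automatic.

```python
import numpy as np

def learn_and_sample(seed):
    # A fixed integer seed (like RandomSeeding -> 1234) makes the
    # result reproducible across calls.
    rng = np.random.default_rng(seed)
    return rng.normal(size=3)

a = learn_and_sample(1234)
b = learn_and_sample(1234)   # identical to a
c = learn_and_sample(None)   # like RandomSeeding -> Automatic: fresh seed
print(np.allclose(a, b))
```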
- Only reversible feature extractors can be given in the FeatureExtractor option.
- LearnDistribution[…,FeatureExtractor->"Minimal"] indicates that the internal preprocessing should be as simple as possible.
- All images are first conformed using ConformImages.
- Information[LearnedDistribution[…]] generates an information panel about the distribution and its estimated performance.
Examples
Basic Examples (3)
Train a distribution on a numeric dataset:
Generate a new example based on the learned distribution:
Compute the PDF of a new example:
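The three steps above (train, generate, compute the PDF) can be sketched in Python for a 1-D numeric dataset; this fits a single normal distribution by maximum likelihood, a simplification of what LearnDistribution does, with all names illustrative:

```python
import numpy as np

# "Training data": draws from some unknown 1-D distribution.
rng = np.random.default_rng(42)
examples = rng.normal(10.0, 3.0, size=500)

# "Train": fit a normal distribution by maximum likelihood.
mu, sigma = examples.mean(), examples.std()

# "Generate a new example" from the learned distribution.
new_example = rng.normal(mu, sigma)

# "Compute the PDF" of a query point under the learned model.
def pdf(x):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

print(new_example, pdf(10.0))
```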
Train a distribution on a nominal dataset:
Generate a new example based on the learned distribution:
Compute the probability of the examples "A" and "B":
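For nominal data, the analogous sketch in Python is a categorical distribution estimated from frequencies (illustrative only; Wolfram's method selection may differ):

```python
from collections import Counter
import random

examples = ["A", "A", "B", "A", "B", "A"]

# "Train": estimate category probabilities from frequencies.
counts = Counter(examples)
total = sum(counts.values())
probs = {k: v / total for k, v in counts.items()}

# "Generate new examples" by sampling the learned probabilities.
random.seed(1234)
new_examples = random.choices(list(probs), weights=probs.values(), k=5)

# "Compute the probability" of the examples "A" and "B".
print(probs["A"], probs["B"])  # 0.666..., 0.333...
```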
Train a distribution on a two-dimensional dataset:
Generate a new example based on the learned distribution:
Compute the probability of two examples:
Impute the missing value of an example:
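Imputing a missing value from a learned two-dimensional distribution can be sketched in Python with a fitted bivariate normal, using the standard conditional-mean formula E[y | x] = mu_y + (cov_xy / var_x)(x - mu_x). This is an illustration of the idea, not Wolfram's imputation algorithm:

```python
import numpy as np

rng = np.random.default_rng(7)
# Correlated two-dimensional training data: y is roughly 2*x plus noise.
x = rng.normal(0, 1, 2000)
y = 2.0 * x + rng.normal(0, 0.5, 2000)
data = np.column_stack([x, y])

# "Train": fit a multivariate normal (mean vector and covariance matrix).
mean = data.mean(axis=0)
cov = np.cov(data, rowvar=False)

# Impute a missing y given an observed x via the conditional mean.
def impute_y(x_obs):
    return mean[1] + cov[0, 1] / cov[0, 0] * (x_obs - mean[0])

print(impute_y(1.0))  # close to 2.0 for this data
```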
Scope (3)
Train a distribution on a dataset containing numeric and nominal variables:
Generate a new example based on the learned distribution:
Impute the missing value of an example:
Train a distribution on colors:
Generate 100 new examples based on the learned distribution:
Compute the probability density of some colors:
Train a distribution on dates:
Generate 10 new examples based on the learned distribution:
Compute the probability density of some new dates:
Options (6)
FeatureTypes (1)
Specify that the data is nominal:
Without specification, the data is considered numerical:
Method (2)
Train a "Multinormal" distribution on a numeric dataset:
Plot the PDF along with the training data:
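The "Multinormal" method fits a multivariate normal distribution; a minimal Python sketch (illustrative names, not Wolfram's code) fits the mean and covariance and evaluates the multivariate normal PDF:

```python
import numpy as np

rng = np.random.default_rng(3)
data = rng.multivariate_normal([0, 0], [[1, 0.8], [0.8, 1]], size=1000)

# Fit: the sample mean and covariance are the maximum-likelihood
# estimates of a multivariate normal's parameters.
mean = data.mean(axis=0)
cov = np.cov(data, rowvar=False)

def multinormal_pdf(p):
    d = p - mean
    inv = np.linalg.inv(cov)
    norm = 1.0 / (2 * np.pi * np.sqrt(np.linalg.det(cov)))
    return norm * np.exp(-0.5 * d @ inv @ d)

print(multinormal_pdf(mean))  # the density is maximal at the fitted mean
```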
Train a distribution on a two-dimensional dataset with all available methods ("Multinormal", "ContingencyTable", "KernelDensityEstimation", "DecisionTree" and "GaussianMixture"):
Visualize the probability density of these distributions:
TimeGoal (2)
Learn a distribution while specifying a total training time of 5 seconds:
Load 1000 images of the "MNIST" dataset:
Learn its distribution while specifying a target training time of 3 seconds:
The loss value obtained (cross-entropy) is about -0.43:
Learn its distribution while specifying a target training time of 30 seconds:
The loss value obtained (cross-entropy) is about -0.978:
Compare the learning curves for both trainings:
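The cross-entropy loss reported above is the mean negative log-likelihood of held-out data under the learned distribution (lower is better; for continuous densities it can be negative). A Python sketch of the computation, with synthetic data standing in for MNIST:

```python
import numpy as np

rng = np.random.default_rng(11)
train = rng.normal(0, 1, 1000)
test = rng.normal(0, 1, 1000)

# "Train" a normal model on the training split.
mu, sigma = train.mean(), train.std()

def log_pdf(x):
    return -0.5 * ((x - mu) / sigma) ** 2 - np.log(sigma * np.sqrt(2 * np.pi))

# Cross-entropy loss: mean negative log-likelihood on held-out data.
loss = -np.mean(log_pdf(test))
print(loss)  # about 1.42 for a standard normal (0.5 * log(2*pi*e))
```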
TrainingProgressReporting (1)
Load the "UCILetter" dataset:
Show training progress interactively during training:
Show training progress interactively without plots:
Print training progress periodically during training:
Show a simple progress indicator:
Do not report progress:
Applications (4)
Obtain a dataset of images:
Train a distribution on the images:
Generate 50 new examples based on the learned distribution:
Compare the probability density for an image of the training set, an image of a test set, a sample from the learned distribution, an image of another dataset and a random image:
Obtain the probability of generating a sample with a lower PDF for each of these images:
Load a sample dataset:
Train a distribution directly from the Tabular object:
Generate a random sample:
Generate several random samples:
Visualize random samples of the variables "PetalLength" and "SepalLength" from the distribution and compare them with the dataset:
Load the Titanic survival dataset:
Train a distribution on the dataset:
Use the distribution and SynthesizeMissingValues to generate complete examples from incomplete ones:
Use the distribution to predict the survival probability of a given passenger:
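Predicting a survival probability from a learned joint distribution amounts to a conditional probability. A Python sketch with synthetic passenger records (illustrative counts only, not the real Titanic data):

```python
from collections import Counter

# Synthetic (class, survived) records -- illustrative only.
records = ([("1st", "yes")] * 60 + [("1st", "no")] * 40
           + [("3rd", "yes")] * 25 + [("3rd", "no")] * 75)

# "Train": the joint distribution is just the table of counts.
joint = Counter(records)

# P(survived = "yes" | class) from the learned joint distribution.
def survival_prob(pclass):
    yes = joint[(pclass, "yes")]
    no = joint[(pclass, "no")]
    return yes / (yes + no)

print(survival_prob("1st"), survival_prob("3rd"))  # 0.6 0.25
```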
Train a distribution on a two-dimensional dataset:
Plot the PDF along with the training data:
Use SynthesizeMissingValues to impute missing values using the learned distribution:
Obtain the histogram of possible imputed values:
Text
Wolfram Research (2019), LearnDistribution, Wolfram Language function, https://reference.wolfram.com/language/ref/LearnDistribution.html (updated 2025).
CMS
Wolfram Language. 2019. "LearnDistribution." Wolfram Language & System Documentation Center. Wolfram Research. Last Modified 2025. https://reference.wolfram.com/language/ref/LearnDistribution.html.
APA
Wolfram Language. (2019). LearnDistribution. Wolfram Language & System Documentation Center. Retrieved from https://reference.wolfram.com/language/ref/LearnDistribution.html
BibTeX
@misc{reference.wolfram_2025_learndistribution, author="Wolfram Research", title="{LearnDistribution}", year="2025", howpublished="\url{https://reference.wolfram.com/language/ref/LearnDistribution.html}", note="[Accessed: 17-November-2025]"}
BibLaTeX
@online{reference.wolfram_2025_learndistribution, organization={Wolfram Research}, title={LearnDistribution}, year={2025}, url={https://reference.wolfram.com/language/ref/LearnDistribution.html}, note={[Accessed: 17-November-2025]}}