std::transform_reduce

class ForwardIt1, class ForwardIt2, class T,
class BinaryOp1, class BinaryOp2 >
T transform_reduce( ExecutionPolicy&& policy,
ForwardIt1 first1, ForwardIt1 last1,
ForwardIt2 first2, T init,

BinaryOp1 reduce, BinaryOp2 transform );

(4) (since C++17)

template< class InputIt, class T,

class BinaryOp, class UnaryOp >
T transform_reduce( InputIt first, InputIt last, T init,

BinaryOp reduce, UnaryOp transform );

(5) (since C++17)
(constexpr since C++20)

template< class ExecutionPolicy,

class ForwardIt, class T,
class BinaryOp, class UnaryOp >
T transform_reduce( ExecutionPolicy&& policy,
ForwardIt first, ForwardIt last, T init,

BinaryOp reduce, UnaryOp transform );

(6) (since C++17)

1) Equivalent to transform_reduce(first1, last1, first2, init,
std::plus <>(), std::multiplies <>()), effectively parallelized version of the default std::inner_product .

3) Applies transform to each pair of elements from the ranges [first1, last1) and the range of std::distance (first1, last1) elements starting from first2 and reduces the results (possibly permuted and aggregated in unspecified manner) along with the initial value init over reduce.

The result is non-deterministic if the reduce is not associative or not commutative (such as floating-point addition).

If any of the following values is not convertible to T, the program is ill-formed:

reduce(init, init)
reduce(init, transform(*first1, *first2))
reduce(transform(*first1, *first2), init)
reduce(transform(*first1, *first2), transform(*first1, *first2))

Given last2 as the std::distance (first1, last1)th next iterator of first2, if any of the following conditions is satisfied, the behavior is undefined:

T is not MoveConstructible.
transform or reduce modifies any element of [first1, last1) or [first2, last2).
transform or reduce invalidates any iterator or subrange of [first1, last1] or [first2, last2].

5) Applies transform to each element in the range [first, last) and reduces the results (possibly permuted and aggregated in unspecified manner) along with the initial value init over reduce.

The result is non-deterministic if the reduce is not associative or not commutative (such as floating-point addition).

If any of the following values is not convertible to T, the program is ill-formed:

reduce(init, init)
reduce(init, transform(*first))
reduce(transform(*first), init)
reduce(transform(*first), transform(*first))

If any of the following conditions is satisfied, the behavior is undefined:

T is not MoveConstructible.
transform or reduce modifies any element of [first, last).
transform or reduce invalidates any iterator or subrange of [first, last].

2,4,6) Same as (1,3,5), but executed according to policy.

These overloads participate in overload resolution only if all following conditions are satisfied:

std::is_execution_policy_v <std::decay_t <ExecutionPolicy>> is true.

(until C++20)

std::is_execution_policy_v <std::remove_cvref_t <ExecutionPolicy>> is true.

(since C++20)

If execution of a function invoked as part of the algorithm throws an exception and ExecutionPolicy is one of the standard policies, std::terminate is called. For any other ExecutionPolicy, the behavior is implementation-defined.
If the algorithm fails to allocate memory, std::bad_alloc is thrown.

[edit] Notes

transform is never applied to init.

If first == last or first1 == last1, init is returned, unmodified.

[edit] Example

transform_reduce can be used to parallelize std::inner_product . Some systems may need additional support to get advantages of parallel execution. E.g., on GNU/Linux, the Intel TBB be installed and -ltbb option be provided to gcc/clang compiler.

Run this code

#if PARALLEL
#include <execution>
#define PAR std::execution::par,
#else
#define PAR
#endif
 
#include <algorithm>
#include <functional>
#include <iostream>
#include <iterator>
#include <locale>
#include <numeric>
#include <vector>
 
// to parallelize non-associate accumulative operation, you'd better choose
// transform_reduce instead of reduce; e.g., a + b * b != b + a * a
void print_sum_squared(long const num)
{
 std::cout.imbue(std::locale {"en_US.UTF8"});
 std::cout << "num = " << num << '\n';
 
 // create an immutable vector filled with pattern: 1,2,3,4, 1,2,3,4 ...
 const std::vector <long> v{[n = num * 4] {
 std::vector <long> v;
 v.reserve(n);
 std::generate_n (std::back_inserter (v), n,
 [i = 0]() mutable { return 1 + i++ % 4; });
 return v;
 }()};
 
 auto squared_sum = [](auto sum, auto val) { return sum + val * val; };
 
 auto sum1 = std::accumulate (v.cbegin(), v.cend(), 0L, squared_sum);
 std::cout << "accumulate(): " << sum1 << '\n';
 
 auto sum2 = std::reduce (PAR v.cbegin(), v.cend(), 0L, squared_sum);
 std::cout << "reduce(): " << sum2 << '\n';
 
 auto sum3 = std::transform_reduce(PAR v.cbegin(), v.cend(), 0L, std::plus {},
 [](auto val) { return val * val; });
 std::cout << "transform_reduce(): " << sum3 << "\n\n";
}
 
int main()
{
 print_sum_squared(1);
 print_sum_squared(1'000);
 print_sum_squared(1'000'000);
}

Possible output:

num = 1
accumulate(): 30
reduce(): 30
transform_reduce(): 30
 
num = 1,000
accumulate(): 30,000
reduce(): -7,025,681,278,312,630,348
transform_reduce(): 30,000
 
num = 1,000,000
accumulate(): 30,000,000
reduce(): -5,314,886,882,370,003,032
transform_reduce(): 30,000,000
 
// Compile-options for parallel execution on POSIX:
// g++ -O2 -std=c++17 -Wall -Wextra -pedantic -DPARALLEL ./example.cpp -ltbb -o tr; ./tr

[edit] See also

accumulate

sums up or folds a range of elements
(function template) [edit]

transform

applies a function to a range of elements, storing results in a destination range
(function template) [edit]

reduce

(C++17)

similar to std::accumulate , except out of order
(function template) [edit]

cppreference.com

Namespaces

Variants

Views

Actions

std::transform_reduce

Contents

[edit] Parameters

[edit] Return value

[edit] Complexity

[edit] Exceptions

[edit] Notes

[edit] Example

[edit] See also

Navigation

Toolbox