Tensor Processing Unit

AI accelerator ASIC by Google

This article is about the chip developed by Google. For the smartphone system-on-chip, see Google Tensor. For other devices that provide tensor processing for artificial intelligence, see AI accelerator.

Tensor Processing Unit
Tensor Processing Unit 3.0
Designer	Google
Introduced	2015^[1]
Type	Neural network, machine learning

Tensor Processing Unit (TPU) is an AI accelerator application-specific integrated circuit (ASIC) developed by Google for neural network machine learning, using Google's own TensorFlow software.^[2] Google began using TPUs internally in 2015, and in 2018 made them available for third-party use, both as part of its cloud infrastructure and by offering a smaller version of the chip for sale.

Generation	TPUv1	TPUv2	TPUv3	TPUv4^[17]^[19]^[20]	TPUv5e^[21]	TPUv5p^[22]^[23]	v6e (Trillium)^[24]^[25]
Date introduced	2015	2017	2018	2021	2023	2023	2024
Process node	28 nm	16 nm	16 nm	7 nm	Unstated	Unstated
Die size (mm²)	331	< 625	< 700	< 400	300-350	Unstated
On-chip memory (MiB)	28	32	32 (VMEM) + 5 (spMEM)	128 (CMEM) + 32 (VMEM) + 10 (spMEM)	48^[citation needed]	112^[citation needed]
Clock speed (MHz)	700	700	940	1050	Unstated	1750
Memory	8 GiB DDR3	16 GiB HBM	32 GiB HBM	32 GiB HBM	16 GB HBM	95 GB HBM	32 GB
Memory bandwidth	34 GB/s	600 GB/s	900 GB/s	1200 GB/s	819 GB/s	2765 GB/s	1640 GB/s
TDP (W)	75	280	220	170	Not listed	Not listed
TOPS (tera operations per second)	23	45	123	275	197 (bf16) 393 (int8)	459 (bf16) 918 (int8)	918 (bf16) 1836 (int8)
TOPS/W	0.31	0.16	0.56	1.62	Not listed	Not listed
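
The TOPS/W row appears to be the listed peak TOPS divided by the listed TDP. The article does not state this, so it is treated here as an assumption. A minimal Python sketch that checks this against the four generations with both figures listed (the names SPECS and tops_per_watt are illustrative, not from any Google tooling):

```python
# Peak throughput (TOPS) and TDP (W) copied from the table above, for the
# four generations where both values are listed.
SPECS = {
    "TPUv1": (23, 75),
    "TPUv2": (45, 280),
    "TPUv3": (123, 220),
    "TPUv4": (275, 170),
}


def tops_per_watt(tops, tdp_watts):
    """Return peak TOPS divided by TDP in watts (assumed definition)."""
    if tdp_watts <= 0:
        raise ValueError("TDP must be positive")
    return tops / tdp_watts


# Round to two decimals, matching the precision used in the table.
efficiency = {
    name: round(tops_per_watt(tops, tdp), 2)
    for name, (tops, tdp) in SPECS.items()
}

for name, value in efficiency.items():
    print(f"{name}: {value:.2f} TOPS/W")
```

Under that assumption the computed values match the table's TOPS/W row for TPUv1 through TPUv4. The later generations cannot be checked the same way because their TDP is not listed.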

Comparison to CPUs and GPUs

History

Products

First generation TPU

Second generation TPU

Third generation TPU

Fourth generation TPU

Fifth generation TPU

Sixth generation TPU

Edge TPU

Pixel Neural Core

Google Tensor

Lawsuit

See also

References

External links