About the Book
Tensor Decompositions for Data Science explores the foundations of tensor decompositions, a data analysis methodology that is ubiquitous in machine learning, signal processing, chemometrics, neuroscience, quantum computing, financial analysis, social science, business market analysis, image processing, and much more.
This self-contained mathematical, algorithmic, and computational treatment emphasizes examples using real-world downloadable open-source datasets to ground the abstract concepts. Methodologies for 3-way tensors (the simplest notation) are presented before generalizing to d-way tensors, making the book accessible to advanced undergraduate and graduate students in mathematics, computer science, statistics, engineering, and physical and life sciences. Extensive background materials in linear algebra, optimization, probability, and statistics are included as appendices.
- Separates algorithmic kernels, algorithms, and applications into their own chapters for flexible learning.
- Lowers notational and background barriers for students and researchers.
- Includes real-world open source datasets for practical experimentation.
Reviews & Endorsements
“Tensor Decompositions for Data Science by Grey Ballard and Tamara G. Kolda is a much-needed contribution to the field of tensor analysis. This book starts from ground zero and carefully builds up to the core tensor decompositions for data science, with just the right amount of intuition and several practical exercises. This book presents a carefully designed, clear, and intuitive system of notation that will become the standard notation for tensors, making an intimidating subject approachable for new researchers. It will serve as an excellent reference for an advanced undergraduate or early graduate course in tensor decompositions, as well as a trusted guide to tensor decompositions for working researchers.”
— Rachel Ward, University of Texas at Austin
“Tensors arise naturally in many applications but proper handling of them can be exceedingly difficult compared to matrices. This book provides a deep understanding of how to work with tensors, while also being accessible to students. On top of that the mathematical typesetting is delightfully precise and readable, while each concept has exactly the right Tikz illustration.”
— Art Owen, Stanford University
“Ballard and Kolda masterfully lay out a step-by-step introduction to tensor decomposition methods, combining mathematical rigor with practical applications across disciplines. Tensor Decompositions for Data Science stands out not only as an excellent course companion, but also as an inspiring reference for data scientists and engineers committed to advancing their analytic toolkit.”
— Alex Williams, New York University
“Tensor Decompositions for Data Science is an essential contribution to the growing field of tensor methods, presenting a comprehensive and accessible resource for both students and professionals in data science, applied mathematics, and machine learning. Grey Ballard and Tamara G. Kolda offer an in-depth exploration of tensor algebra, decompositions, and their applications across a range of complex data analysis challenges. The authors' structured approach demystifies the mathematics behind tensors, making this book an invaluable tool for understanding the fundamental principles and practical applications of tensor decompositions. With clear explanations, detailed examples, and a balanced focus on theory and application, Ballard and Kolda have crafted a definitive guide that will serve as both an instructional text and a reference for advanced practitioners. This book is a timely and powerful addition to the literature on high-dimensional data processing, and it is certain to become a cornerstone for those seeking to leverage tensor decompositions in cutting-edge research and industry applications.”
— Jack Dongarra, University of Tennessee
“This book provides an accessible entry point into the realm of tensor decompositions, emphasizing practical algorithms, applications and approximations. It guides readers through the essential decompositions, inviting further exploration and setting them off on a wonderful journey into this fascinating field.”
— Nick Vannieuwenhoven, KU Leuven
“Tensors play a crucial role in numerous aspects of data science, including machine learning, computer vision, natural language processing, data compression, anomaly detection, social science, computational neuroscience, materials science, microbiology, and many others. This book provides an accessible yet thorough exploration of tensor representations for complex data. The authors meticulously cover key variants, foundational theories, and practical algorithms - making complex concepts understandable for readers at different levels of expertise.”
— Rebecca Willett, University of Chicago
Table of Contents
- Preface
- I. Tensor Basics
- Tensors and their subparts
- Indexing and reshaping tensors
- Tensor operations
- II. Tucker Decomposition
- Tucker decomposition
- Tucker tensor structure
- Tucker algorithms
- Tucker approximation error
- Tensor train decomposition
- III. CP Decomposition
- Canonical polyadic (CP) decomposition
- Kruskal tensor structure
- CP alternating least squares (CP-ALS) optimization
- CP gradient-based optimization (CP-OPT)
- CP nonlinear least squares (CP-NLS) optimization
- CP algorithms for incomplete or scarce data
- Generalized CP (GCP) decomposition
- CP tensor rank and special topics
- IV. Closing Observations
- Closing observations
- V. Review Materials
- Numerical linear algebra
- Optimization principles and methods
- Some statistics and probability
- Bibliography
- Index
Software
The book is not specific to any language or software package. We encourage readers to let us know their experiences with different packages. Most of our prototyping and testing has been done in MATLAB, and prototyping in Python is underway. If you don't already have a predisposition to a particular package, we recommend the following:
Datasets
One of the important features of this book is that the exercises are grounded in experiments with real-world datasets. Here we link to the datasets used in the book.


Fluorescence excitation-emission matrix data: 18 samples x 251 emissions x 21 excitations. Used in chemometrics.
View Dataset
Large-scale simulation: 2048×256×256 grid points. Used for turbulence and scientific computing research.
View Dataset
Neural activity from 43 neurons over 200 time steps and 88 trials. Used for brain-machine interface research.
View Dataset
Crime data for the city of Chicago, organized as a 4-way tensor for spatio-temporal analysis: 365 days x 24 hours x 77 communities x 12 crime types.
View Dataset