Tensor Decompositions for Data Science

Tensor Decompositions for Data Science Book Cover

Format: Hardback
Published: July 2025
ISBN: 9781009471671

About the Book

Tensor Decompositions for Data Science explores the foundations of tensor decompositions, a data analysis methodology that is ubiquitous in machine learning, signal processing, chemometrics, neuroscience, quantum computing, financial analysis, social science, business market analysis, image processing, and much more.

This self-contained mathematical, algorithmic, and computational treatment emphasizes examples using real-world downloadable open-source datasets to ground the abstract concepts. Methodologies for 3-way tensors (the simplest notation) are presented before generalizing to d-way tensors, making the book accessible to advanced undergraduate and graduate students in mathematics, computer science, statistics, engineering, and physical and life sciences. Extensive background materials in linear algebra, optimization, probability, and statistics are included as appendices.

  • Separates algorithmic kernels, algorithms, and applications into their own chapters for flexible learning.
  • Lowers notational and background barriers for students and researchers.
  • Includes real-world open source datasets for practical experimentation.

Authors

Grey Ballard

Grey Ballard
Associate Professor of Computer Science, Wake Forest University

Grey Ballard specializes in numerical linear algebra, high performance computing, and computational science, with much of his work focusing on numerical methods and software for tensor decompositions. His work has been recognized with a National Science Foundation (NSF) CAREER award, a SIAM Linear Algebra Best Paper Prize, and best paper awards at SPAA, IPDPS, and ICDM.

Tamara G. Kolda

Tamara G. Kolda
MathSci.ai

Tamara G. Kolda is an independent consultant at MathSci.ai. Her research background is in numerical linear algebra, optimization, data science, and scientific software, and she is well known for her pioneering algorithmic and software work in tensor decompositions. She is a member of the US National Academy of Engineering, Fellow of SIAM and ACM, and founding editor-in-chief of the SIAM Journal on Mathematics of Data Science.

Reviews & Endorsements

“Tensor Decompositions for Data Science by Grey Ballard and Tamara G. Kolda is a much-needed contribution to the field of tensor analysis. This book starts from ground zero and carefully builds up to the core tensor decompositions for data science, with just the right amount of intuition and several practical exercises. This book presents a carefully designed, clear, and intuitive system of notation that will become the standard notation for tensors, making an intimidating subject approachable for new researchers. It will serve as an excellent reference for an advanced undergraduate or early graduate course in tensor decompositions, as well as a trusted guide to tensor decompositions for working researchers.”
— Rachel Ward, University of Texas at Austin
“Tensors arise naturally in many applications but proper handling of them can be exceedingly difficult compared to matrices. This book provides a deep understanding of how to work with tensors, while also being accessible to students. On top of that the mathematical typesetting is delightfully precise and readable, while each concept has exactly the right Tikz illustration.”
— Art Owen, Stanford University
“Ballard and Kolda masterfully lay out a step-by-step introduction to tensor decomposition methods, combining mathematical rigor with practical applications across disciplines. Tensor Decompositions for Data Science stands out not only as an excellent course companion, but also as an inspiring reference for data scientists and engineers committed to advancing their analytic toolkit.”
— Alex Williams, New York University
“Tensor Decompositions for Data Science is an essential contribution to the growing field of tensor methods, presenting a comprehensive and accessible resource for both students and professionals in data science, applied mathematics, and machine learning. Grey Ballard and Tamara G. Kolda offer an in-depth exploration of tensor algebra, decompositions, and their applications across a range of complex data analysis challenges. The authors' structured approach demystifies the mathematics behind tensors, making this book an invaluable tool for understanding the fundamental principles and practical applications of tensor decompositions. With clear explanations, detailed examples, and a balanced focus on theory and application, Ballard and Kolda have crafted a definitive guide that will serve as both an instructional text and a reference for advanced practitioners. This book is a timely and powerful addition to the literature on high-dimensional data processing, and it is certain to become a cornerstone for those seeking to leverage tensor decompositions in cutting-edge research and industry applications.”
— Jack Dongarra, University of Tennessee
“This book provides an accessible entry point into the realm of tensor decompositions, emphasizing practical algorithms, applications and approximations. It guides readers through the essential decompositions, inviting further exploration and setting them off on a wonderful journey into this fascinating field.”
— Nick Vannieuwenhoven, KU Leuven
“Tensors play a crucial role in numerous aspects of data science, including machine learning, computer vision, natural language processing, data compression, anomaly detection, social science, computational neuroscience, materials science, microbiology, and many others. This book provides an accessible yet thorough exploration of tensor representations for complex data. The authors meticulously cover key variants, foundational theories, and practical algorithms - making complex concepts understandable for readers at different levels of expertise.”
— Rebecca Willett, University of Chicago

Table of Contents

Software

The book is not specific to any language or software package. We encourage readers to let us know their experiences with different packages. Most of our prototyping and testing has been done in MATLAB, and prototyping in Python is underway. If you don't already have a predisposition to a particular package, we recommend the following:

Datasets

One of the important features of this book is that the exercises are grounded in experiments with real-world datasets. Here we link to the datasets used in the book.

EEM Tensor EEM Tensor Alternate View
EEM Tensor Dataset

Fluorescence excitation-emission matrix data: 18 samples x 251 emissions x 21 excitations. Used in chemometrics.

View Dataset
Miranda Turbulent Flow
Miranda Turbulent Flow Dataset

Large-scale simulation: 2048×256×256 grid points. Used for turbulence and scientific computing research.

View Dataset
Monkey BMI Tensor
Monkey BMI Tensor Dataset

Neural activity from 43 neurons over 200 time steps and 88 trials. Used for brain-machine interface research.

View Dataset
Chicago Crime
Chicago Crime Tensor Dataset

Crime data for the city of Chicago, organized as a 4-way tensor for spatio-temporal analysis: 365 days x 24 hours x 77 communities x 12 crime types.

View Dataset

LaTeX Resources

Errata