Cayley-Hamilton Theorem
States that every square matrix satisfies its own characteristic equation.
This public page keeps the free explanation visible and leaves premium worked solving, advanced walkthroughs, and saved study tools inside the app.
Core idea
Overview
The Cayley-Hamilton Theorem asserts that every square matrix satisfies its own characteristic equation, meaning if p(λ) is the characteristic polynomial of matrix A, then p(A) results in the zero matrix. This fundamental result bridges the gap between matrix algebra and polynomial theory, providing a powerful tool for matrix analysis.
When to use: Apply this theorem when calculating large powers of a matrix or finding the inverse of a non-singular matrix without row reduction. It is also used to simplify matrix-valued functions and to find the minimal polynomial of a linear operator.
Why it matters: It drastically reduces computational complexity in fields like control theory and signal processing by converting matrix exponentiation into linear combinations of lower powers. It is a cornerstone of the Jordan Canonical Form and other structural decompositions in linear algebra.
Walkthrough
Derivation
Derivation/Understanding of Cayley-Hamilton Theorem
The Cayley-Hamilton Theorem states that every square matrix satisfies its own characteristic polynomial, meaning that if a matrix is substituted into its characteristic polynomial, the result is the zero matrix.
- The matrix is a square matrix of dimension .
- The field of scalars is (complex numbers) or (real numbers).
Defining the Characteristic Polynomial and Adjugate Relationship:
We begin by defining the characteristic polynomial for an matrix . We then recall the fundamental property relating a matrix, its adjugate, and its determinant, applying it to the matrix .
Expressing the Adjugate as a Polynomial Matrix:
Since the adjugate matrix's elements are determinants of submatrices of , they are polynomials in of degree at most . This allows us to express the adjugate as a polynomial in whose coefficients are constant matrices.
Equating Coefficients and Deriving the Theorem:
By substituting the polynomial expressions for and into the identity, we can equate coefficients of powers of . Multiplying these resulting matrix equations by appropriate powers of and summing them leads to a telescoping sum on the left, which cancels to the zero matrix, thus proving that equals the zero matrix.
Result
Source: Introduction to Linear Algebra by Gilbert Strang
Visual intuition
Graph
Graph unavailable for this formula.
The graph is a constant function where the dependent variable result is always zero for any independent variable input. This horizontal line lies directly on the x-axis because the matrix polynomial evaluates to the zero matrix.
Graph type: constant
Why it behaves this way
Intuition
Imagine a square matrix as a set of instructions for transforming vectors; the Cayley-Hamilton theorem states that if you apply a specific polynomial sequence of these instructions (derived from the matrix's own characteristic polynomial), the net transformation is the zero transformation.
Free study cues
Insight
Canonical usage
This mathematical theorem describes an algebraic identity for square matrices. If the matrix elements possess physical units, then the polynomial coefficients must be chosen to ensure dimensional consistency across all terms of the identity.
Common confusion
A common confusion is either trying to assign physical units where none are necessary for a pure mathematical theorem, or failing to ensure dimensional consistency of the polynomial terms if the matrix elements do
Unit systems
One free problem
Practice Problem
Given a 2×2 matrix A with diagonal elements m11 = 5 and m22 = 3, the Cayley-Hamilton theorem states that A satisfies the equation A² - kA + dI = 0. Find the value of k, which corresponds to the trace of the matrix.
Solve for:
Hint: The trace of a matrix is the sum of its diagonal elements and appears as the negative coefficient of the λ term in the characteristic polynomial.
The full worked solution stays in the interactive walkthrough.
Where it shows up
Real-World Context
In control theory to compute the matrix exponential for solving systems of linear differential equations, Cayley-Hamilton Theorem is used to calculate P(A) from the measured values. The result matters because it helps connect the calculation to the shape, rate, probability, or constraint in the model.
Study smarter
Tips
- Calculate the characteristic polynomial first using det(λI - A) = 0.
- Substitute λ with the matrix A and the constant term with the identity matrix I.
- Use it to express A⁻¹ as a polynomial in A by multiplying the characteristic equation by A⁻¹.
Avoid these traps
Common Mistakes
- Applying the theorem to non-square matrices.
- Forgetting to multiply the constant term by the identity matrix when evaluating p(A).
Common questions
Frequently Asked Questions
The Cayley-Hamilton Theorem states that every square matrix satisfies its own characteristic polynomial, meaning that if a matrix is substituted into its characteristic polynomial, the result is the zero matrix.
Apply this theorem when calculating large powers of a matrix or finding the inverse of a non-singular matrix without row reduction. It is also used to simplify matrix-valued functions and to find the minimal polynomial of a linear operator.
It drastically reduces computational complexity in fields like control theory and signal processing by converting matrix exponentiation into linear combinations of lower powers. It is a cornerstone of the Jordan Canonical Form and other structural decompositions in linear algebra.
Applying the theorem to non-square matrices. Forgetting to multiply the constant term by the identity matrix when evaluating p(A).
In control theory to compute the matrix exponential for solving systems of linear differential equations, Cayley-Hamilton Theorem is used to calculate P(A) from the measured values. The result matters because it helps connect the calculation to the shape, rate, probability, or constraint in the model.
Calculate the characteristic polynomial first using det(λI - A) = 0. Substitute λ with the matrix A and the constant term with the identity matrix I. Use it to express A⁻¹ as a polynomial in A by multiplying the characteristic equation by A⁻¹.
References
Sources
- Wikipedia: Cayley-Hamilton theorem
- Linear Algebra and Its Applications (5th ed.) by David C. Lay
- Introduction to Linear Algebra (5th ed.) by Gilbert Strang
- Linear Algebra and Its Applications by David C. Lay
- Introduction to Linear Algebra by Gilbert Strang
- Linear Algebra and Its Applications, David C. Lay