Skip to main content

Section 2.7 Prime and maximal ideals

Subsection 2.7.1 Prime and maximal ideals

Definition 2.7.1. Maximal ideal.

Let \(R\) be a ring. An ideal \(M\) of \(R\) is maximal if \(M\ne R\) and if \(M\subseteq I\) for some ideal \(I\text{,}\) then either \(M=I\) or \(I=R\text{.}\) Using logical shorthand: an ideal \(M\) is maximal if \(M\ne R\) and
\begin{align} M\subseteq I \amp \implies M=I \text{ or } I=R \text{.}\tag{2.7.1} \end{align}
Equivalently, \(M\) is a maximal element of the set of all proper ideals of \(R\) with respect to the partial order \(\subseteq\text{.}\)

Example 2.7.2. Division ring.

Let \(R\) be a division ring. Prove that \((0)\) is the only maximal ideal of \(R\text{.}\)
Solution.
Indeed, \((0)\) is the only proper ideal of \(R\text{,}\) since if an \(I\) contains a nonzero element \(a\text{,}\) then since \(a\) is a unit it follows easily that \(I=R\text{.}\) Thus \((0)\) is maximal.

Example 2.7.3. Maximal ideals of \(M_n(F)\).

Let \(F\) be a field, and let \(n\) be a positive integer. Prove that \((0)\) is a maximal ideal of \(M_n(F)\text{.}\)
Solution.
You have shown on your homework that the only ideals of \(M_n(F)\) are of the form \(M_n(I)\) where \(I\) is an ideal of \(F\text{.}\) Since the only ideals of \(F\) are \((0)\) and \(F\text{,}\) it follows that \(M_n((0))=(0)\) is the only proper ideal of \(M_n(F)\text{,}\) and hence maximal.

Example 2.7.4. Maximal ideals of \(\Z\).

Let \(n\) be a nonnegative integer. Prove that the ideal \((n)\) is maximal if and only if \(n=p\) is prime.
Solution.
We have
\begin{align} (n) \text{ maximal} \amp \iff (n)\subseteq (a) \implies (m)=(a) \text{ or } (a)=\Z \tag{2.7.2}\\ \amp \iff a\mid n \implies a=\pm n \text{ or } a=\pm 1\tag{2.7.3}\\ \amp \iff n \prime\text{.}\tag{2.7.4} \end{align}
At the end of this section we will prove that maximal ideals exist in any ring using Zorn’s lemma. Before getting to that, however, we first introduce prime ideals and relate these to maximal ideals. The theory of prime ideals can be developed for any ring, commutative or not, but for simplicity, we will restrict our attention to commutative rings.

Definition 2.7.5. Prime ideal.

Let \(R\) be a commutative ring. An ideal \(P\) of \(R\) is prime if \(P\ne R\) and the following condition holds: for all \(a,b\in R\text{,}\) if \(ab\in P\text{,}\) then \(a\in P\) or \(b\in P\text{.}\) Using logical shorthand: \(P\) is prime if \(P\ne R\) and
\begin{align} ab\in P \amp \implies a\in P \text{ or } b\in P\text{.}\tag{2.7.5} \end{align}
The next theorem will help us identify prime and maximal ideals, and places both types of ideals in a common framework. Note that we include a useful criterion for maximal ideals that holds for all rings, commutative or not, in the form of statement (1). We take pains to emphasize that the remaining statements include the condition that the ring is commutative.

Proof.

  1. By the fourth isomorphism theorem, the ideals of \(R/M\) are in bijective correspondence with the ideals of \(R\) containing \(M\text{.}\) The result now follows immediately from the definitions of maximal ideal and simple ring.
  2. We have \(R/P\) integral if and only if \((a+P)(b+P)=0+P=P\) implies \(a+P=P\) or \(b+P=P\) for all \(a,b\in R\) if and only if \(ab+P=P\) implies \(a+P=P\) or \(b+P\) for all \(a,b\in R\text{,}\) if and only if \(ab\in P\) implies \(a\in P\) or \(b\in P\) for all \(a,b\in R\text{,}\) if and only if \(P\) is prime.
  3. From (1), \(M\) is maximal if and only if \(R/M\) is simple. It is easy to see that for commutative rings, simple is equivalent to being a field.
  4. Since all fields are integral domains, \((2)\) and \((3)\) now imply that all maximal ideals are prime.

Example 2.7.7. Prime and maximal ideals of \(\Z\).

Let \(n\) be a positive integer. Show that the following statements are equivalent.
  1. \((n)\) is prime.
  2. \((n)\) is maximal.
  3. \(n=p\) is a prime integer.
Solution.
We have seen previously that for positive \(n\text{,}\) the ring \(\Z/n\Z\) is an integral domain if and only if it is a field, if and only if \(n=p\) is a prime integer. The result now follows from TheoremΒ 2.7.6.

Remark 2.7.8. Prime integers.

From the equivalence of (1) and (3) in TheoremΒ 2.7.6, it follows that a positive integer \(n\) is a prime integer if and only if \((n)\) is a prime ideal, if and only if \(n\mid ab\) implies \(n\mid a\) or \(n\mid b\) for all integers \(a,b\text{.}\) We have thus derived, in a roundabout way, an equivalent formulation of being prime. (Recall that our original definition of a prime integer is one that has exactly two positive divisors: 1 and itself.) We will study the notion of a prime elements in arbitrary commutative rings down the line, and will in fact adopt this alternate formulation as our definition.

Subsection 2.7.2 Zorn’s lemma

We begin this section fumbling around in the foundations of set theory. The immediate motivation for our foray into the murky depths of mathematics is our needing Zorn’s lemma to prove the existence of maximal ideals in rings. Beyond that, however, partially ordered sets are ubiquitous in mathematics and well worth a formal introduction.

Definition 2.7.9. Partial order.

A partial order on a nonempty set \(X\) is a binary relation \(\leq\) satisfying the following properties.
  1. Reflexivity.
    \(x\leq x\) for all \(x\in X\)
  2. Antisymmetry.
    If \(x\leq y\) and \(y\leq x\text{,}\) then \(x=y\text{.}\)
  3. Transitivity.
    If \(x\leq y\) and \(y\leq z\text{,}\) then \(x\leq z\text{.}\)
A partially ordered set (or POSET) is a pair \((X,\leq)\) where \(\leq\) is a partial order on \(X\text{.}\)
Let \((X,\leq)\) be a POSET. A subset \(Y\subseteq X\) is totally ordered (or a chain) with respect to \(\leq\) if for all \(x,y\in Y\text{,}\) either \(x\leq y\) or \(y\leq x\text{.}\) An upper bound of \(Y\) is an element \(u\in X\) such that \(u\geq y\) for all \(y\in Y\text{.}\) An element \(m\in Y\) is a maximal element of \(Y\) if \(m\leq y\) implies \(m=y\) for all \(y\in Y\text{;}\) similarly, \(m\) is a minimal element of \(Y\) if \(y\leq m\) implies \(m=y\) for all \(y\in Y\text{.}\)

Proof.

Zorn’s lemma is equivalent to the axiom of choice. You can find a proof of this fact in most set theory textbooks.

Remark 2.7.11. β€œTotally ordered” versus β€œchain”.

We prefer the term β€œtotally ordered” to β€œchain”, as the latter connotes a countable set. This is important, as to verify the conditions of Zorn’s lemma, we must consider all totally ordered subsets, countable or not. It is not enough to just look at countable totally ordered subsets \(Y\text{.}\)
Let’s see how we can use Zorn’s lemma to prove the existence of maximal ideals. The underlying POSET we will use is a certain set of ideals of a ring, along the the partial order given by set inclusion.

Proof.

First observe that the second statement follows from the first by taking the ideal \(I=(0)\text{:}\) a maximal ideal containing \((0)\) is the same thing as a maximal ideal.
Now let \(I\) be a proper ideal of \(R\text{,}\) and let \(X\) be the set of all proper ideals of \(R\) containing \(I\text{:}\)
\begin{align*} X \amp =\{J\subseteq R\mid J \text{ a proper ideal}, I\subseteq J\}\text{.} \end{align*}
Note that \(X\) is nonempty since \(I\in X\text{.}\) The binary relation given by set inclusion \(\subseteq\) gives \(X\) the structure of a POSET. We use Zorn’s lemma to show that \(X\) has a maximal element with respect to \(\subseteq\text{.}\)
Consider any subset \(Y\subseteq X\) that is totally ordered with respect to \(\subseteq\text{.}\) We must show that \(Y\) has an upper bound in \(X\text{.}\) To this end we let
\begin{align*} J^* \amp =\bigcup_{J\in Y}J\text{.} \end{align*}
It is clear that \(J\subseteq J^*\) for all \(J\in Y\text{,}\) so it remains to show that \(J^*\in X\text{:}\) that is, we must show that (i) \(J^*\) is an ideal containing \(I\text{,}\) and (ii) \(J^*\ne R\text{.}\)
Since \(I\subseteq J\) for all \(J\in Y\text{,}\) and since \(J^*\) is the union of all such ideals, we have \(I\subseteq J^*\text{.}\) To show \(J^*\) is an ideal, we must show that it is a subgroup of \((R,+)\) and is closed under left and right multiplication by elements of \(R\text{.}\) Since \(I\subseteq J^*\text{,}\) we have \(0\in J^*\text{.}\) Now take \(a,b\in J^*\text{.}\) By definition we have \(a\in J\) and \(b\in J'\) for some ideals \(J,J'\in Y\text{.}\) Since \(Y\) is totally ordered, we have either \(J\subseteq J'\) or \(J'\subseteq\text{.}\) Assuming without loss of generality that the first condition holds, we may assume that \(a,b\in J\text{.}\) Since \(J\) is an ideal it follows that
\begin{align*} -a\amp \in J \amp a+b\amp \in J \amp ab\amp \in J \text{,} \end{align*}
and thus that \(-a,a+b,ab\in J^*\text{.}\) This shows that \(J^*\) is an ideal.
Next we show that \(J^*\ne R\text{.}\) This is a consequence of the fact that in any ring \(J=R\) for an ideal \(J\) if and only if \(1\in J\text{.}\) Since \(J\ne R\) for all \(J\in Y\text{,}\) it follows that \(1\notin J\) for all \(J\in Y\text{,}\) and hence that \(1\notin J^*\text{.}\) Thus \(J^*\ne R\text{.}\)
Having verified the conditions of Zorn’s lemma for all nonempty well-ordered subsets of \(X\text{,}\) we conclude that \(X\) has a maximal element \(M\text{.}\) By definition \(M\ne R\) and \(I\subseteq M\text{.}\) We conclude our proof by showing that \(M\) is maximal. Suppose \(M\subseteq J\) for some proper ideal \(J\ne R\text{:}\) then \(J\in X\) by definition, and hence \(M=J\) by maximality. Thus \(M\) is a maximal ideal of \(R\text{.}\)

Remark 2.7.13. Zorn’s lemma and maximal ideals.

You might wonder whether there is a proof of the existence of maximal ideals that does not rely on Zorn’s lemma. Surprisingly, we can say with assurance that the answer is no! As it turns out, not only does Zorn’s lemma imply the existence of maximal ideals, but the converse is also true: the existence of maximal ideals in arbitrary rings implies Zorn’s lemma. As a result, the following statements are all equivalent.
  • Zorn’s lemma.
  • Any nonzero ring has a maximal ideal.
  • The axiom of choice: if \((X_i)_{i\in I}\) is a family of nonempty sets indexed by the nonempty set \(I\text{,}\) then \(\prod_{i\in I} X_i\) is nonempty. (In other words, there is a way to β€œchoose” for each \(i\in I\text{,}\) an element of \(x_i\in X_i\) in order to form a tuple \((x_i)\in \prod_{i\in I}X_i\text{.}\))
In fact, we can add a few more equivalent statements to the list. The three statements above can be shown to be equivalent to the following statements.
  • Every vector space has a basis.
  • Arbitrary products of compact topological spaces are compact. (Tychonoff’s theorem).
  • Every set \(X\) admits a total order \(\leq\) satisfying the well-ordering principle: i.e., every nonempty subset of \(X\) has a minimal element with respect to \(\leq\text{.}\) (Well-ordering theorem.)
It is quite a wonder that these six statements, which on the surface seem to say very different things, are in fact all equivalent. Moreover, on a psychological level there is a great range of plausibility among the various statements: whereas the axiom of choice seems intuitively obvious, the well-ordering principle is quite a shocker, since it would imply that even \(\R\) admits some mysterious well-ordering. This is the basis of the following bon mot attributed to Jerry Bona, professor emeritus at UIC: β€œThe axiom of choice is obviously true, the well-ordering theorem obviously false, and who can tell about Zorn’s lemma?”