The Schröder–Bernstein theorem

Posted on Mon, Aug 10, 2020 basic math

I decided that my first blog post should be about something that gave me the ‘wow’ feeling - and I immediately knew that it has to be the Schröder–Bernstein theorem. The proof of this theorem was skipped in the lecture (as an exercise :P) and I was waiting for the lecture to get over to try the proof. After hours of failed attempts, I thought I might be missing something trivial and decided to google it. I am a fan of elegant proofs and this theorem has been close to my heart ever since. Okay, let's cut the drama and jump straight into the theorem.

The Schröder–Bernstein theorem

Theorem: Let A,BA,B be any sets. If there are injective functions f:ABf: A \rightarrow B and g:BAg: B \rightarrow A then there exists a bijection h:ABh: A \rightarrow B.

Before going into the proof, I would like to introduce you to an alternate restatement of this theorem. This is not used in the proof and can be skipped.

Let us first fix the following terminologies.

The Schröder–Bernstein theorem can now be restated as follows:

Theorem (restated): For any sets A,BA,B if AB|A| \leq |B| and BA|B| \leq |A| then A=B|A| = |B|.

Proof (Finite Case)

Let us do an easy special case first. Say A={a1,,an}A = \{a_1,\ldots,a_n\} and B={b1,,bk}B = \{b_1, \ldots,b_k\} be finite sets with injections f:ABf: A \rightarrow B and g:BAg: B \rightarrow A. If f(A)=Bf(A) = B, we are done as ff is a bijection as well. We now argue that if f(A)Bf(A) \subsetneq B then there can not exist any injection from BB to AAwhich is a contradiction. Since ff is an injection, f(ai)f(aj)f(a_i) \neq f(a_j) whenever iji \neq j, meaning BB can be written as B={f(a1),,f(an),bn+1,,bk}B = \{ f(a_1),\ldots,f(a_n),b_{n+1},\ldots, b_k \}. As k>nk > n, by pigeonhole principle there can not exist an injection from BAB \rightarrow A.

Proof (Infinite Case)

The above proof idea does not carry over to the case when AA and BB are not finite. There are various proofs of this theorem. The proof presented in this post is based on what I read from here (although I have used my own terminologies and visualized this as a graph).

Construct a directed graph G(V,E)G(V,E) as follows:

.

It follows from the injectivity of ff and gg, that both the indegree and outdegree of any vertex in this graph is atmost 1. So chains that have branches (as shown in the following diagrams) can not exist.

Hence the graph GG consists of disjoint components each of which is either a cycle or an infinite chain.

An infinite chain can either start with a vertex from set AA (that is a blue vertex) or a vertex from set BB (a pink vertex). Let us call them blue chains and pink chains respectively. Observe that the first element of each pink chain do not have an incoming edge and these are precisely those elements in BB that does not have a pre-image with respect to ff.

💡

The idea is to modify the pink chains so that all pink elements in this chain has exactly one incoming edge while preserving the number of outgoing edges of the blue elements.

We are now ready to define our bijection h:ABh: A \rightarrow B. Since all pink elements in cycles and blue chains have an incoming edge, hh mimics ff for the blue elements in these components.

However on pink chains, hh maps each blue element to its parent vertex in the graph GG.

More formally h:ABh: A \rightarrow B is defined as

h(a)={f(a) if ablue chain or cycleg1(a) if apink chainh(a) = \begin{cases}f(a) \text{ if } a \in \text{blue chain or cycle} \\ g^{-1}(a) \text{ if } a \in \text{pink chain} \end{cases}

Let us verify that hh is indeed a bijection. First of all, hh is a valid function as hh is defined on all blue vertices and each blue vertex has out-degree 1. Further, each pink vertex has in-degree exactly equal to 1 implying hh is a bijection. That's all - we are done 🙂

Aside: Observe that if AA and BB were finite sets then all the components are cycles!

Remarks

  1. There are various proofs of this theorem and the ideas used in this version are very similar to Konig’s proof of this theorem.
  2. This theorem is also referred to as Cantor–Schröder–Bernstein theorem.
  3. Any mistakes - technical, typos or any other mistake are solely my own. Please let me know if you spot any. Thanks!