§ Specht module construction
§ A[λ][t], and its image
Define
A[λ][t](x)≡π∈C[t]∑sgn(π)π(x).
That is, A[λ][t] creates a signed linear combination of x by creating signed orbits of x under the
column stablizier of t.
First consider
A[λ][t](t)=π∈C[t]∑sgn(π)π(t).
We claim that A[λ][t] is
a projection operator which projects onto the subspace spanned by A[λ][t](t). To show this,
let's consider the action of A[λ][t] on some other tabloid s.
There is a predicate we are
interested in that determines whether A[λ][t](x) is 0 or ±t: If t has two elements a,b
that are in the same column of t, which are in the same row of x. If such elements a,b exist, then
the action of swap(a,b) is trivial on x, as tabloids are invariant under row permutations. Furthermore,
swap(a,b) is in the column stabilizer C[t], since a,b are in the same column of t. Exploiting
this, we write the group C[t] as cosets of the subgroup H≡{id,swap(a,b)}. Now the magic
happens: the action on x via H turns out to be zero:
π∈H∑sgnππ(x)=id(s)−swap(a,b)(x)=x−x=0
Since C[t] partitions as cosets of H, the entire action of C[t] on x becomes zero:
π∈C[t]∑sgnππ(x)=π∈C[t]/H∑(π⋅id)(x)−(π⋅swap(a,b))(x)=π∈C[t]/H∑(π⋅id)(x)−(π⋅swap(a,b))(x)=π∈C[t]/H∑π⋅(id(x)−swap(a,b)(x))=π∈C[t]/H∑π(id(x)−swap(a,b)(x))=π∈C[t]/H∑π(x−x)=π∈C[t]/H∑π(0)=0
On the other hand, let us assume that elements in the same column of t are always in different rows x [
if they are in the same row, then the action is zero as we saw before. ] Let us focus on the cth column of t:
say the elements in this column are t[1][c],t[2][c],…,t[n][c]. These elements will be in different rows of x.
Since we can freely permute rows, we can move these elements t[:][c] to the cth column of x. This makes column x[:][c]
of x be a permutation of the t[:][c] column of t. Now, there is a unique permutation which permutes every column of x
to be like the columns of t. Thus, there is a unique permutation π∈C[x] such that π(x)=t. We can invert this,
to find a permutation π′≡π−1∈C[t] such that π′(t)=x. This will force the value of A[λ][t](x) to be
equal to ±A[λ][t](t), since x differs from t by a permutation:
A[λ][t](x)==A[λ][t](π′t)=σ∈C[t]∑sgn(σ)σ(π′t)=σ∈C[t]∑sgn(σ)(σ∘π′)t)=σ∘π∈C[t]∑sgn(σ)sgn(π′)sgn(π′)(σ∘π′)t)=sgn(π′)σ∘π∈C[t]∑sgn(σ∘π)(σ∘π′)t)=sgn(π′)σ′∈C[t]∑sgn(σ′)σ′(t)=sgn(π′)A[λ][t](t)
Thus, we find that when A[λ][t] acts on a tableaux x, the result is either 0 [when x cannot be obtained by a column permutation of t],
or is ±A[λ][t](t) [when x can be ontained by a column permutation of t]. Thus, the image of A[λ][t]
is a 1-dimensional subspace spanned by A[λ][t](t). So the important property that we have uncovered is that A[λ][t](x)
is non-zero iff t's columns can be permuted to produce x: written formally, we have:
A[λ][t](x)=0⟺∃π∈C[t],π(t)=xA[λ][t](x)=0⟺∃π∈C[t],π(t)=x
§ Inner product and A[λ][t] is self adjoint
We impose the "canonical" inner product on the space of vectors spanned by tabloids, given
by making all non-equal basis tabloids orthogonal:
⟨{t}∣{t′}⟩≡{10{t}≃{t′}otherwise
Under this inner product, we claim that A[λ][t] is self-adjoint:
we have that ⟨A[λ][t](x),y⟩=⟨x,A[λ][t](y)⟩. The key idea is that A[λ][t](y)
is made up of permutations which are unitary, since they simply permute the orthogonal basis vectors,
and these permutations are arranged in A[λ][t] such that the A[λ][t] operator is self-adjoint:
⟨A[λ][t](x)∣y⟩=⟨π∈C[t]∑sgnππ(x)∣y⟩=π∈C[t]∑sgnπ⟨π(x)∣y⟩(π−1 is a permutation of orthonormal basis, hence orthogonal)(π−1 preserves inner produce as orthogonal):=π∈C[t]∑sgnπ⟨π−1π(x)∣π−1y⟩=π∈C[t]∑sgnπ⟨π−1π(x)∣π−1y⟩=π∈C[t]∑sgnπ⟨x∣π−1y⟩(Sum over π−1, is an automorphism:)=π−1∈C[t]∑sgnπ−1⟨x∣π−1y⟩=⟨x∣π−1∈C[t]∑sgnπ−1π−1y⟩=⟨x∣A[λ][t](y)⟩
§ S[λ] is a irreducible subspace of M[λ]
- Define the subspace spanned by {A[λ][t]:t∈tabloid(λ)} as S[λ] (for Specht). Thus, the A[λ][t] span S[λ].
- S[λ] is invariant under S[n], since the action of π∈S[n] on A[λ][t]sends A[λ][t] to A[λ][π(t)]. Also, the full space M[λ] is invariant under S[n] by construction.
- The orbit of any A[λ][t] under Sn gives us the full set {A[λ][t′]:t′∈tabloid, since we can produce A[t′] from A[t] by the action that permutes t into t′.
- For all invariant subspace U, U is either disjoint from S[λ] or U contains S[λ]. So it is impossible to reduce S[λ] into a smaller invariant subspace U.
- Consider some invariant subsepace U. If it is disjoint from S[λ], then we are done.
- Otherwise, assume there is some x∈S[λ]∩U.
- As x∈S[λ] and S[λ] is spanned by {A[λ][t](t):t∈tabloid}, there must be some t′ along which x has a component: ⟨x∣A[λ][t′](t′)⟩=0.
- Since A[λ][t′] is symmetric, I can write the above as ⟨A[λ][t′](x)∣t′⟩=0. Now since the image of A[λ][t′] is the subspace spanned by t′, since U is invariant under A[λ][t′], and since ⟨A[λ][t′](x)∣t′⟩=0, we can say that A[λ][t′](x)=αt′∈U for α=0. This tells us that we have the vector t′∈U.
- Once we have a single t′∈U, we win, since all the other t's are obtained as permutations of t′, and U is an invariant subspace of these permutations.
- TLDR: if we havs some common vector x∈S[λ]∩U, then ⟨x∣A[λ][t](t)⟩=0. By self-adjoint, we get ⟨A[λ][t](x)∣t⟩=0. But A[λ][t](x)=kt,xA[λ][t](t), hence kt,x=0. Further, A[λ][t](x)∈U since U is invariant and x∈U, hence kt,xA[λ][t](t)∈U for kt,x=0 hence A[λ][t](t)∈U. This forces all of S[λ]∈U, since U is invariant and S[λ] is generated by the various {A[λ][t](t):ttabloid}, which are obtained by permutation of of A[λ][t](t) for a given t.
§ The argument, in the abstract
Let V be a finite dimensional real vector space with inner product ⟨⋅∣⋅⟩.
Let H:V→V be a symmetric operator with rank 1 image, eigenvector h∈V.
For simplicity, say that the eigenvalue of h is 1, so Hh=h.
( H Hermitian is defined as ⟨Hx∣y⟩=⟨x∣Hy⟩)
Let O be a group of orthogonal matrices. Define a subspace S of V given
by the O-span of the image of H: S≡span({Oh:O∈O}).
We wish to show that S is an irreducible O,H-invariant subspace. By construction, S is O,H-invariant,
since it takes the subspace spanned by h and makes it invariant under O.
To show that this is irreducible, suppose we have some O,H invariant subspace W. We wish to show
that if W contains a single vector from S, then it contains all of S: W∩S=∅⟹S⊂W.
- Suppose that x∈W∩S. Since S is spanned the various Oh, there must be some O∈Osuch that ⟨x∣Oh⟩=0.
- Since O is orthogonal, we can shift the rotation towards x by rotating the entire frame by O−1, giving us ⟨O−1x∣h⟩=0.
- Since h is an eigenvector, we replace h by Hh giving us ⟨O−1x∣Hh⟩=0.
- Since H is hermitian, I rewrite the above as ⟨HO−1x∣h⟩=0.
- Since x∈W and W is invariant under O and H, we have that HO−1x∈W.
- Also, since the image of H lies entirely along h, we have that HO−1x=αxh. Combining with ⟨HO−1x∣h⟩=0 gives us ⟨αxh∣h⟩=0, or α=0.
- Thus, the non-zero vector αxh∈W (non-zero as αx=0). Hence, the vector h∈W. Since W is closed under O and S is generated as Oh, we have that S⊆W.
§ Showing that H as thesigned linear combination of O is Hermitian
In the abstract, we define H≡∑O∈O∣O∣O, which specializes
to Ht≡∑π∈Ctπsgn(π) in the tableaux theory. Now consider
HT=∑O∈O∣O∣OT Since O is orthogonal, OT=O−1. Furthermore,
we have that ∣O∣=∣O∣−1 since:
∣O−1∣=∣O∣−1∣O∣=±1=(±1)−1=±1
Combined, this tells us that HT=∑O∈O∣O∣−1O−1. Since O
is a subgroup, the sum can be re-indexed to be written as HT=∑O′∈O∣O′∣O′,
which is equal to H. Hence, we find that HT=H, or H defined in this way is hermitian.
§ Showing that H is rank 1
In the symmetric group case, we consider:
Ax≡σ∈Ct∑sgn(σ)σ
Now say we have some other y. The two cases are:
- y∈Orb(x,Cx). We have y=πx for π∈Cx In this case, the expression for Axy can be written as Ax(πx)which is equal to sgn(π)Ax(x). So this belongs to the subspace of Ax(x).
- y∈Orb(x,Cx). This means that we cannot rearrange the columns of tabloid x to get tabloid y (upto row permutation).
- That is, we have:
xa -> 1
xb -> 1
xc -> 2
- where two elements in the same column of x
(xa, xb)
want to go to the same row of y. If all elements in the same column of x (xa, xb, xc)
wanted to go to different rows of y (3, 1, 2)
, we could have permuted x in a unique way as (xb, xc, xa)
to match the rows. This tells us how to convert x into y, for this column. If we can do this for all columns, we are done. - The only obstruction to the above process is that we have two elements in the same column of x
(xa, xb)
that want to go to the same row of y. Said differently, there is a permutation p that swaps xa <-> xb
that is in Cx (since (xa, xb)
are in the same column), whose action leaves y unchanged (since y a tabloid has these elements in the same row; tabloid invariant under row permutation). - Thus, we can write Ct as cosets of the subgroup {e,p} whose action of y will be:
(sgn(e)e+sgn(p)p)(y)(e−p)(y)y−y=0
- Thus, the action of the full Ct, written as cosets of {e,p} cancels out entirely and becomes zero, since every coset is of the form h{e,p}, ie {h,hp}. And the action of this will be:
(sgn(h)h+sgn(hp)hp)y=sgn(h)hy+sgn(h)(−1)hpy=sgn(h)hy−sgn(h)hpy=sgn(h)hy−sgn(h)hy=0
- Thus, either an element y is in the orbit Cx or not. If it's in the orbit, we get answer ±Axx. If it's not, we get zero.
§ Have we found all the irreps?
Recall that the number of irreps is upper bounded by the number of conjugacy classes of the group. This follows
from character theory: (1) the characters of irreps are orthogonal in the space of class functions, and
(2) the dimension of the space of class functions is is equal to the number of conjugacy classes, since there
are those many degrees of freedom for a class function --- it must take on a different value per conjugacy class
[TODO: finish my character theory notes ]. In our case, we have found one irrep per conjugacy class, since
conjugacy classes of Sn is determined by cycle type, and the shape of a diagram encodes the cycle type of
a permutation. If we show that the irreps of different shapes/diagrams are inequivalent, we are done.
§ Characterizing Maps S[λ] to S[μ]
We wish to prove the key lemma, which is that if we have a non-zero map f:M[λ]→M[μ],
then λ⊵μ. Let's consider the extreme cases with 3 elements:
λ = (1 1 1):
* * *
μ = 3:
#
#
#
- Let l be a λ tableau, m be a μ tableau.
- Let's consider A[l](m) and A[m](l).
- For Al(m) to be non-zero, we need a way to send elements of m in the same column (
#; #; #
) to correct rows in l ( * * *
)But see that lhas only one row, and m has no choice: it must send all its elements in all columns to that single row of l. Thus, the Cl [WRONG ] don't hinder us from doing the only thing we possibly can. - For Am(l) to be non-zero, we need a way to send elements of l in the same column, of which there are three columns,
*
, *
, *
, to different rows of m. But if l were feeling stubborn, it could say that it wants each of its *
's to end up in the first row of m. m will be overcrowded, so this leads to the map becoming zero. - In general, if λ▹μ, then the map A[λ](μ) can be nonzero, since we need to send elements in the same column of μ to different rows of λ, and λ is "bigger", [WRONG?! ]
Thus, we have found ALL irreps, since as argued before, there can be at most as many irreps as there are shapes/diagrams of n,
and we've shown that each irrep that corresponds to a shape is distinct.
§ All the S[λ] are distinct irreps of Sn by Schur's lemma
Suppose that S[λ]≃S[μ]. Thus we have an invertible intertwining map T:S[λ]→S[μ].
By Schur's lemma, since we know that S[λ] and S[μ] are irreps,
we know that T is a scalar multiple of the identity map. Let m be a tabloid of shape μ.
We know that A[μ][m](m)∈S[μ]. Now consider T−1(A[μ][m](m)). This must be
equal to A[μ][m](T−1(m)). This means that T−1(m) is not zero when acted upon
by A[μ][m](m), thus T−1(m), of shape λ must dominate shape μ
[Argue why this is the case by adapting the proof seen before about spaces ].
Ruunning the argument is reverse, we get both directions of λ⊵μ and μ⊵λ,
there by establishing λ=μ.
§ Working it out for S3
§ Tabloid(3)
There's only one tabloid of shape 3
, which is {1 2 3}
. Thus we get a 1D complex vector space with
basis vector b{1, 2, 3}
. Every permutation maps b{1, 2, 3}
onto itself, so we get the trivial
representation where each element of S3
is the identity map.
§ Tabloid(2, 1)
There are three tabloids of shape (2, 1)
, one for each unique value at the bottom. The top row can be
permuted freely, so the only choice is in how we choose the bottom. We get the tableaux
{1 2}{3}
= [1 2][3]
= [2 1][3]
, drawn as:
[1 2] = [2 1] = {1 2}
[3] [3 {3}
And similarly we get {1 3}{2}
and {1 2}{3}
. So we have a three dimensional vector space.
Now let's look at the action of the A
operator A: Tableaux -> GL(V(Tabloid(mu))
. First of all,
we see that the A
operator uses tableaux and not tabloids (because we
need to know which elements are in the same column).
Recall that the action of A(t)
on a tabloid x
is to sum up linear combinations of sgn(π)π(x),
where π is from the column stabilizer of t
.
A(t)(x)≡π∈col-stab(t)∑sgn(π)π(x)
So let's find the action! The tableaux [1 2][3]
, ie:
[1 2]
[3]
has as column stabilizers the identity permutation, and the
permutation (1 3)
obtained by swapping the elements of the columns [1..][3]
Thus, the action of A([1 2][3])
on a tabloid {k l}{m}
is the signed linear combination of the action
of the identity and the swap on {k l}{m}
:
A([12][3])({kl}{m})=1⋅{kl}{m}+(−1)⋅mlk
Recall that the basis of the Specht module is given by A([t])({t})
, where we have the tableaux t
act on its own tabloid. In the case where t = [1 2][3]
we get the output
A([1 2][3])({1 2}{3}) = {1 2}{3} - {3 1}{2}
Similarly, we tabulate all of the actions of A(x)({x})
below, where we
pick the equivalence class representative of tabloids as the tabloid whose
row entries are in ascending order.
A([1 2][3])({1 2}{3})
= (id - (1, 3))({1 2}{3})
= {1 2}{3} - {3 1}{2}
= {1 2}{3} - {1 3}{2}
A([2 1][3])({2 1}{3})
= A([2 1][3])({2 1}{3})
= A([2 1][3])({1 2}{3})
= (id - (2, 3))({1 2}{3})
= {1 2}{3} - {1 3}{2}
A([1 3][2])({1 3}{2})
= (id - (1, 2))({1 3}{2})
= {1 3}{2} - {2 3}{1}
A([3 1][2])({3 1}{2})
= (id - (3, 2))({3 1}{2})
= (id - (3, 2))({1 3}{2})
= {1 3}{2} - {1 2}{3}
A([1 2][3])({1 2}{3})
= (id - (1, 3))({1 2}{3})
= {1 2}{3} - {3 2}{1}
= {1 2}{3} - {2 3}{1}
A([2 1][3])({2 1}{3})
= (id - (2, 3))({2 1}{3})
= (id - (2, 3))({1, 2}{3})
= {1 2}{3} - {1 3}{2}
If we now label the vector as {2 3}{1} = a
, {1 3}{2} = b
, {1 2}{3} = c
, written
in ascending order of the element of their final row, we find that A(x)(x)
gave us the vectors:
A([1 2][3])({1 2}{3})
= {1 2}{3} - {1 3}{2} = c - b
A([2 1][3])({2 1}{3})
= {1 2}{3} - {1 3}{2} = c - b
A([1 3][2])({1 3}{2})
= {1 3}{2} - {2 3}{1} = b - a
A([3 1][2])({3 1}{2})
= {1 3}{2} - {1 2}{3} = b - c = -(c-b)
A([1 2][3])({1 2}{3})
= {1 2}{3} - {2 3}{1} = c - a
A([2 1][3])({2 1}{3})
= {1 2}{3} - {1 3}{2} = c - b
where the subspace spanned by the vectors (a-b)
, (b-c)
, (c-a)
is
two dimensional, because there a one-dimensional redundancy (a-b) + (b-c) + (c-a) = 0
between them. Furthermore, the basis vectors (a - b)
, (b - c)
, (c - a)
are invariant
under all swaps, and are thus invariant under all permutations, since all permutations can be
written as a composition of swaps. So we have found a two-subspace of a three-dimensional
representation of S3
. To see that this subspace is irreducible, notice that given any permutation
of the form k - l
, we can swap the letters k, l
and the third letter m
to obtain the entire
basis. Hence, this subspace is indeed irreducible, and the representation of Sn
that we have
is indeed an irreducible representation.
§ Tabloid(1, 1, 1)
There are 6 tabloids of shape (1, 1, 1)
, given by the permutations of the numbers {1, 2, 3}
.
If we write them down, they're going to be (a) {1}{2}{3}
, (b) {1}{3}{2}
, (c) {2}{1}{3}
,
(d) {2}{3}{1}
, (e) {3}{1}{2}
, (f) {3}{2}{1}
. This gives us a 6 dimensional vector
space spanned by these basis vectors.
Let's now find out the value of A([1][2][3])({1}{2}{3})
recall that we need to act
on {1}{2}{3}
with all column stabilizers of A([1][2][3])
.
§ A
on tabloid instead of tableaux
I claim that the different A_t
and A_s
for {t} = {s}
differ only by sign [Why?
Because we can reorder the elments of t
and s
to suffer a sign ]. Thus, we can
directly define A_{t}
on the tabloids , by defining it as first sorting the rows of t
and then using A_t
.