You are currently browsing the monthly archive for February 2009.
The following (open) question seems to hold the key to an area of combinatorics I had previously not considered:
Question: For which does there exist an
by
matrix with orthogonal rows consisting only of 1s and -1s (an
–Hadamard Matrix)?
There has been some progress towards answering this question. I am told it can be shown (and I shall probably do so in a later thread if it is possible for me to find the proof myself) that if there exists an -Hadamard matrix then
. It is also conjectured that for any
in this set we can find an
-Hadamard matrix, although this has not yet been proved in full.
In this post I will not really touch on any of the above results, but instead just briefly explain how my attention was brought to these objects in order to shed light on the combinatorial importance of such matrices and the idea of representing combinatorial objects by such 1,-1 matrices (as opposed to using matrices with elements 0 and 1 or matrices over , which I had found more natural previously). It arose during my investigations into the following problem (set, in a slightly different language, by Tim Gowers to Part IA students in the Probability course of the tripos).
Problem: Given and a base set
of size
, and distinct
-subsets
with the property that
whenever
, what is the largest possible value
of
?
After a little fiddling around (for about an hour), I stumbled upon an inductive construction to prove that was possible, which turned out to be more or less equivalent to the construction below.
Proof that :
Take wlog. If
a construction is simple for
(
).
If we let
and pick distinct subsets
of
with size
and pairwise intersections
using the induction hypothesis. It is then clear that if we choose our
as follows, they have the required properties:
.
It then seemed sensible to look for a method of proving that , so it would suffice to show that
gave a contradiction. After a bizarre collaborative lunch with Freddie Manners, the following method emerged:
Proof that unless
:
Suppose satisfy the required hypothesis. Then consider the matrix given by
. Then
. But this thing can (by considering the characteristic equation of the 1-matrix) be shown to have a determinant equal to
for some (very large)
. But its determinant must be a square, being
. So we require the existence of a solution to the Diophantine equation
. This is a routine equation, factorising and yielding the only solution as
. So for any other
such sets
cannot possibly exist.
This proof is clearly weird and a little undesirable in two respects. Firstly, and most obviously, it doesn’t work for the case , which would require a separate argument (or, as is fairly simple to do, a computer check). Secondly, and more significantly, it reduces a combinatorial problem to solving a diophantine equation which, by coincidence, is easily solved. This has the feeling more of a bizarre accident than the reason for the result to be true. A much better reason was given to me in a supervision by Ben Green.
Proof 2: (here is also shown to be impossible)
Instead of taking a matrix of indicator functions we take a matrix of 1s and -1s, say . The reason this is sensible seems, to me, to be that we have symmetry in the statements
, since replacing all the sets by their complements will change nothing, and in this case equates to multipication by
. In this universe of 1s and -1s, it is clear that any two rows of the matrix are orthogonal in the usual inner product. But they are also orthogonal to (1,1,…,1). So there is an linearly independent orthogonal set of size
vectors in a space of dimension
. Contradiction.
In fact, looking at it this way shows that the problem was really about Hadamard matrices, and that the construction for was in fact a proof of the following result:
Result 1: There exists a -Hadamard matrix for any
.
Sylow’s Theorem, from group theory, states that:
- If
then
contains a subgroup of order
.
- Any two such subgroups are conjugate.
- The number
of such subgroups satisfies
.
The standard proof is to attack 1 using a clever binomial coefficient trick, and then use different arguments to tackle 2 and 3. The following proof, shown to the author by Prof. Leader of Trinity College, Cambridge, more or less proves all three parts simultaneously, and is very natural. The proof is probably slightly different from that I was originally shown, but is hopefully identical in spirit.
Proof: For a finite group , let
be a maximal
-subgroup (a subgroup of largest cardinality where every element has order a power of
).
Proof of part 1: It will suffice if we can prove that . But
, where
is the normaliser of
in
. Splitting the fraction in this way allows us to assign meaning to the quantities.
Since is normal in
, we can measure the second quantity as simply the order of the group
. Suppose
divides this order. Then, by Cauchy’s Theorem, we can find
of order
. But then
would be a
-subgroup and have order greater than
, contradicting its maximality.
Now, by the Orbit-Stabiliser theorem considering the action of conjugation on by
, we can see that the first quantity must be the size of the orbit of
under this action. Let us call the set of these orbits
, so we need to show that
. In fact, we can do better and show that
, which will immediately give part 3 of the theorem as a consequence of part 2.
To do this we consider acting on
by conjugation, and aim to show that
is the only singleton orbit (and since
is a
-subgroup, all the other orbits must have sizes divisible by
, so this would give the required congruence). Suppose
is another singleton orbit (with
). Then
normalises
, which implies
normalises
, so
. But in this case, it is clear that
is a
-subgroup of order greater than that of
(having nontrivial image of order
under the quotient map
). Thus the only singleton orbit is
, whence
.
So it remains to prove statement 2 (essentially that ). Suppose
is a Sylow
-subgroup. If we let
act by conjugation on
, then there must be some (unique) singleton orbit
which is normalised by
. But considering that the image of
under the quotient map
must be a single element (by maximality of
and the fact all elements of
have order divisible by
), we must have
. So
, as required.
As remarked above, the third part is now immediate.