§ Mobius inversion on Incidence Algebras
Most of these functions are really defined on the incidence algebra of
the poset P with ground field K. An incidence algebra I(P) is a
set of functions which maps intervals of P to the ground field K. an
interval is a tuple (x,y)∈P×P such that x≤P
(where the ≤ comes from the partial order on P). We have a product
structure which I denote ⋆, given by:
(α⋆β)([x,z])=x≤y≤z∑α(x,y)β(y,z)
A linear algebra way to look at this is to consider ∣P∣x∣P∣ matrices over K
where the rows and columns are indexed by P.
The a function α:P×P→K
can be written as the elements of the P×P matrix.
Then this convolution-like operator ⋆ is simply matrix multiplication.
We have three natural functions:
(1) The characteristic function, which is the identity for ⋆:
δ([x,z])≡1 if x=z; 0 otherwise
(2) the zeta function, which plays the role of the constant 1:
ζ([x,z])≡1 if x≤z; 0 otherwise
(3) The inverse of the zeta function, the mobius function, a tool for mobius inversion:
μ([x,z])=1μ([x,z])=−x≤y<z∑μ([x,y])
The mobius inversion theorem for posets states that ζ and μ as
defined above are convolutional inverses. that is, ζ⋆μ=δ.
This allows us to prove:
g([x,z])=x≤y≤z∑f([x,y])g([x,z])=x≤y≤z∑f([x,y])⋅1g([x,z])=x≤y≤z∑f([x,y])⋅ζ(y,z)g=f⋆ζg⋆μ=f⋆ζ⋆μg⋆μ=f⋆δg⋆μ=f
We have managed to find f in terms of g, when previously we had g
in terms of f.
TODO : we are usually interested in a fixed [x,z]. What happens if we
make this implicit? We may get nice notation for all of this!
§ Sums as mobius inversion
We can derive the formula we had for integrals, that:
F(i)=0≤i≤n∑f(i)⟺f(k)=F(k)−F(k−1)
by setting up mobius inversion on the usual partial order for the natural
numbers. For simplicity, I'll show the example on [0,1,2,3,4]. The example
immediately generalizes.
- We have the partial (well, total) order P: 0<1<2<3<4.
- We are given a function f(⋅) we wish to integrate. We define an auxiliary function fi([x,y])=f(y) which evaluates f on the right endpoint.
- We can now define F([x,z]) as the sum of f from x to z:
F([x,z])≡x≤y≤z∑f(y)=x≤y≤z∑fi([x,y])=x≤y≤z∑fi([x,y])⋅ζ(y,z)=fi⋆ζ
- This tells us that f(n)=fi([0,n])=(F⋆μ)([0,n]):
f(n)=fi([0,n])≡(F⋆mu)[0,n]=0≤x≤n∑F([0,x])μ([x,n])
- We note that we need to know the values of μ([x,n]) for a fixed n, for varying x. Let us attempt to calculate μ([0,4]),μ([1,4]),μ([2,4]),μ([3,4]),μ([4,4])and see if this can be generalized:
μ([4,4])=1 By definitionμ([3,4])=−(3≤x<4∑) By definition