This site is supported by donations to The OEIS Foundation.

User:Natalia L. Skirrow/an introduction to Stirling numbers

From OeisWiki
Jump to navigationJump to search

foreword

This page is called introductory, and the intention is to require only familiarity with ideas of generating functions and linear algebra, but the subject has some extremely interesting parts that require more advanced methods. An elementary formulation is given for #Ramanujan's master theorem, but in #asymptotics and properties as statistical distributions, Hayman's saddle point method from [1] is used without duplicating its (quite involved) explanation. #recurrence relations lies at the very end, because an important part of it requires the Bernoulli/Gregory theory established first.

tools

Let ((nk))=(n+k1k) be the number of ways to form a k-multiset from an n-set by drawing with repetition.

We use generating functions; these encode sequences as functions to represent operations on them; in particular, the ordinary g.f. is ogf(a)=n=0a(n)xn, and the exponential g.f. egf(a)=n=0a(n)xnn!.

Multiplying g.f.s convolves their sequences in different ways; convogf(fg)(n)=k=0nf(k)g(nk) and convegf(fg)(n)=k=0n(nk)f(k)g(nk)

k! and Hk(n) are treated as complex functions, Γ(k+1) and (1)n1(n1)!ψn1(k+1)+({γn=1ζ(n)else)

The notations used here ([nk] and {nk}) are due to Jovan Karamata from 1935

The l'hospital theorem is used implicitly; specifically, when dividing two functions at a point where both have simple poles (which is done a lot in #Ramanujan's master theorem), we consider the quotient of their residues

calculus of finite differences

We define Δ(f)(x)=f(x+1)f(x) and (f)(x)=f(x)f(x1), and the falling and rising factorials xn_=x!(xn)! and xn=(x+n1)!(x1)!.

In particular, these operators Δ and are o.g.f. convolutions with 1x1 and 1x, respectively, so by binomial-expanding their exponents, their higher-order iterates are Δn(f)(x)=i=0n(ni)(1)nif(x+i) and n(f)(x)=i=0n(ni)(1)if(xi) (with negative iterates Δn(f)(k)=i=0kn(k1in1)f(i)).

Similarly to how a holomorphic function is converged to by the evaluation of its power series f(x)=n=0f(n)(0)xnn!, sequences are 'converged' to (eventually matched at all indexes by) the evaluation of their inverse binomial transforms; f(k)=n=0kΔn(f)(0)(xn_n!:=(xn)) and f(k)=n=0kn(f)(0)(xnn!:=(x+n1n)); as such, one can write Δn(f)(0)=[(xn)]f(x) and n(f)(0)=[((xn))]f(x)

We as such define a quartet of analogues,

ogf(a)=n=0a(n)xnegf(a)=n=0a(n)xnn!fgf(a)=n=0a(n)xn_bgf(a)=n=0a(n)(xn)rgf(a)=n=0a(n)xnmgf(a)=n=0a(n)((xn))

The 'factorial powers' (Pochhammer symbols) are joined by xn_=(xn+1)n, xn_=(1)n(x)n and xn_=1(x+1)n, and work similarly to monomials in that (see Concrete Mathematics, page 53, eq. (2.53))

Δxn_=nxn1_ddxxn=nxn1xn=nxn1x=ab1xn_={HbHan=1bn+1_an+1_n+1n1x=abxn={log(b)log(a)n=1bn+1an+1n+1n1x=a+1bxn={HbHan=1bn+1an+1n+1n1

(in fact, the rules for the calculus of infinitesimal differences comes from 'zooming out' in both the input and output axes on this calculus of finite differences)

finally, the binomial theorem has falling and rising analogues; k=0n(nk)xk_ynk_=n!k=0n(xk)(ynk)=(yn)2F1(n,x1n+y;1)=(x+y)n_ (using Gauss's hypergeometric theorem; the falling-rising joining identity directly brings it to the equivalent rising factorial identity)

power-analogue notations

It is useful to bear in mind the combinatorial definitions of these powers (and some more that we will variously see along the way);

  • xn counts functions from [n] to [x]
  • xn_=n!(xn) counts injective functions
  • xn=n!((xn)) counts functions with preimages totally ordered into lists
  • x{n}=x!{nx} counts surjective functions (a proposal from [2])
  • x[n]=x![nx] counts surjections with preimages cyclically ordered
  • x(n)=x!Lah(nx)=n!(n1x1) counts surjections with preimages totally ordered (note this deviates from some sources (like the Wikipedia page) which use this notation to mean xn)

while the product definition lets us evaluate Pochhammer symbols like xa on noninteger x, the gamma function lets us compute them on noninteger a! [3] provides the inequality 1xaxa(1+ax)1a for a[0,1], with equality at boundaries; taking series expansions shows that as x, the logarithms of the bounds' error ratios converge; log(xaxa)log((ax+1)1axa/xa), so the lower inequality is about as good as the upper one; the inequalities both flip into s for a1 and the now-upper bound continues to work as an approximately-as-good counterpart to the obvious one. (see also Gautschi's inequality)

the sequence f(n)=Δn(f)(0) (the inverse binomial transform, analogous to the power series) can be considered as a linear transformation on vector spaces (f(0)f(1)f(2))=(100110121)(f(0)f(1)f(2)), we can take its inverse, (f(0)f(1)f(2))=(100110121)(f(0)f(1)f(2))

can be represented as an e.g.f. convolution, egf(f)(x)=exegf(f)(x), and as an o.g.f. ogf(f)(x)=ogf(f)(x1+x)1+x (so ogf(f)(x)=ogf(f)(x1x)1x)

since (when computing xn) we can choose k outputs to have preimages, we have xn=k=0x(xk)k{n}; that is, λn.xn is the binomial transform of λn.x{n}. By the same reasoning elsewhere, we have

xn=k=0x(xk)k{n}x{n}=k=0x(1)xk(xk)kn=Δx(λk.kn)(0)xn~=k=0x(xk)k[n]x[n]=k=0x(1)xk(xk)kn~=Δx(λk.kn~)(0)xn=k=0x(xk)k(n)x(n)=k=0x(1)xk(xk)kn=Δx(λk.kn)(0)

In this case, is the statement "if you have two sequences a(n),b(n), counting the numbers of type-a and type-b structures on n-element labelled sets, with e.g.f.s A(x),B(x), then [xnn!]A(B(x)) (denoted a(b)(n)) is the number of ways to put n elements into type-b objects, then put these into a type-a object."

In particular, we can associate the exponent of B(x) with another variable y, to get that [xnn!yk]A(yB(x)) is the number of ways to put n elements into k b-objects in one a-object.

the most important sequences in this case are sets(n)=1, cycles(n)=(n1)!, lists(n)=n! (with e.g.f.s ex, log(11x) and 11x).

Note that cycles must be nonempty, but for the other two we subtract 1 from the e.g.f.s to count their nonempty forms; we denote this with an overline.

examples

  • Sets(Cycles(x))=elog(11x)=11x=Lists(x), meaning there is a bijection between lists and sets of cycles (ie. permutations' cycle decompositions).

if both a- and b-type structures are totally ordered (ie. in both, each instance of an n-node structure is part of an equivalence class of n! that are node-permutations of each other, and the nth terms of both are divisible by n!), then the n nodes can be considered unlabelled to divide by n! (considering their e.g.f.s as o.g.f.s). In particular, this means that o.g.f. composition for sequences on unlabelled sets imbues total orderings on all but the leaf nodes of the partition-tree.

  • Lists(Lists(x))=11x1x=1x12x, and (for k1) there are lists(lists(lists(n timesk)))=[xk]1(n1)x1nx=nk1 ways to put k unlabelled objects into a tree of lists n layers deep (since 1(n1)x1x1nx1x=1nx1(n+1)x). Equivalently, this is the number of ways to arrange brackets around k points such that each of them is n layers deep, and all bracket-pairs contain at least one point.
  • allowing all lists to contain singletons instead of only the depth-n ones can be recursively constructed 'from the leaves up'; we define f0=0 and thereafter Fn(x)=Fn1(x)21Fn1(x)+x; ie. must be a singleton or contain at least two members of the previous iteration.
  • this is difficult to analyse directly, but the fixed point of this iteration, F, is not! Since the recurrence is in terms of itself rather than the preceding iterate, by the quadratic formula we get 1+x16x+x24, the functional inverse of x(12x)1x, and so f(k)=A001003(k-1).
since its inverse's derivative, 21(1x)2, becomes zero at x=112, at which the inverse's maximum is 38, this is F's radius of convergence, so f(k)f(k1)3+8

Stirling-type triangles

cycle numbers (|A008275|,A132393)

As mentioned, a permutation is a set of cycles, so we define [nk]=[xnn!yk](Sets(yCycles(x))=eylog(11x)=1(1x)y).

Since [xnn!]1(1x)d=n!((dn))=dn (the g.f. for a general degree-d polynomial is some degree-d numerator times this), we have k=0n[nk]yk=yn, so can write [nk]=[yk]yn. (The reason for the unfortunate historical use of s(n,k)=(1)nk[nk] is that this gives s(n,k)=[yk]yn_.)

We can form a correspondence; adding an n+1th element to a length-n permutation can be done by

  • choosing one of the n existing elements, mapping the n+1th to it, and mapping its previous preimage to the n+1th
  • mapping the n+1th to itself

in particular, each permutation has n+1 successors, but each of the resultant n+1-element permutations has only one preimage. Specifically, in [n+1k] there are n elements for each in [nk] and one for each in [nk1], giving [n+1k]=n[nk]+[nk1].

for the representation in elementary symmetric polynomials and a bijective proof of its rook-theoretic interpretation, see Stirling factoradics

subset numbers (A008277,A048993)

These count partitions into sets of nonempty sets; {nk}=[xnn!yk](Sets(ySets(x))=ey(ex1))

By binomial-expanding the e.g.f. for the kth column, (ex1)kk!=i=0k(ki)(1)kieixk!=Δk(λi.eix)(0)k!; the coefficient of xnn! is {nk}=Δkk!(λi.in)(0)=[(xk)]xnk!=[xk_]xn.

By the same kind of argument as the cycle numbers, the n+1th element can be placed in its own set (in one way) or one of the k existing sets (in k ways), giving {n+1k}=k{nk}+{nk1}.

Writing {nk} as [xn]fk(x), this becomes (1kx)fk(x)=xfk1(x). Since f0(x)=1, the column o.g.f. is fk(x)=i=1kx1ix, which the e.g.f. binomial expansion thus shows partial-fraction-decomposes to i=0k(1)kii!(ki)!11ix.

The product form is rewritable in factorial powers as 1(1x1)k_, which (by the last joining identity) equals (1x)k. This means that according to the o.g.f.s of their columns and rows respectively (interpreting negative-exponent rising factorials as Puiseux series from x= instead of 0), we have {nk}=[kn].

this can be equivalently written exceedingly elegantly (using the Pochhammer reflection identities on binomial coefficients) as 1(1x1k)=i=0k(ik)1ix

note that by expanding out the denominator of the column g.f.s, one gets the linear recurrence {nk}=i=0k1[k+1ki](1)i{n1ik}. The cycle numbers, meanwhile, cannot be similarly decomposed into a simple summation; using the polynomiality of the diagonals of the conjoined triangle (see [4], page 11) and the polynomials' linear recurrences (see my MathOverflow answer), the cycle numbers can be computed in a singly-nested sum involving subset numbers, translating to a doubly-nested sum over elementary functions; no singly-nested such sum is known for them.

see also rook configurations for how {nnk} enumerates nonattacking k-rook configurations beneath the diagonal in an n×n chessboard

Lah numbers (|A008297|,A271703)

These count sets of nonempty lists, and have a closed form directly from their e.g.f., making them exceptional in a few ways; Lah(nk)=[xnn!yk]exy1x=[xnn!](x1x)kk!=n!k!(n1k1)

The combinatorial interpretation of said closed form is pretty straightforward; since (n1k1) enumerates ways to place k1 partitions in the n1 intervals of a length-n unlabelled list, to split it into an ordered list of k subsections (identified only by their lengths); we multiply by n! to make it totally-ordered, then divide by k! to count equivalence classes under permutations of the subsections.

The combinatorial interpretation directly yields the generatingfunctionological one; for each j-cycle permutation, partition the cycles into k subsets, then use these sets of cycles as permutations, ie. lists (either by interpreting them as a map or by Foata's transformation fondamentale)

Lah(nk)=j=kn[nj]{jk}

which can be rewritten

Lah(nk)=j=kn[xj]xn[xk_]xj=[xk_]xn

(which is a direct consequence of the Pochhammer-binomial theorem where y=n1, and can be rewritten as (n1k1)=[xk_k!]xnn!=[(xk)]((xn)))

Note that since the ratio between two Lah numbers at a constant displacement in the Lah triangle is a rational function of their coordinates, the row and diagonal o.g.f.s are hypergeometric! Lah(nk)=n![yk1]1F1(1n2;y) and Lah(n+kk)=[xk]n!2F1(n,n+12;x)=[xk]n!2F1(2n,1n2;x)(1x)2n1. The latter (obtained by Euler's hypergeometric reflection formula) is interesting, because the nth diagonal's g.f.'s numerator is the g.f. of the n1th row of the Narayana triangle A001263, so Lah(n+kk)=(n+1)!((k2n))3F2(1n,n,1k12nk,2;1)=(n+1)!i=0n((ki2n))Narayana(ni+1).

In fact, Narayana(nk+1) has some equivalent definitions
  • the number of strict order-homomorphisms from a k×2 array (considered as a poset, with each pair of elements i,j ordered by ijixjxiyjy) to a length-n list (ie. such that for two elements i,j, i<j implies the same about the indexes their mapped to in the list, l(i)<l(j)).
  • pairs of strictly increasing length-k lists a,b with elements [0..n) such that for all i, ai>bi
  • pairs of monotonically-increasing length-n lists A,B with elements [0..k) (where ith term is first index in which an element i appears in the previous bullet point's lists) such that for all i, Ai+1Bi

general matrix multiplications!

In general, for a Stirling-type array a(n,k)=[xnn!yk]eyA(x) and a slightly more general array b(n,k)=[xnn!yk]eB(y)B(x),

j=0na(n,j)b(j,k)=j=0[xnn!yj]eyA(x)[xjj!yk]eB(y)B(x)=j=0n[xnn!]A(x)jj![A(x)jj!yk]eB(y)B(A(x))=[xnn!yk]eB(y)B(A(x))

that is, given that a(n,k) is the number of ways to partition n labelled nodes into a set of A-egf-type structures, and b(n,k) the ways to partition them into a (set of B-ogf-type structures of) k B-egf-type structures, j=0na(n,j)b(j,k) is the number of ways to put n labelled nodes into any number of labelled A-type structures, put these into k B-type structures, and put these into a set of B-type structures.

In particular, (nk)=[xnn!yk]e(1+y)x, so j=kn[nj](jk)=[xnn!yk]1(1x)1+y=i=knn!i![ik] is the number of ways to totally order all but an arbitrary selection of i of n elements, then put those i into a permutation; equivalently, to cyclically order all but i of n+1 elements, which can then be added as an additional member of the set of cycles in [ik], making this equal to [n+1k+1]!

see also my page with some more interesting binomial matmuls

Ward's theorem

If one has a combinatorial structure s such that the number of instances of s with n (labelled) nodes is given by s(n), with e.g.f. S(x), the number triangle that counts partitions into sets of ss, a(n,k)=[xnn!yk]eyS(x), then the inverse binomial transform of the diagonals gives a partition triangle of the same type.

a(n,nk)=Δn(λj.a(j,jk))(0)=j=0n(1)nj(nj)[xjj!]S(x)jk(jk)!=[x0]j=0n(1)nj(nj)jk_S(x)jkxj=(1)n[x0]2F1(1,n1k;S(x)x)(k)!S(x)k=(1)nknk_[xk]1F0(kn;S(x)x)=nk_[xk](S(x)xx)nk=nk_[xn](S(x)x)nk=[xnn!](S(x)x)nk(nk)!

ie. a(n,k)=[xnn!](S(x)x)kk!=[xnn!yk]ey(S(x)x), meaning a(n,k) counts the number of ways to partition n nodes into a set of k ss, which are identical to ss but have one fewer singleton.

This means {{nnk}}=Δn(λj.{jjk})(0) and [[nnk]]=Δn(λj.[jjk])(0) count partitions into singleton-free subsets and cycles (the latter corresponding with permutations without fixed points).

for the second-order Lah numbers, we have Lahlah(nk)=n!k!(nk1k1)=[xnn!yk]eyx21x=n![yk1]2F2(1n2,3n22,2n;4y)

(See [6], which gives two citations for its discovery.)

row g.f.s of the opposite kinds

subset row sums (Bell numbers A000110) and o.g.f.s (Bell polynomials)

The nth row sum of the subset numbers is called the Bell number k=0n{nk}=Belln=A000110(n)=[xnn!]eex1, and the row o.g.f.s are called Bell polynomials, k=0n{nk}yk=[xnn!]ey(ex1)=Belln(y).

The Bell numbers aren't as simple as one might expect from their counterparts the factorials, and getting an asymptotic characterisation as precise as Stirling's approximation entails saddle point approximation. However, we can do some elementary things!

Taking the average value of k across all sets of sets, k=0nk{nk}ykk=0n{nk}yk=k=0n({n+1k}{nk1})ykk=0n{nk}yk, and the upper bound can be changed to k+1 since the k+1th summand cancels itself out, giving Belln+1(y)Belln(y)y.

We also have that [yk_]Belln(y)=i=kn{ni}[yk_]yi=i=kn{ni}{ik}=[xnn!yk]ey(eex11) (entries A039810=A130191) by the matmul correspondence

Since we know that {nk}=[xk_]xn, ie. k{n}=k!{nk}=Δk(λi.in)(0), we can use the formula from #binomial transforms for egf(λk.k{n})(x)=Belln(x)=exegf(λi.in)(x); this is Dobiński's formula, and its practical use is the other way around; knowing the function by which to multiply ex in the e.g.f. of a polynomial sequence, in terms of said polynomial's expansion.

Dobiński's formula is not to be confused with the fact that (ddx)nef(x)=psets(sets(n))spf(s)(x), which is sometimes called the EXP transform but is a case of Faà di Bruno; it can be rewritten (ddx)nef(x)=Belln(f(x),,f(n)(x)) via the multivariate Bell polynomial (for which each size of subset is 'coloured' in its own variable)

cycle row rising g.f.s

having looked at [nk]=[xk]xn and {nk}=[xk_]xn=[xk]Belln(x), it makes sense to look at the function [nk]=[xk]antiBelln(x)!

similarly to before, we have [yk]antiBelln(y)=i=kn[ni][yk]yi=i=kn[ni][ik]=[xnn!yk](eylog(11log(11x)):=1(1log(11x))y) (entries |A039814|=|A325872|); this tells us that antiBelln(y)=[xnn!]1(1log(11x))y.

By setting y=1, we likewise get the antiBell numbers, A007840 (enumerating lists of cycles)!

combinatorial interpretations of Pochhammer calculus

this section now moved to /binomial-type theorems

the duality

As explained in [4], the duality shown in the o.g.f.s has a direct combinatorial proof in terms of Richard P. Stanley's poset homomorphism duality theorem. In fact, it works for every variety of generating function, importantly also the falling ones; {nk}=[yk_]yn becomes [nk]=[xn_:=1(x+1)n]1xk; that is, within the radius of convergence (of 1 from x=0), 1xk=n=k[nk](x+1)n and so we can get the expansion of the oth-order partial sums of the sum of reciprocals of kth powers!

Eulerian numbers (nk1=A008292(n,k)=A123125(n,k),nk0=A173018(n,k))

note that pretty often (including in the primary entry) the indexing E(n,k)=nk1 is used, but nk0 is Euler's and Knuth's and Foata's and Hirzebruch's and Luschny's (see the latter's page); be careful

Let nk0 be the number of permutations of n elements in which k of the n1 pairs of consecutive elements pi,pi+1 have pi<pi+1 (ie. are 'rises'), and nk1=nk10 have k1 rises.

To establish the recurrence combinatorially, we can surject a permutation into a 'landscape' of rises and falls, then with k rises,

  • one can place the n+1th element at the beginning to add a fall, or in an interval that was a rise (the two new intervals that it's split into are a rise and a fall), for k+1 insertions that conserve the rise-count
  • likewise, since there are n1 intervals from which the k rises are subtracted for n1k falls, one can place it at the end or inside a fall to create a new rise in nk ways

which give us

n+1k0=(k+1)nk0+(nk+1)nk10

writing nk0=[yk]fn(y) (where fn is known as the nth Eulerian polynomial), the recurrence turns into diffeqs

fn+1(y)=(1+ny)fn(y)+y(1y)fn(y)

which is actually the quotient rule; note that if we define g(y)=y(1y)n+1f(y), we get precisely that diffeq-recurrence in the numerators, g(y)=(1+ny)f(y)+(1y)f(y)(1y)n+2 so since f0(y)=1, gn(y) is the nth iterate of the map λf.λy.yg(y) starting at g0=y1y.

In particular, since the operation 'differentiate and then multiply by y' multiplies the kth term of the series expansion by k without shifting it, we have that [yk]gn(y)=kn for n1.[n 1]

So, nk0=[yk+1](1y)n+1ogf(λi.in)=n+1(λi.in)(k+1)=i=0n+1(n+1i)(1)i(k+1i)n (and nk1=i=0n+1(n+1i)(1)i(ki)n), and likewise kn=[yk1]ogf(λj.nj0)(1y)n+1=j=0k1nj0((kjn))=j=nkn1nj0(k+jn) (the latter following from nj0=nn1j0).

known as Worpitzky's identity; a bijective proof is given in [7], but a more direct one is as follows:

  • kn is the number of functions from [n] to [k]
  • j=0k1nj0((kj1n+1)) is the number of pairings p,q, where p are (permutations of [n] with j falls) and q are (placements of n+1 identical markers into kj1 intervals)
  • to biject from the left to right side,
  • plot the function's output, then connect the nodes in reading order. the order in which the line lands upon them is a permutation of the initial list; let p be the sequence of x positions along the line (here 1,3,6,4,2,0,5)
  • place tally marks along each line segment for each unit change to y position; the j descents each have at least 1, the ascents have at least 0
  • after placing the j mandatory tallies, the remaining k1j have n+1 intervals between successive nodes to go into (in stars and bars, |*||*|*|***|**|||); the red nodes are fixed at the minimum and maximum height, so act to shift all or none of the function respectively

We can put the Eulerian numbers inside the monomials in the formula for subset numbers, to get

k{n}=i=0k(ki)(1)kij=0n1(i+jn)nj0=j=0n1nj0i=njk(ki)(1)ki(i+jn)=j=0n1nj0(knj)(1)j+kn2F1(nkj,1+n1+nj;1)=j=0n1nj0(knj)jj+kn_(1+nj)j+kn #Gausss hypergeometric theorem again!=j=nkn1nj0(jnk)
other formulations:
in the other convention, it can be written as j=1knnk+j1((jnk)); by reflection, j=1knkj0((jnk))

and the bijective proof of this (in the manner of stars and bars) is that one can represent each member of k{n} (a list of sets) by sorting the sets in ascending order then concatenating them together, with |s placed at intervals which were formed by concatenations and ,s otherwise; count the nkj<n ascents, then a fall must be a bar but a rise can be either a comma or bar; there are nk commas, distributed amongst the j ascents in (jnk) ways.

We can also get this from the fact that subset numbers are the inverse #binomial transforms of monomials; however, unlike the Bell numbers section, we use the o.g.f. rather than e.g.f. relation. ogf(λk.k{n})(x)=ogf(λi.in)(x1+x)1+x. (Since ogf(λi.in)(x)=k=0n1nkxk+1(1x)n+1, ogf(λk.k{n})(x)=j=0n1njxj+1(1+x)n1j=j=0n1njxnj(1+x)j)

Eulerians in the hypercube!

taking a set of n uniform random variables in [0,1] and replacing each variable with its rank (the number of variables smaller than it) produces a uniformly-selected random permutation. This is useful, because it means the chance that a random permutation satisfies some property (ie. a set of inequalities between some of its elements) is equivalent to the measure (hypervolume) in [0,1]n that satisfies it (see also bitwise permutations for a little application)

the short single-page note [8] explains a mapping that bijects the regions of the hypercube with k rises as a permutation, into the region whose sum of coordinates is between k and k+1 (a polytope bounded by hypersimplices; [9] explains much more about them)

map (x1,,xn) into (y1,,yn)<math>by<math>yi=(xi1xi)%1 (where x0=0); each ascent wraps around and adds 1 to the sum, and each xi's contribution is cancelled out by being subtracted from yi and added to yi+1, except for the last (which allows the image to be in the interval between two hyperplanes, instead of just within one)

as such, laying an n-cube along its n-agonal, then chopping into n slices of length 1n and melting down the parts produces a row of the Eulerians!

note this was known to Laplace as of 1812 in [10]!

calculus of the wrong kinds

from binomial expansions, Δxn=k=0n1(nk)xk and xn=k=0n1(nk)(1)n1kxk. What about the expansion in Pochhamers of the derivative of a Pochhammer?

via the product rule, it is ddxxn=k=0n1(xk(x+n1)n1k_=xnx+k)=xn(Hx+n1Hx1)

however, it is also elucidating to do gfologically

ddxxn=i=0nixi1([xi]xn:=[ni])=i=0n[ni]ik=0i1xk([xk]xi1:=(1)i1k{i1k})=k=0n1(1)kxki=k+1n[ni]i(1)i1{i1k}=k=0n1(1)kxki=k+1n[xnn!yi]1(1x)yi[xi1(i1)!yk]ey(ex1)=k=0n1(1)kxki=k+1n[xnn!]log(11x)ii![xii!yk]xey(ex1)=k=0n1(1)kxk[xnn!yk]log(11x)eyx=k=0n1(1)kxk[xnn!yk]i=1j=0xii(xy)jj!=k=0n1xknkn!k!

Note that this can also be written as ddx((xn))=k=0n1((xk))nk

Faulhabernoulli's formula

We can write

t=1ytn=i=0nyi+1i+1([yi]yn:=(1)ni{ni})=i=0n(1)ni{ni}k=1i+1yk([yk]yi+1:=[i+1k])i+1=k=1n+1yki=k1n(1)ni{ni}[i+1k]i+1=k=1n+1yki=k1n[xnn!yi]ey(1ex)[xi+1i!yk]1(1x)y=k=1n+1yki=k1n[xnn!](1ex)ii![xii!yk]1x(1x)y1x=k=1n+1yk[xnn!yk]1(1ex)(1(1ex))y11ex=k=1n+1yk[xnn!yk]exy11ex=[xnn!]exy11ex

if we define the 'ordinaryised' Bernoulli numbers Bn+=[xn]x1ex (1n!th of the standard definition Bn+ as the e.g.f. expansion), then since exy1=j=1(xy)jj!, exy11ex=i=0j=1Bi+xi1+jyjj!, so [xnn!yk]exy11ex=Bn+1k+n!k!.

Plugging this in, we obtain the very elegant t=1ytnn!=k=1n+1Bn+1k+ykk!; that is, the 2D array of coefficients is just a 1D sequence in the first column with elements extruded into diagonals.

Gregory's dual thereto

going the other way around, we have the analogue,

t=0ytn=i=0nyi+1i+1([yi]yn:=[ni])=i=0n[ni]1i+1k=1i+1yk([yk]yi+1:=(1)i+1k{i+1k})=k=1n+1yki=k1n[ni](1)i+1ki+1{i+1k}=k=1n+1yki=k1n[xnn!yi]1(1x)y1i+1[xi+1(i+1)!yk]ey(1ex)=k=1n+1yki=k1n[xnn!]log(11x)ii![xii!yk]ey(1ex)1x=k=1n+1yk[xnn!yk]eyx1log(11x)

again, we must define a sequence for the expansion this unpleasant reciprocal of a pleasant e.g.f.; we define Gn=[xn]xlog(11x) (whereas usually Gn=[xn]xlog(1+x), so Gn=(1)nGn; the Gn alternate in sign (every even term after the 0th is negative), but the Gn are all negative after the 0th). Then, as before,

eyx1log(11x)=i=0j=1Gi+1xi+jyjj!

so we have t=0ytn=k=0n+1Gnk+1n!k!yk, ie. t=0y((tn))=k=0n+1Gnk+1((yk)), or t=0y(tn)=k=0n+1Gnk+1(yk)

We can represent this as a single coefficient extraction from a bivariate, just like the monomial summation. Knowing [xnn!yk]eyx1log(11x)=[xnn!yk]f(x,y), we have that

[xnn!yk]f(x,y)=i=0k[xnn!yi]eyx1log(11x)([yk]yi:=[ik])=i=1k[xnn!]1log(11x)xii![xii!yk]1(1x)y=[xnn!yk](1x)y1log(11x)

so, t=0ytn=[xnn!](1x)y1log(11x)

asymptotics and properties as statistical distributions

For the cycle numbers, the existence of ogf(λk.[nk])=xn helps us,

pperms(n)cycles(p)#(perms(n))=k=0nk[nk]ykk=0n[nk]yk=ddyynyn=ddylog((y+n1)!(y1)!)=Hy+n1Hy1

so at y=1, the centre of mass of the distribution is Hn! meanwhile, the variance, expectpperms(n)(cycles(p)Hn)2; in general,

k=0n((k2:=k2_+k1_)2kHn+Hn2)[yk]f(y)n!=(f(1)f(1)+f(1)f(1))2f(1)f(1)Hn+Hn2=f(1):=yn((Hy+n1Hy1)2(Hn+y1(2)Hy1(2)))|y=1n!+HnHn2=Hn2Hn(2)+HnHn2=HnHn(2)

meaning that (since Hn(2)π26) both the expectation and variance of the cycle count of a length-n permutation are asymptotic to log(n). [11] proves that the distribution is asymptotically normal, so (asymptotically, up to Puiseux refinement) completely characterised by these, since higher-order moments can be neglected.

Applying the saddle point method (from [1]), we can still get asymptotics,
  • g(x)=xyex,g(x)=(x+1)yex
  • xn=W0(ny)log(ny) (Lambert W function)
  • f(xn)=enW0(ny)y
  • g(xn)=n+yeW0(ny)=n(1+1W0(ny))n
  • xnn+12=e(n+12)log(W0(ny))=e(n+12)(log(ny)W0(ny))=(ny)n+12e(n+12)W0(ny)n!2πe(e1W0(ny)y)n+12

so an asymptotic on the nth Bell polynomial is

Belln(y)=[xnn!]ey(ex1)=n!f(xn)2πg(xn)xnn+12enW0(ny)y+12(yeW0(ny)1)n+12n(1+1W0(ny))enW0(ny)yn(nW0(ny))n+12nenW0(ny)ynnnW0(ny)n+12
whose logarithm is nW(ny)(n+12)(log(ny)W(ny))+n(log(n)1)y, and logarithmic derivative nyW(ny)+12y(W(ny)+1)1 (though reintroducing the removed 1+1W0(ny) denominator part gives nyW(ny)+W(ny)2y(W(ny)+1)21),

giving us the final result that the centre of mass of the nth row is expectPsets(sets(n))(|P|)nW(n)+12(W(n)+1)1 (although the refinement, nW(n)+W(n)2(W(n)+1)21, changes the ratio of the estimates for the 192nd term (877 bits long) from 1.000423 times overapproximation to 1.00000734 under-; <1/57th the error!)

Since we know (from the section on them) that this logarithmic derivative equals Belln+1(y)Belln(y)y, this can be slightly more nicely expressed as Belln+1BellnnW(n)+W(n)2(W(n)+1)2.

Likewise, knowing the variance equals (f(1)f(1):=f(1)f(1)f(1)f(1))+f(1)f(1)(f(1)f(1))2 (where f=Belln). Resolving f(1)f(1)=nW+3W22(W+1)2+2(2nW3W22W2n(W+1)2W(2W2+3W+2)1), we get varPsets(sets(n))(|P|)=(Belln+1Belln)(Belln(Belln+1+Belln+2)Belln+12)Belln2Belln+1nW(n)(W(n)+1)1+12(W(n)+1)232(W(n)+1)3+1(W(n)+1)4

[12] gives an analogous result to the first kind; asymptotically, these first two moments describe the normal distributions they converge to.

row log-concavity inequalities

the cycle row o.g.f.s (rising powers) have roots lie in an arithmetic progression over the nonpositive reals, and [12]'s lemma 1 shows that the subset row o.g.f.s' (Touchard polynomials') roots are also real; theorem 2 of [13] is that this property makes them at least as log-concave as the binomial coefficients, ie. [nk]2[nk1][nk+1](n1k1)2(n1k2)(n1k)=(1+1k1)(1+1nk) (and it seems the excess ratio is monotonically decreasing; [nn1]2[nn2][nn](nn1)2(nn2)(nn)=2n(n1)(n2)(n13)2n1n2=1+13n1, which is a lower bound when varying k)

[14] compiles a lot of nice results; theorem (3.2) is that since {nk+1}{nk}2(nk)k(k+1), for subsets we have the slightly stronger {nk}2{nk1}{nk+1}>(k+1)(nk+1)(k1)(nk)=(1+2k1)(1+1nk), with excess error ratio {nk}2{nk1}{nk+1}2(1+2n2)=1+23n5 on the right end (to which it also seemingly decreases monotonically)

For the Lah numbers, this section of my page on A058349 provides some more general results about asymptotics for their o.g.f.s (and modifications covering all confluent hypergeometrics with respect to a single numerator Pochhammer).

asymptotics along rays

a (non-Hayman) saddle point method can be used slightly more directly; write k=cn, then

(cn){n}=[xnn!](ex1)k=n!τiγ(ex1)cnxn+1 dxn!τiγ(ex1)cnxn+1 dx=n!τiγ(ex1)cnxn+1 dx #choose r so er=11rc; r=W0(1cec)+1c=n!τirn+10τ(ereiθ1)cneiθ(n+1)(dxdθ=rieiθ) d(θ=log(sgn(x))i)=n!τrn0τ(ereiθ1)cneiθn dθ=n!τrn0τexp(ϕ(x:=eiθ):=n(clog(erx1)log(x)+log(r))) dθ #considering r to be independent from x=n!τrnεεexp(ϕ(x:=eiθ):=n(clog(erx1)log(x))) dθ #match function at real line

so ϕ(x)=1x2cex(ex1)2, ϕ(x)ϕ(r)ϕ(r)θ22 and

(cn){n}n!τrnexp(ϕ(x)n(clog(er1r)(1r2cer(er1)2)θ22)) dθ=n!(er1)cnτrn+1τn(1r2cer(er1)2)=n!(er1)cnrn+11τn(1r2cer(11rc1)2)(necr1rc1cc)ncc(r+1)1

this allows you to asymptoterate {ncn} by dividing by (cn)!, then (as realised by [15]) setting c>1 and using [nk]={kn} lets you asymptoterate the cycle numbers!

transforms

the factorial integral

The classical integral for the factorial, x!=t=0txet dt, implies the useful properties of the two transforms we will be using. As the area under a graph, it can be stretched n times, nx!=t=0(tn)xetn dt. By reassigning n:=1n, x!nx+1=t=0txent dt.

  • the Mellin transform is {f}(x)=t=0f(t)tx1 dt, so this is rewritable as {λt.ent}(x)(x1)!=1nx. By summing, {λt.n=1ent=1et1}(x)(x1)!=n=11nx=ζ(x)
  • x!nx+1=t=0txet/n dt; by swapping x and n around, n!xn=t=0tnet/x dtx. The Laplace transform is {f}(x)=t=0f(t)ext dt so we can write {λt.tnn!}(1x)x=xn

e.g.f.s to o.g.f.s (Laplace)

as per above, we can define {f}(x)={f}(1x)x so that {f}(x)=ogf(egf1(f))(x). (This is most of why the Laplace transform is so useful for differential equations; {λt.(ddt)nf(t)}(x)=truncn{f}(x), where the trunc operator does truncating division, truncf(x)=f(x)f(0)x; through this, ordinary differential equations with constant coefficients (that define e.g.f.s of linear-recurrent sequences) become arithmetic equations (that define rational o.g.f.s); it can be applied to solve differential equations that define the g.f.s of a variety of D-finite recurrences also)

o.g.f.s to r.g.f.s (Mellin)

t=0f(t)txet dt=n=0[tn]f(t)t=0tx+net dt=n=0(x+n)![tn]f(t). As such, t=0f(t)tx1et dt(x1)!=n=0xn[tn]f(t)=rgf(ogf1(f))(x). (This is a variant of the Mellin transform, which we will denote as {f}(x)={λt.f(t)et}(x)(x1)!)

via this, the previous transform for the zeta function can be rewritten as {λt.11et}=ζ

As another example, antiBelln(x)=t=0tntx1et dt(x1)!=t=0r=0rn+t1tx1er+t dr dt(t1)!(x1)! and in particular antiBelln=t=0tnet dt

convergence

for a function that is O(xα) as x0 and O(xβ) as x, the Mellin transform {f}(x)=t=0f(t)tx1 dt converges for α<x<β

o.g.f.s to f.g.f.s

Since xn_=(1)n(x)n, fgf(f)(x)=rgf(λn.(1)nf(n))(x)

to recap on sequence transforms,

we can make a whole table of all the different variations of the binomial transform and their g.f.wise interpretations!
t's name binomial multichoose
binomial
multichoose
t{a}(n)ogf(t{a})(x)egf(t{a})(x) k=0n(nk)a(k)f(x1x)1xexf(x) k=0((nk))a(k)x1xf(11x) k=n(kn)a(k)f(x+1) k=0((kn))a(k)f(11x)
t1{b}(n)ogf(t1{b})(x)egf(t1{b})(x) k=0n(1)nk(nk)b(k)f(x1+x)1+xexf(x)[t 1] k=0n(nk)[xk]f(1+x)f(11x)x1[t 2] k=n(1)kn(kn)a(k)f(x1) f(11x)
and since we now know how to turn monomials into Pochhammers (and how to expand Pochhammers in monomials), likewise for the Stirling numbers!
t's name subset
(conventionally
"Stirling")
cycle
subset
cycle
t{a}(n)ogf(t{a})(x)egf(t{a})(x) k=0n{nk}a(k)f(ex1) k=0n[nk]a(k)f(log(11x)) k=n{kn}a(k)gf{fgf1{f}}(x)=1{λx.f(x)}(x) k=n[kn]a(k)rgf{gf1{f}}(x)={f}(x)
t1{b}(n)ogf(t1{b})(x)egf(t1{b})(x) k=0n(1)nk[nk]b(k)f(log(1+x)) k=0n(1)nk{nk}b(k)f(1ex) k=n(1)kn[kn]a(k)fgf{gf1{f}}(x)={λx.f(x)}(x) k=n(1)kn{kn}a(k)gf{rgf1{f}}(x)=1{f}(x)
we can use this to obtain combinatorial identities, by finding closed forms for the expansion of a function as both an o.g.f. and (either kind of) Pochhammer g.f.! for example,
  • we know that {nk}=[xk_]xn, so {n+1k+1}=[xk+1_]xn+1=[(x+1)k+1_](1+x)n+1=[xk_](1+x)n! (the last step being to divide both the []'s reference frame's inside and outside by 1+x) As such, we have the two simultaneous interpretations (1+x)n=ogf{λk.(nk)}=fgf{λk.{n+1k+1}}, which translates to a pair of dual identities
{n+1k+1}=i=kn{ik}(ni)
(nk)=i=kn(1)ik[ik]{n+1i+1}
  • likewise, [nk]=[xk]xn, so [n+1k+1]=[xk+1]xn+1=[xk](x+1)n; meanwhile, since Lah(nk)=[xk_]xn, we can apply Δ to both sides of a coeff-extraction [xk_](x+1)n=[Δxk_]Δ(x+1)n=[kxk1_]nxn1=nkLah(nk)=n!k!(nk), which translates to
n!k!(nk)=i=kn{ik}[n+1i+1]
[n+1k+1]=i=kn(1)ik[ik]n!i!(ni)
  • meanwhile, (1)nk(nk)=[xk](x1)n but [xk_](x1)n=[xk1_]xn(1)nx+1=j=kn(1)nj{j1k1} (which equals (1)nk[xk](1+x)n by the connection identity), so
j=kn(1)nj{j1k1}=i=kn{ik}(1)ni(ni) (=A105794(n,k))
(nk)=kijn(1)ji[ik]{j1i1}


notes
  1. we can also write the coefficient as (1)nk(nk)=(kn)
  2. where [xn]f(1+x)=k=nkn_a(k); so we can rewrite it i=0b(i)k=0n(nk)ik_, where the inner sum can be rewritten [xii!]ex(1+x)n=2F0(n,i ;1)=in_1F1(nin+1;1)in.
    the reason for the leading minus sign, informally, is that the sequence b(n)=cn requires c<1 (though in general does not prohibit limna(n)a(n1)=1) and multichoose-transforms forwards into b(n)=11cn, and exponentially-growing infinite sums resolve to negative values by analytic continuation, so the sign flips it back around

o.g.f.s to mixed kind (Lapllin transform)

putting these together, we can define an operator that performs both transforms in different variables; let a(n)=[xn]f(x), then

{f}(x,y)=0ty1et/xf(t) dt(y1)!xy=n=0a(n)xnyn

Note that (1)nmgf(φ)(n)=k=0n(1)nkφ(k)k!nk_=Δn(φ)(0), which can be rewritten (1)nmgf(egf1(f))(n)=[(yn)]f(y)(0)=[xnn!]exf(x); note that mgf(egf1(f)) equals rgf(ogf1(f)), the thing we have a Mellin transform for! Writing the integral, (1)n{f}(n)=(1)nt=0f(t)tnet dt(n)!=[xnn!]exf(x). Defining g(x)=exf(x), {λt.g(t)}(n)(n)!=[xnn!]g(x) (and via Euler's reflection formula, sin(πn)π{λt.g(t)}(n)=[xn]g(x)).

This is Ramanujan's master theorem.

note that the Euler's-reflection-formula'd version is a deformation of Cauchy's formula [xn]f(x)=γf(t)tn+1 dt2πi into an infinitesimal-width contour to infinity around the negative reals!

[xnn!]f(x)=0f(t)tn+1 dt(n)![xn]f(x)=0f(t)tn+1 dtπcsc(πn)=limc0(0f(t)(t+ci)n+1 dt)+(0f(t)(tci)n+1 dt)2πi

(we use the negative reals rather than the positive, to use the branch cut we've induced along them)

note that having proven the first case combinatorially and second case analytically, their simultaneous truth is logically equivalent to the statement of Euler's reflection identity, giving an accidental proof of that as well!

Some special cases of its use:

  • the factorial integral comes by setting g(x)=ex; [xnn!]ex is continued to equal 1 for all complex n
  • if we denote (n!(n)!:=πsin(πn))[xn]f(x)=0f(t)tn+1 dt={λt.f(t)}(n) by {xn}f(x), then it behaves quite similarly to [xn]; we have
  • {xn}f(λx)=λn{xn}f(x)
  • {xn}xvf(x)={xnv}f(x)
  • {xn}f((x)p)={xn/p}f(x) (the somewhat awkward arrangement is because it needs to converge along negative reals)
  • {xn}log(1x)f(x)=ddn{xn}f(x)
the first two work in the same manner as ordinary coefficient extraction, but the last two work based on {xn}f(x) being a complex function of n
  • for instance, we can start with (n)!={xn}ex, then use the second law for (np)!={xn}e(x)p; by dividing both sides by (n)! and using the recurrence for the factorial, we get (np)!(n)!=[xnn!]e(x)p; we know [xnn!]e(x)p=[p divides n](1)n(1+1p)n!(np)!; as such, we get the identity over the integers (1)n(1+1p)(np)!(n)!=[p divides n]n!(np)! which is maybe a little bit useful sometimes
  • Faulhaber's formula t=1ytn=[xnn!]exy11ex becomes t=01etyet1tn1 dt(n1)!=t=1ytn=Hy(n). The 1 in the numerator contributes ζ(n)=H(n), so t=0etyet1tn1 dt(n1)!=ζ(n)Hy(n)=ζ(n,y+1), the Hurwitz incomplete zeta function
  • Bn+=[xn]x1ex becomes t=0tnet1 dtn!(n)!=Bn+. By substituting n=1n in t=0tn1et1 dt(n1)!=ζ(n), we get that Bn+=ζ(1n)(n1)!=2cos(πn2)(2π)nζ(n) (by the zeta function's reflection formula)
  • for the Stirling numbers, we write n!k! as nnk_, then
  • since {nk}=nnk_[xn](ex1)k and {nk}=[kn], [nk]=(k)nk_t=0(et1)ntk1 dt(k1)!=(k)nk_[xk](ex1)n=(k)nk_[xnk](xex1)n=knk[xnk](x1ex)n
  • since [nk]=nnk_[xn]log(11x)k and [nk]={kn}, {nk}=(k)nk_t=0log(11+x)ntk1 dt(k1)!=(k)nk_[xk]log(11x)n=(k)nk_[xnk](xlog(11x))n=knk[xnk](xlog(1+x))n
since exponents of g.f.s correspond with nested convolutions of sequences, these two translate to sums of products over partitions
[nk]=knkPn(P)=nkbPBb+{nk}=knkPn(P)=nkbPGb


knowing that t=01etyet1tn1 dt=(n1)!Hy(n), we can take the e.g.f. for this as a sequence in n,

n=1Hy(n)xnn=n=1t=01etyet1tn1 dtxnn!=t=01etyet1n=1tn1xnn! dt=t=01etyet1etx1t dt

the derivative in x is n=0Hy(n+1)xn=t=01etyet1etx dt; we can rewrite this as n=0Hy(n+1)xn=t=0etxet(xy)et1 dt. This is a difference between two of the n=1 case of the previous Hurwitz zeta integral, t=0etyet1 dt=ζ(1)Hy(1)=ζ(1,y+1), so n=0Hy(n+1)xn=ζ(1,1+yx)ζ(1,1x)=HyxHx, which Mathematica says is valid for (y)>(x)1<0.

So, n=1Hy(n)xnn=log((x)y_:=(x)!(yx)!), ie. Hy(n)n=[xn]log((x)y_). By Ramanujan's master theorem again, we get t=1ytnn=[xn(n1)!]exy11ex=(n1)!k=1n+1Bn+1k+ykk!=[xn]log((yx)!(x)!), so log((yx)!(x)!)=k=0Bk+n=k(n1)!xnyn+1k(n+1k)!=k=0Bk+(y+x((1y/x)k1))(k2)!(xy)k

harmonic asymptotic for cycle columns

Knowing that ogf(λc.Hy(c)c)=log((x)y_), we can write 1xy_=(x+1)y=exp(ogf(λc.(1)c1Hy(c)c)). It makes sense to think of this instead as exp(egf(λc.(1)c1(c1)!Hy(c))), since (via Faà di Bruno) the exp counts sets of its input's object type, and the interior counts cycles 'coloured' with variables that happen to be harmonic numbers. As such, an e.g.f. coefficient extraction can be written as a sum over permutations.

Then since [xk](x+1)n=[xk+1]xn+1=[n+1k+1], we have [n+1k+1]=n!k!psets(cycles)(k)cp(1)|c|1Hn(|c|).

this is useful! if one fixes k then as n, we can rewrite it as [n+1k+1]n!k!psets(cycles)(k)Hn|{cp:|c|=1}|cp|c|>1(1)|c|1ζ(|c|); so we can write

[n+1k+1]n!k!j=0k(kj)Hnkjdderangements(j)(1)j|d|cdζ(c)=n!k!(Hnk(k2)ζ(2)Hnk2+(k3)2ζ(3)Hnk3+(k4)(3ζ(2)26ζ(4))Hnk4)

(a refinement to the DLMF's 26.8.40)

more musings:

Defining the coefficient D(j)=dderangements(j)(1)j|d|cdHn(c), noting that constructing derangements by adding cycles to preceding cases gives D(j)=i=2jji_jHn(i)(1)i1D(ji). Writing d(j)=D(j)j! this is jd(j)=i=2j(1)i1Hn(i)d(ji). Since i=2Hn(i)(x)i1=Hn+xHnHx, writing F(x)=j=0d(j)xj, we have F(x)F(x)=Hn+xHx so D(j)=[xjj!]et=0xHn+tHnHt (where the approximation reduces this to [xjj!]et=0xHt), so

[n+1k+1]=n!k!j=0k(kj)Hnkj[xjj!]et=0xHn+tHnHt=n![xk]eHnx+t=0xHn+tHnHt=n![xk](n+xx)=[xk]((x+1)nnex(Hnγ)x!) #by substituting Hn(i) for ζ(i)

note that there is a dual!

[n+1k+1]=n![xk]((x+1)n=exp(egf(λc.(1)c1(c1)!Hn(c))))=n!k!psets(cycles)(k)(1)k|p|cpHn(|c|)Δn(λi.{0i=01ikelse)(0)=n![xk](1(x1)n_=xn=(1)n1exp(egf(λc.(c1)!Hn(c))))=(1)n1k!psets(cycles)(k)cpHn(|c|)=n{k}

these numbers n{k} (which arise when interpreting the extension of the cycle row g.f.s backwards as a Taylor rather than Laurent series, and also in expansions of recursive integrals of powers of logarithms) are covered thoroughly in section Subpowers with negative integer exponents beginning p.12 of [2]

Stirling polynomials

The fact [nk]=knk[xnk](x1ex)n can be rewritten [yyn](y1n)=[xnn!](x1ex)y=k=0nykS1kΣ(S)=npS[xpp!]log(x1ex). (Since valx(log(x1ex))=1, each of the k summands must be at least 1, so the inner sum has a nonzero number of summands iff k[1..n], or k=0 (the empty partition) for n=0.) As such, [yyn](y1)n_ is a polynomial in y of degree n, and therefore so too is the more pleasant index-shift [n+yy]((yn))!

and by the same argument, {nk}=knk[xnk](xlog(1+x))n becomes {yyn}(y1n)=[xnn!](xlog(1+x))y=k=0nykS1kΣ(S)=npS[xpp!]log(xlog(1+x))

there is a name for these, the Nørlund/generalised Bernoulli numbers! see this answer for how the cycle triangle's columns further leftwards (with their zeroes divided out) give coefficients for higher-order versions of #Faulhabernoulli's formula.

for n>0, we have

Gn=[xn]xlog11x=1(n1)!ddx[n+xx]|x=1 (due to Gn=Bn(n1)n1=(1)n(n1)!ddx[n+xx]|x=1)
A002208(n)A002209(n)=[xn]x(1x)log11x=1(n1)!ddx[n+xx]|x=0[n 2]
Bn=[xn]xex1=Bn(1)=1(n1)!ddx{n+xx}|x=1
Bn+=[xn]x1ex=1(n1)!ddx{n+xx}|x=0

recurrence relations

see also my big math.se post about this

linear

only for subset numbers

o.g.f.s

unravelling the cycle row o.g.f. and subset column o.g.f. as products gives

[nk]=S{0..n1}|S|=nkiSi
{nk}=Smult{0..k}|S|=nkiSi=Pk(P)=nki=1kiPi

(where mult denotes making a multiset by drawing from a set with multiplicity, and the latter sum is just a run-length encoding)

only the latter can be partially unravelled into preceding cases, to give the recurrence

{nk}=b=0nkkb{n1bk1}

e.g.f.s

likewise,

Lah(nk)=n!k!P1k(P)=n1=n!k!(n1k1)[nk]=n!k!P1k(P)=n1bPb{nk}=n!k!P1k(P)=n1bPb!

which becomes

Lah(nk)=b=1nknb_kLah(nbk1)[nk]=b=1nknb_bk[nbk1]{nk}=b=1nk(nb)k{nbk1}

these become more evident when written via the Stirling powers; the coefficient tells you how many ways you can take out b elements and arrange them into the first list/cycle/set in the ordered list of lists/cycles/sets

k(n)=b=1nknb_(k1)(nb)k[n]=b=1nknb_b(k1)[nb]k{n}=b=1nk(nb)(k1){nb}

duality

employing the Stirling duality on e.g.f.s (from #Ramanujan's master theorem) gives

[n+1k+1]=nkb=0nkBb+kb[nk+b]{n+1k+1}=nkb=0nkGbkb{nk+b}

or

(k+1)[n+1]k+1=nb=0nkBb+(k+b)[n]k+b(k+1){n+1}k+1=nb=0nkGb(k+b){n}k+b

another formulation can also be derived with indexes shifted; this lets you use it down to k=1 without division-by-0 issue

k[n]=nb=0nkBb+(k+b1)[n1]k{n}=nb=0nkGb(k+b1){n1}

as shown there, we have

{n+1k+1}=i=kn(ni){ik}
[n+1k+1]=i=knn!i!(ni)(1)ik[ik]

moments of statistical distributions

[16] gives applications to a few kinds of random variables' factorial and ordinary moments

table of their results (and more!) in nicer formatting:

as a recap,

  • binomial is "how many heads in n tosses"
  • Poisson is "if [event] occurs randomly (chance unaffected by past occurrences, like radioactive decay) with λ instances expected in the measuring duration, how many are actually observed"
  • geometric is "how many tails before one heads"
  • negative binomial is "how many tails before n heads"
  • coupon is the coupon collector's problem; the random variable X is how many times one must roll an n-sided fair die for all faces to land up at least once (I copied the analysis done for this table (slightly more verbosely) into there, since it superseded the old method)

given the o.g.f. for P(X=x),

  • the e.g.f. for E(Xk_) (ie. the o.g.f. for E(Xk)) is obtained by replacing each x with x+1
  • the e.g.f. for E(Xk) is obtained from that of E(Xk_) by replacing each x with ex1
name uniform binomial Poisson geometric negative binomial hypergeometric cycle count coupon
P(X=x) [0x<n]n (nx)pxqnx=[tx](q+pt)n eλλxx! pqx pn((nx))qx=[tx](p1qt)n (Kx)(NKnx)(Nn)=[tx](NK)n_Nn_2G2(n,K0,NKn;t) [nx]n!=[tx]((tn)) (n1){x1}nx1=[tx](n!(nt)n_=1(ntn))
P(Xx) x+1n (nx)pxqnx1G1(xnx;qp) Γ(x+1,λ)x! 1qx+1 pn1((nx))(1qx+1) (Kx)(NKnx)(Nn)2G2(KN+n+x,xnx,Kx;1)[n 3] A349782(n,x) n{x}nx
E(Xk_) (n1)k_k+1=[xkk!](1+x)n1nx[n 4] nk_pkqnk1F0(kn ;pq)=nk_pk=[xkk!](1+px)n λk k!(qp)k=[xkk!]11qpx nk(qp)k=[xkk!]1(1qpx)n nk_Kk_Nk_ [xk]((ex1n)) [xkk!]1(nx+1n)=nk!i=0n(in)i(i1)k1(n+1i)k+1=k!i=0k(ik)[yi]1(nnyn)=i=0kLah(ki)(1)kiniPperms(i)cPHn(|c|)[n 5]
E(Xk)[n 6] x=0n1xkn=i=0k{ki}(n1)i_i+1=[xkk!]enx1n(ex1) x=0n(nx)pxqnxxk=i=0k{ki}ni_pi=[xkk!](1+p(ex1))nk(np)k(p+qek/n)nk1+kqnpek/n[n 7] Bellk(λ) px=0qxxk=i=0ki{k}(qp)i=j=0k1kj0qkjpk=[xkk!]11qp(ex1) pnx=0((nx))qxxk=i=0k{ki}ni(qp)i=[xkk!]1(1qp(ex1))n nk_Kk_Nk_k!2G2(Nk,knk,Kk;ξ),where ξi=(ki){k} 1n!i=0n[ni]ik=[xkk!]((exn))=[xkk!yn]1(1y)ex=[yk]11yi=0k{ki}log(11y)i=i=0k{ki}j=0ni!j![ji]=i=0k{ki}i!n![n+1i+1]=i=0k{ki}pperms(i)cp(1)|c|1Hn(|c|) [xkk!]n!(nex)n_=i=0k{ki}(1)kiniPperms(i)cPHn(|c|)=i=0k(1)n1+kii{k}n{i}ni[n 8]

the Poisson distribution's moments being the Bell polynomials is sometimes also referred to as the Dobiński's formula, and the fact that we can do saddlepointery for binomial but not negative binomial is a consequence of the e.g.f. conserving the Hayman-admissible pole at x=, whereas the negative one produces a finite-order simple pole as the nearest to the origin

note this tells you that ogf(λi.i{k})=𝔼[geomp=11+xk] and ogf(λj.kj0)=(x1)k𝔼[geomp=11xk], which means both are log-convex sequences with respect to k for all x by Hölder's inequality (𝔼[Xi]=𝔼[Xi1Xi+1]𝔼[Xi1]𝔼[Xi+1])

historical/nomenclaturewise notes

  • the abstract of [2] notes that Euler had written the number triangle of k{n} in 1755, which might imply (since they're only a factorial factor away) the Stirling subset numbers would more aptly be called Euler numbers; Unfortunately,
  • in many sources, the univariate Bell polynomials are known as the Touchard polynomials; however, per Ramanujan's notebooks, part 1, p. 11, Ramanujan had developed a lot of the same theory as Bell and Touchard over two decades prior, so either name is a slight misnomer (but the cards have fallen where they lay)

attributions

thank you very much to my dear friend MagmaMcFry (from the ConwayLife lounge) for finding the bijective proofs of some of the identities found here

see also

notes

  1. and since ddy11y=ddyy1y we can set g0(y)=11y in order for this to hold for all n by the convention 00=1; since every g0=c+(1c)y1y works, the Eulerian numbers take no opinion on 00's true value.
  2. and A075266(n)A075267(n)=[xn]loglog11xx=1n!ddx[n+xx]|x=0, since A075266(n)A075267(n)=A002208(n)nA002209(n)
  3. this is unfortunately not the kind of 2G2 that is solved by the generalisation of of Saalschutz's theorem given in my hypergeometrics page; one requires that m=N. However, note that if the G is turned into an I to become bilateral, then applying Dougall's sum evaluates to 1; that is, the integer-parameters bilaterally-terminating case of Dougall's theorem is equivalent to the fact that summing the p.d.f. over all x gives 1
  4. Zeilberger's algorithm helpfully tells us the o.g.f. fn(x) satisfies the cute little D-finite recurrence (n+1)fn+1n((1+nx)fn+(n1)xfn1)=1
  5. from the generating function, we have 1(nx+1n)=i=0n(in)1i(1nx+1+n)=i=0n(in)(1+n1+ni+ink=1(i1)k1(n+1i)k+1xk), so for k>0 we get the second line. Line 3 comes from writing it as the inverse binomial transform of its binomial transform, and the inner summation on line 4 is the expansion of [yii!]1(nyn) via the same method as #harmonic asymptotic for cycle columns took
  6. the e.g.f. for each entry here comes directly from applying the Stirling transform to the above entry's e.g.f. (ie. replacing x with ex1), providing (for the uniform dist.) a much shorter proof of the e.g.f. form of Faulhaber's theorem!
  7. g(x)=nx1+(qp)ex so xk=(j:=kn)+(W:=W(jqejp)), g(x)=exnpq+exp+exxnpq(q+exp)2, and E(Xk)kk!qn(j+W)nkWn2πk(1+W)(np)k(p+qek/n)nk1+kqnpek/n (the first by ordinary saddlepointery, the second by Stirling's approx. and truncating W(x)x for small values)
    the ratio to (np)k (asymptoted by the (p+qek/n)nk1+kqnpek/n part) is at most 1, and this paper (p. 5) says it cannot exceed 1pk; it converges to pnpk
    we have recurrence E[Xnk+1]=n(E[Xnk]qE[Xn1k])
  8. this comes from the falling moments via the identity {Ki}(1)K=k=0K{Kk}Lah(ki)(1)k

references

  1. 1.0 1.1 A. M. Odlyzko, 1995. Asymptotic enumeration methods (page 118, equation (12.10))
  2. 2.0 2.1 2.2 Mircea Dan Rus, 2025, Yet another note on notation
  3. J. G. Wendel, 1948. Note on the Gamma Function, Am. Math. Mo., Vol. 55, No. 9, pp. 563-564
  4. 4.0 4.1 Donald Knuth, 1992. Two notes on notation
  5. Nicholas A. Loehr, Jeffrey B. Remmel, May 2009. Rook-by-rook rook theory: Bijective proofs of rook and hit equivalences
  6. Lemma 2.6 of E. G. Santos, November 24, 2024. Counting non-attacking chess pieces placements: Bishops and Anassas
    for prerequisites to understand in somewhat more detail, read [5]
  7. bijective proof of Worpitzky's identity; StackExchange answer by EuYu, 2015
  8. Richard P. Stanley, Eulerian Partitions of a Unit Hypercube (1986)
  9. Fred J. Rispoli, 2008. The Graph of the Hypersimplex
  10. Pierre Simon de Laplace, 1812. Théorie analytique des probabilités, 1812
  11. L. A. Shepp, S. P. Lloyd, 1965. Ordered Cycle Lengths in a Random Permutation
  12. 12.0 12.1 L. H. Harper, 1967. Stirling Behaviour is Asymptotically Normal
  13. Richard P. Stanley, Log-Concave and Unimodal Sequences in Algebra, Combinatorics and Geometry (1989)
  14. Masaaki Sibuya, Log-concavity of Stirling numbers and unimodality of Stirling distributions (1987)
    (note i am citing it for things that it brought forth from earlier papers, otherwise there would be more here when it serves as a nice convenient grouping)
  15. O. Arfwedson, 1951. A probability distribution connected with Stirling's second class numbers, Scandinavian Actuarial Journal, vol. 1951, iss. 1-2, pp. 121-132
  16. S. Bagui, K.L. Mehra, 2024. The Stirling Numbers of the Second Kind and Their Applications (Alabama Journal of Mathematics, Vol. 47 (1), pp. 1-22) (read beginning at page 14)