This site is supported by donations to The OEIS Foundation.
User:Natalia L. Skirrow/an introduction to Stirling numbers
foreword
This page is called introductory, and the intention is to require only familiarity with ideas of generating functions and linear algebra, but the subject has some extremely interesting parts that require more advanced methods. An elementary formulation is given for #Ramanujan's master theorem, but in #asymptotics and properties as statistical distributions, Hayman's saddle point method from [1] is used without duplicating its (quite involved) explanation. #recurrence relations lies at the very end, because an important part of it requires the Bernoulli/Gregory theory established first.
tools
Let be the number of ways to form a -multiset from an -set by drawing with repetition.
We use generating functions; these encode sequences as functions to represent operations on them; in particular, the ordinary g.f. is , and the exponential g.f. .
Multiplying g.f.s convolves their sequences in different ways; and
and are treated as complex functions, and
The notations used here ( and ) are due to Jovan Karamata from 1935
The l'hospital theorem is used implicitly; specifically, when dividing two functions at a point where both have simple poles (which is done a lot in #Ramanujan's master theorem), we consider the quotient of their residues
calculus of finite differences
We define and , and the falling and rising factorials and .
In particular, these operators and are o.g.f. convolutions with and , respectively, so by binomial-expanding their exponents, their higher-order iterates are and (with negative iterates ).
Similarly to how a holomorphic function is converged to by the evaluation of its power series , sequences are 'converged' to (eventually matched at all indexes by) the evaluation of their inverse binomial transforms; and ; as such, one can write and
We as such define a quartet of analogues,
The 'factorial powers' (Pochhammer symbols) are joined by , and , and work similarly to monomials in that (see Concrete Mathematics, page 53, eq. (2.53))
(in fact, the rules for the calculus of infinitesimal differences comes from 'zooming out' in both the input and output axes on this calculus of finite differences)
finally, the binomial theorem has falling and rising analogues; (using Gauss's hypergeometric theorem; the falling-rising joining identity directly brings it to the equivalent rising factorial identity)
power-analogue notations
It is useful to bear in mind the combinatorial definitions of these powers (and some more that we will variously see along the way);
- counts functions from to
- counts injective functions
- counts functions with preimages totally ordered into lists
- counts surjective functions (a proposal from [2])
- counts surjections with preimages cyclically ordered
- counts surjections with preimages totally ordered (note this deviates from some sources (like the Wikipedia page) which use this notation to mean )
while the product definition lets us evaluate Pochhammer symbols like on noninteger , the gamma function lets us compute them on noninteger ! [3] provides the inequality for , with equality at boundaries; taking series expansions shows that as , the logarithms of the bounds' error ratios converge; , so the lower inequality is about as good as the upper one; the inequalities both flip into s for and the now-upper bound continues to work as an approximately-as-good counterpart to the obvious one. (see also Gautschi's inequality)
the sequence (the inverse binomial transform, analogous to the power series) can be considered as a linear transformation on vector spaces , we can take its inverse,
can be represented as an e.g.f. convolution, , and as an o.g.f. (so )
since (when computing ) we can choose outputs to have preimages, we have ; that is, is the binomial transform of . By the same reasoning elsewhere, we have
In this case, is the statement "if you have two sequences , counting the numbers of type- and type- structures on -element labelled sets, with e.g.f.s , then (denoted ) is the number of ways to put elements into type- objects, then put these into a type- object."
In particular, we can associate the exponent of with another variable , to get that is the number of ways to put elements into -objects in one -object.
the most important sequences in this case are , , (with e.g.f.s , and ).
Note that cycles must be nonempty, but for the other two we subtract from the e.g.f.s to count their nonempty forms; we denote this with an overline.
examples
- , meaning there is a bijection between lists and sets of cycles (ie. permutations' cycle decompositions).
if both - and -type structures are totally ordered (ie. in both, each instance of an -node structure is part of an equivalence class of that are node-permutations of each other, and the th terms of both are divisible by ), then the nodes can be considered unlabelled to divide by (considering their e.g.f.s as o.g.f.s). In particular, this means that o.g.f. composition for sequences on unlabelled sets imbues total orderings on all but the leaf nodes of the partition-tree.
- , and (for ) there are ways to put unlabelled objects into a tree of lists layers deep (since ). Equivalently, this is the number of ways to arrange brackets around points such that each of them is layers deep, and all bracket-pairs contain at least one point.
- allowing all lists to contain singletons instead of only the depth- ones can be recursively constructed 'from the leaves up'; we define and thereafter ; ie. must be a singleton or contain at least two members of the previous iteration.
- this is difficult to analyse directly, but the fixed point of this iteration, , is not! Since the recurrence is in terms of itself rather than the preceding iterate, by the quadratic formula we get , the functional inverse of , and so f∞(k)=A001003(k-1).
- since its inverse's derivative, , becomes zero at , at which the inverse's maximum is , this is 's radius of convergence, so
Stirling-type triangles
As mentioned, a permutation is a set of cycles, so we define .
Since (the g.f. for a general degree- polynomial is some degree- numerator times this), we have , so can write . (The reason for the unfortunate historical use of is that this gives .)
We can form a correspondence; adding an th element to a length- permutation can be done by
- choosing one of the existing elements, mapping the th to it, and mapping its previous preimage to the th
- mapping the th to itself
in particular, each permutation has successors, but each of the resultant -element permutations has only one preimage. Specifically, in there are elements for each in and one for each in , giving .
- for the representation in elementary symmetric polynomials and a bijective proof of its rook-theoretic interpretation, see Stirling factoradics
These count partitions into sets of nonempty sets;
By binomial-expanding the e.g.f. for the th column, ; the coefficient of is .
By the same kind of argument as the cycle numbers, the th element can be placed in its own set (in one way) or one of the existing sets (in ways), giving .
Writing as , this becomes . Since , the column o.g.f. is , which the e.g.f. binomial expansion thus shows partial-fraction-decomposes to .
The product form is rewritable in factorial powers as , which (by the last joining identity) equals . This means that according to the o.g.f.s of their columns and rows respectively (interpreting negative-exponent rising factorials as Puiseux series from instead of ), we have .
this can be equivalently written exceedingly elegantly (using the Pochhammer reflection identities on binomial coefficients) as
note that by expanding out the denominator of the column g.f.s, one gets the linear recurrence . The cycle numbers, meanwhile, cannot be similarly decomposed into a simple summation; using the polynomiality of the diagonals of the conjoined triangle (see [4], page 11) and the polynomials' linear recurrences (see my MathOverflow answer), the cycle numbers can be computed in a singly-nested sum involving subset numbers, translating to a doubly-nested sum over elementary functions; no singly-nested such sum is known for them.
- see also rook configurations for how enumerates nonattacking -rook configurations beneath the diagonal in an chessboard
These count sets of nonempty lists, and have a closed form directly from their e.g.f., making them exceptional in a few ways;
The combinatorial interpretation of said closed form is pretty straightforward; since enumerates ways to place partitions in the intervals of a length- unlabelled list, to split it into an ordered list of subsections (identified only by their lengths); we multiply by to make it totally-ordered, then divide by to count equivalence classes under permutations of the subsections.
The combinatorial interpretation directly yields the generatingfunctionological one; for each -cycle permutation, partition the cycles into subsets, then use these sets of cycles as permutations, ie. lists (either by interpreting them as a map or by Foata's transformation fondamentale)
which can be rewritten
(which is a direct consequence of the Pochhammer-binomial theorem where , and can be rewritten as )
Note that since the ratio between two Lah numbers at a constant displacement in the Lah triangle is a rational function of their coordinates, the row and diagonal o.g.f.s are hypergeometric! and . The latter (obtained by Euler's hypergeometric reflection formula) is interesting, because the th diagonal's g.f.'s numerator is the g.f. of the th row of the Narayana triangle A001263, so .
- the number of strict order-homomorphisms from a array (considered as a poset, with each pair of elements ordered by ) to a length- list (ie. such that for two elements , implies the same about the indexes their mapped to in the list, ).
- pairs of strictly increasing length- lists with elements such that for all ,
- pairs of monotonically-increasing length- lists with elements (where th term is first index in which an element appears in the previous bullet point's lists) such that for all ,
general matrix multiplications!
In general, for a Stirling-type array and a slightly more general array ,
that is, given that is the number of ways to partition labelled nodes into a set of -egf-type structures, and the ways to partition them into a (set of -ogf-type structures of) -egf-type structures, is the number of ways to put labelled nodes into any number of labelled -type structures, put these into -type structures, and put these into a set of -type structures.
In particular, , so is the number of ways to totally order all but an arbitrary selection of of elements, then put those into a permutation; equivalently, to cyclically order all but of elements, which can then be added as an additional member of the set of cycles in , making this equal to !
Ward's theorem
If one has a combinatorial structure such that the number of instances of with (labelled) nodes is given by , with e.g.f. , the number triangle that counts partitions into sets of s, , then the inverse binomial transform of the diagonals gives a partition triangle of the same type.
ie. , meaning counts the number of ways to partition nodes into a set of s, which are identical to s but have one fewer singleton.
This means and count partitions into singleton-free subsets and cycles (the latter corresponding with permutations without fixed points).
for the second-order Lah numbers, we have
(See [6], which gives two citations for its discovery.)
row g.f.s of the opposite kinds
subset row sums (Bell numbers A000110) and o.g.f.s (Bell polynomials)
The th row sum of the subset numbers is called the Bell number A000110(n), and the row o.g.f.s are called Bell polynomials, .
The Bell numbers aren't as simple as one might expect from their counterparts the factorials, and getting an asymptotic characterisation as precise as Stirling's approximation entails saddle point approximation. However, we can do some elementary things!
Taking the average value of across all sets of sets, , and the upper bound can be changed to since the th summand cancels itself out, giving .
We also have that (entries A039810=A130191) by the matmul correspondence
Since we know that , ie. , we can use the formula from #binomial transforms for ; this is Dobiński's formula, and its practical use is the other way around; knowing the function by which to multiply in the e.g.f. of a polynomial sequence, in terms of said polynomial's expansion.
Dobiński's formula is not to be confused with the fact that , which is sometimes called the EXP transform but is a case of Faà di Bruno; it can be rewritten via the multivariate Bell polynomial (for which each size of subset is 'coloured' in its own variable)
cycle row rising g.f.s
having looked at and , it makes sense to look at the function !
similarly to before, we have (entries |A039814|=|A325872|); this tells us that .
By setting , we likewise get the antiBell numbers, A007840 (enumerating lists of cycles)!
combinatorial interpretations of Pochhammer calculus
- this section now moved to /binomial-type theorems
the duality
As explained in [4], the duality shown in the o.g.f.s has a direct combinatorial proof in terms of Richard P. Stanley's poset homomorphism duality theorem. In fact, it works for every variety of generating function, importantly also the falling ones; becomes ; that is, within the radius of convergence (of from ), and so we can get the expansion of the th-order partial sums of the sum of reciprocals of th powers!
- note that pretty often (including in the primary entry) the indexing is used, but is Euler's and Knuth's and Foata's and Hirzebruch's and Luschny's (see the latter's page); be careful
Let be the number of permutations of elements in which of the pairs of consecutive elements have (ie. are 'rises'), and have rises.
To establish the recurrence combinatorially, we can surject a permutation into a 'landscape' of rises and falls, then with rises,
- one can place the th element at the beginning to add a fall, or in an interval that was a rise (the two new intervals that it's split into are a rise and a fall), for insertions that conserve the rise-count
- likewise, since there are intervals from which the rises are subtracted for falls, one can place it at the end or inside a fall to create a new rise in ways
which give us
writing (where is known as the th Eulerian polynomial), the recurrence turns into diffeqs
which is actually the quotient rule; note that if we define , we get precisely that diffeq-recurrence in the numerators, so since , is the th iterate of the map starting at .
In particular, since the operation 'differentiate and then multiply by ' multiplies the th term of the series expansion by without shifting it, we have that for .[n 1]
So, (and ), and likewise (the latter following from ).
known as Worpitzky's identity; a bijective proof is given in [7], but a more direct one is as follows:

- is the number of functions from to
- is the number of pairings , where are (permutations of with falls) and are (placements of identical markers into intervals)
- to biject from the left to right side,
- plot the function's output, then connect the nodes in reading order. the order in which the line lands upon them is a permutation of the initial list; let be the sequence of positions along the line (here )
- place tally marks along each line segment for each unit change to position; the descents each have at least , the ascents have at least
- after placing the mandatory tallies, the remaining have intervals between successive nodes to go into (in stars and bars, |*||*|*|***|**|||); the red nodes are fixed at the minimum and maximum height, so act to shift all or none of the function respectively
We can put the Eulerian numbers inside the monomials in the formula for subset numbers, to get
- in the other convention, it can be written as ; by reflection,
and the bijective proof of this (in the manner of stars and bars) is that one can represent each member of (a list of sets) by sorting the sets in ascending order then concatenating them together, with s placed at intervals which were formed by concatenations and s otherwise; count the ascents, then a fall must be a bar but a rise can be either a comma or bar; there are commas, distributed amongst the ascents in ways.
We can also get this from the fact that subset numbers are the inverse #binomial transforms of monomials; however, unlike the Bell numbers section, we use the o.g.f. rather than e.g.f. relation. . (Since , )
Eulerians in the hypercube!
taking a set of uniform random variables in and replacing each variable with its rank (the number of variables smaller than it) produces a uniformly-selected random permutation. This is useful, because it means the chance that a random permutation satisfies some property (ie. a set of inequalities between some of its elements) is equivalent to the measure (hypervolume) in that satisfies it (see also bitwise permutations for a little application)
the short single-page note [8] explains a mapping that bijects the regions of the hypercube with rises as a permutation, into the region whose sum of coordinates is between and (a polytope bounded by hypersimplices; [9] explains much more about them)
- map into (where ); each ascent wraps around and adds to the sum, and each 's contribution is cancelled out by being subtracted from and added to , except for the last (which allows the image to be in the interval between two hyperplanes, instead of just within one)
as such, laying an -cube along its -agonal, then chopping into slices of length and melting down the parts produces a row of the Eulerians!
note this was known to Laplace as of 1812 in [10]!
calculus of the wrong kinds
from binomial expansions, and . What about the expansion in Pochhamers of the derivative of a Pochhammer?
via the product rule, it is
however, it is also elucidating to do gfologically
Note that this can also be written as
Faulhabernoulli's formula
We can write
if we define the 'ordinaryised' Bernoulli numbers (th of the standard definition as the e.g.f. expansion), then since , , so .
Plugging this in, we obtain the very elegant ; that is, the 2D array of coefficients is just a 1D sequence in the first column with elements extruded into diagonals.
Gregory's dual thereto
going the other way around, we have the analogue,
again, we must define a sequence for the expansion this unpleasant reciprocal of a pleasant e.g.f.; we define (whereas usually , so ; the alternate in sign (every even term after the th is negative), but the are all negative after the th). Then, as before,
so we have , ie. , or
We can represent this as a single coefficient extraction from a bivariate, just like the monomial summation. Knowing , we have that
so,
asymptotics and properties as statistical distributions
For the cycle numbers, the existence of helps us,
so at , the centre of mass of the distribution is ! meanwhile, the variance, ; in general,
meaning that (since ) both the expectation and variance of the cycle count of a length- permutation are asymptotic to . [11] proves that the distribution is asymptotically normal, so (asymptotically, up to Puiseux refinement) completely characterised by these, since higher-order moments can be neglected.
- (Lambert W function)
so an asymptotic on the th Bell polynomial is
giving us the final result that the centre of mass of the th row is (although the refinement, , changes the ratio of the estimates for the 192nd term (877 bits long) from 1.000423 times overapproximation to 1.00000734 under-; <1/57th the error!)
Since we know (from the section on them) that this logarithmic derivative equals , this can be slightly more nicely expressed as .
Likewise, knowing the variance equals (where ). Resolving , we get
[12] gives an analogous result to the first kind; asymptotically, these first two moments describe the normal distributions they converge to.
row log-concavity inequalities
the cycle row o.g.f.s (rising powers) have roots lie in an arithmetic progression over the nonpositive reals, and [12]'s lemma 1 shows that the subset row o.g.f.s' (Touchard polynomials') roots are also real; theorem 2 of [13] is that this property makes them at least as log-concave as the binomial coefficients, ie. (and it seems the excess ratio is monotonically decreasing; , which is a lower bound when varying )
[14] compiles a lot of nice results; theorem (3.2) is that since , for subsets we have the slightly stronger , with excess error ratio on the right end (to which it also seemingly decreases monotonically)
For the Lah numbers, this section of my page on A058349 provides some more general results about asymptotics for their o.g.f.s (and modifications covering all confluent hypergeometrics with respect to a single numerator Pochhammer).
asymptotics along rays
a (non-Hayman) saddle point method can be used slightly more directly; write , then
so , and
this allows you to asymptoterate by dividing by , then (as realised by [15]) setting and using lets you asymptoterate the cycle numbers!
transforms
the factorial integral
The classical integral for the factorial, , implies the useful properties of the two transforms we will be using. As the area under a graph, it can be stretched times, . By reassigning , .
- the Mellin transform is , so this is rewritable as . By summing,
- ; by swapping and around, . The Laplace transform is so we can write
e.g.f.s to o.g.f.s (Laplace)
as per above, we can define so that . (This is most of why the Laplace transform is so useful for differential equations; , where the operator does truncating division, ; through this, ordinary differential equations with constant coefficients (that define e.g.f.s of linear-recurrent sequences) become arithmetic equations (that define rational o.g.f.s); it can be applied to solve differential equations that define the g.f.s of a variety of D-finite recurrences also)
o.g.f.s to r.g.f.s (Mellin)
. As such, . (This is a variant of the Mellin transform, which we will denote as )
via this, the previous transform for the zeta function can be rewritten as
As another example, and in particular
convergence
for a function that is as and as , the Mellin transform converges for
o.g.f.s to f.g.f.s
Since ,
to recap on sequence transforms,
| 's name | subset (conventionally "Stirling") |
cycle | subset |
cycle
|
- we know that , so ! (the last step being to divide both the 's reference frame's inside and outside by ) As such, we have the two simultaneous interpretations , which translates to a pair of dual identities
- likewise, , so ; meanwhile, since , we can apply to both sides of a coeff-extraction , which translates to
- meanwhile, but (which equals by the connection identity), so
- (=A105794(n,k))
- ↑ we can also write the coefficient as
- ↑ where ; so we can rewrite it , where the inner sum can be rewritten .
the reason for the leading minus sign, informally, is that the sequence requires (though in general does not prohibit ) and multichoose-transforms forwards into , and exponentially-growing infinite sums resolve to negative values by analytic continuation, so the sign flips it back around
o.g.f.s to mixed kind (Lapllin transform)
putting these together, we can define an operator that performs both transforms in different variables; let , then
Note that , which can be rewritten ; note that equals , the thing we have a Mellin transform for! Writing the integral, . Defining , (and via Euler's reflection formula, ).
This is Ramanujan's master theorem.
note that the Euler's-reflection-formula'd version is a deformation of Cauchy's formula into an infinitesimal-width contour to infinity around the negative reals!
(we use the negative reals rather than the positive, to use the branch cut we've induced along them)
note that having proven the first case combinatorially and second case analytically, their simultaneous truth is logically equivalent to the statement of Euler's reflection identity, giving an accidental proof of that as well!
Some special cases of its use:
- the factorial integral comes by setting ; is continued to equal for all complex
- if we denote by , then it behaves quite similarly to ; we have
- (the somewhat awkward arrangement is because it needs to converge along negative reals)
- the first two work in the same manner as ordinary coefficient extraction, but the last two work based on being a complex function of
- for instance, we can start with , then use the second law for ; by dividing both sides by and using the recurrence for the factorial, we get ; we know ; as such, we get the identity over the integers which is maybe a little bit useful sometimes
- Faulhaber's formula becomes . The in the numerator contributes , so , the Hurwitz incomplete zeta function
- becomes . By substituting in , we get that (by the zeta function's reflection formula)
- for the Stirling numbers, we write as , then
- since and ,
- since and ,
- since exponents of g.f.s correspond with nested convolutions of sequences, these two translate to sums of products over partitions
knowing that , we can take the e.g.f. for this as a sequence in ,
the derivative in is ; we can rewrite this as . This is a difference between two of the case of the previous Hurwitz zeta integral, , so , which Mathematica says is valid for .
So, , ie. . By Ramanujan's master theorem again, we get , so
harmonic asymptotic for cycle columns
Knowing that , we can write . It makes sense to think of this instead as , since (via Faà di Bruno) the counts sets of its input's object type, and the interior counts cycles 'coloured' with variables that happen to be harmonic numbers. As such, an e.g.f. coefficient extraction can be written as a sum over permutations.
Then since , we have .
this is useful! if one fixes then as , we can rewrite it as ; so we can write
(a refinement to the DLMF's 26.8.40)
Defining the coefficient , noting that constructing derangements by adding cycles to preceding cases gives . Writing this is . Since , writing , we have so (where the approximation reduces this to ), so
note that there is a dual!
these numbers (which arise when interpreting the extension of the cycle row g.f.s backwards as a Taylor rather than Laurent series, and also in expansions of recursive integrals of powers of logarithms) are covered thoroughly in section Subpowers with negative integer exponents beginning p.12 of [2]
Stirling polynomials
The fact can be rewritten . (Since , each of the summands must be at least , so the inner sum has a nonzero number of summands iff , or (the empty partition) for .) As such, is a polynomial in of degree , and therefore so too is the more pleasant index-shift !
and by the same argument, becomes
there is a name for these, the Nørlund/generalised Bernoulli numbers! see this answer for how the cycle triangle's columns further leftwards (with their zeroes divided out) give coefficients for higher-order versions of #Faulhabernoulli's formula.
for , we have
- (due to )
- [n 2]
recurrence relations
- see also my big math.se post about this
linear
o.g.f.s
unravelling the cycle row o.g.f. and subset column o.g.f. as products gives
(where denotes making a multiset by drawing from a set with multiplicity, and the latter sum is just a run-length encoding)
only the latter can be partially unravelled into preceding cases, to give the recurrence
e.g.f.s
likewise,
which becomes
these become more evident when written via the Stirling powers; the coefficient tells you how many ways you can take out elements and arrange them into the first list/cycle/set in the ordered list of lists/cycles/sets
duality
employing the Stirling duality on e.g.f.s (from #Ramanujan's master theorem) gives
or
another formulation can also be derived with indexes shifted; this lets you use it down to without division-by-0 issue
as shown there, we have
moments of statistical distributions
[16] gives applications to a few kinds of random variables' factorial and ordinary moments
as a recap,
- binomial is "how many heads in tosses"
- Poisson is "if [event] occurs randomly (chance unaffected by past occurrences, like radioactive decay) with instances expected in the measuring duration, how many are actually observed"
- geometric is "how many tails before one heads"
- negative binomial is "how many tails before heads"
- coupon is the coupon collector's problem; the random variable is how many times one must roll an -sided fair die for all faces to land up at least once (I copied the analysis done for this table (slightly more verbosely) into there, since it superseded the old method)
given the o.g.f. for ,
- the e.g.f. for (ie. the o.g.f. for ) is obtained by replacing each with
- the e.g.f. for is obtained from that of by replacing each with
| name | uniform | binomial | Poisson | geometric | negative binomial | hypergeometric | cycle count | coupon |
|---|---|---|---|---|---|---|---|---|
| [n 3] | A349782(n,x) | |||||||
| [n 4] | [n 5] | |||||||
| [n 6] | [n 7] | [n 8] |
the Poisson distribution's moments being the Bell polynomials is sometimes also referred to as the Dobiński's formula, and the fact that we can do saddlepointery for binomial but not negative binomial is a consequence of the e.g.f. conserving the Hayman-admissible pole at , whereas the negative one produces a finite-order simple pole as the nearest to the origin
note this tells you that and , which means both are log-convex sequences with respect to for all by Hölder's inequality ()
historical/nomenclaturewise notes
- the abstract of [2] notes that Euler had written the number triangle of in 1755, which might imply (since they're only a factorial factor away) the Stirling subset numbers would more aptly be called Euler numbers; Unfortunately,
- in many sources, the univariate Bell polynomials are known as the Touchard polynomials; however, per Ramanujan's notebooks, part 1, p. 11, Ramanujan had developed a lot of the same theory as Bell and Touchard over two decades prior, so either name is a slight misnomer (but the cards have fallen where they lay)
attributions
thank you very much to my dear friend MagmaMcFry (from the ConwayLife lounge) for finding the bijective proofs of some of the identities found here
see also
- second-order Stirling series (on a really really cool Puiseux series refinement due to Ramanujan of the asymptotic for counting <math>n<math>-derangements, and another for the number of cycles across all <math>n<math>-derangements)
- my page on A058349 and a certain bijection
notes
- ↑ and since we can set in order for this to hold for all by the convention ; since every works, the Eulerian numbers take no opinion on 's true value.
- ↑ and , since
- ↑ this is unfortunately not the kind of that is solved by the generalisation of of Saalschutz's theorem given in my hypergeometrics page; one requires that . However, note that if the is turned into an to become bilateral, then applying Dougall's sum evaluates to ; that is, the integer-parameters bilaterally-terminating case of Dougall's theorem is equivalent to the fact that summing the p.d.f. over all gives
- ↑ Zeilberger's algorithm helpfully tells us the o.g.f. satisfies the cute little D-finite recurrence
- ↑ from the generating function, we have , so for we get the second line. Line 3 comes from writing it as the inverse binomial transform of its binomial transform, and the inner summation on line 4 is the expansion of via the same method as #harmonic asymptotic for cycle columns took
- ↑ the e.g.f. for each entry here comes directly from applying the Stirling transform to the above entry's e.g.f. (ie. replacing with ), providing (for the uniform dist.) a much shorter proof of the e.g.f. form of Faulhaber's theorem!
- ↑ so , , and (the first by ordinary saddlepointery, the second by Stirling's approx. and truncating for small values)
the ratio to (asymptoted by the part) is at most , and this paper (p. 5) says it cannot exceed ; it converges to
we have recurrence - ↑ this comes from the falling moments via the identity
references
- ↑ 1.0 1.1 A. M. Odlyzko, 1995. Asymptotic enumeration methods (page 118, equation (12.10))
- ↑ 2.0 2.1 2.2 Mircea Dan Rus, 2025, Yet another note on notation
- ↑ J. G. Wendel, 1948. Note on the Gamma Function, Am. Math. Mo., Vol. 55, No. 9, pp. 563-564
- ↑ 4.0 4.1 Donald Knuth, 1992. Two notes on notation
- ↑ Nicholas A. Loehr, Jeffrey B. Remmel, May 2009. Rook-by-rook rook theory: Bijective proofs of rook and hit equivalences
- ↑ Lemma 2.6 of E. G. Santos, November 24, 2024. Counting non-attacking chess pieces placements: Bishops and Anassas
for prerequisites to understand in somewhat more detail, read [5] - ↑ bijective proof of Worpitzky's identity; StackExchange answer by EuYu, 2015
- ↑ Richard P. Stanley, Eulerian Partitions of a Unit Hypercube (1986)
- ↑ Fred J. Rispoli, 2008. The Graph of the Hypersimplex
- ↑ Pierre Simon de Laplace, 1812. Théorie analytique des probabilités, 1812
- ↑ L. A. Shepp, S. P. Lloyd, 1965. Ordered Cycle Lengths in a Random Permutation
- ↑ 12.0 12.1 L. H. Harper, 1967. Stirling Behaviour is Asymptotically Normal
- ↑ Richard P. Stanley, Log-Concave and Unimodal Sequences in Algebra, Combinatorics and Geometry (1989)
- ↑ Masaaki Sibuya, Log-concavity of Stirling numbers and unimodality of Stirling distributions (1987)
(note i am citing it for things that it brought forth from earlier papers, otherwise there would be more here when it serves as a nice convenient grouping) - ↑ O. Arfwedson, 1951. A probability distribution connected with Stirling's second class numbers, Scandinavian Actuarial Journal, vol. 1951, iss. 1-2, pp. 121-132
- ↑ S. Bagui, K.L. Mehra, 2024. The Stirling Numbers of the Second Kind and Their Applications (Alabama Journal of Mathematics, Vol. 47 (1), pp. 1-22) (read beginning at page 14)