Recall some of the previously defined arithmetic functions:
\(d(n)\text{:}\) number of positive divisors of \(n\)
\(\sigma(n)\text{:}\) sum of positive divisors of \(n\)
These can “oscillate” wildly: for example \(d(n)\) can be arbitrarily large (\(d(2^{k-1}) = k\)) and it also drops down to \(d(n)=2\) infinitely often (specifically, at primes). We can “smooth out the oscillations” by averaging. That is, for any \(x \gt 1\text{,}\) we consider the average value of \(d(n)\) for all \(n\) up to \(x\text{:}\)
Observe that the domain of the average function is the real numbers rather than the positive integers. At this point, this is a more or less arbitrary choice (“why not”).
Definition5.1.1.
Let \(f\) be an arithmetic function. We say \(g\) is an average order of \(f\) if \(g\) is positive and
The average order is not unique. For example, if \(g\) is an average order for \(f\) and \(g'(n) = g(n)\) for all but finitely many \(n\text{,}\) then \(g'\) is also an average order for \(f\text{.}\) Indeed, the difference between summing \(g(n)\) and summing \(g'(n)\) is some constant (finite) value; after dividing by \(x\text{,}\) the difference goes to zero as \(x \to \infty\text{.}\)
Nevertheless we will say “the average order” instead of “an average order”, just because it sounds nicer.
Subsection5.1.1Average order of number of divisors
Theorem5.1.2.
For \(d(n)\text{,}\) the number of positive divisors of \(n\text{,}\) we have \(\sum_{n \leq x} d(n) = x \log x + O(x)\text{.}\) Thus, \(\overline{d}(x) \sim \log x\text{.}\) The average order of \(d(n)\text{,}\) the number of divisors of \(n\text{,}\) is \(\log n\text{.}\) That is, \(\overline{d}(x) = \frac{1}{x}\sum_{n \leq x} d(n) \sim \frac{1}{x} \sum_{n \leq x} \log n\text{.}\)
This is Euler’s solution of the Basel problem. For now we will use this without giving a proof (many proofs exist and can be found in various textbooks or online). Second, we have seen that for \(\alpha \gt 1\text{,}\)
\begin{align*}
\frac{1}{x} \sum_{n \leq x} \frac{\pi^2}{6} n \amp = \frac{\pi^2}{12} \frac{1}{x} \lfloor x \rfloor ( \lfloor x \rfloor + 1 ) \\
\amp = \frac{\pi^2}{12} x + O(1) \text{.}
\end{align*}
The two quantities are asymptotic since
\begin{equation*}
\lim_{x \to \infty} \frac{\frac{\pi^2}{12} x + O(\log x)}{\frac{\pi^2}{12} x + O(1) } = 1\text{.}
\end{equation*}
This is not quite the same thing as saying that the average value of \(\sigma(n)/n\) should be \(\pi^2/6\text{.}\) In our statements above we averaged \(\sigma(n)\text{.}\) We have not evaluated the sum or the average of \(\sigma(n)/n\text{.}\)
We have been concerned with averages (and sums) of arithmetic functions. It would be interesting to investigate other statistical properties such as the standard deviation, or the second, third, and higher moments. (The \(k\)th moment is simply the average of the \(k\)th power. Moments are important in statistics and analysis.)
As described in the textbook’s chapter notes, there has been some study of modified averages taking, instead of all \(n \leq x\text{,}\) just those \(n \leq x\) with \(n \equiv a \pmod{q}\) for some \(a, q\text{.}\) It seems there is plenty still to be figured out in that direction.
Another way to modify the sequence would be to fix an irrational number \(\epsilon\) and a parameter \(0 \leq p \leq 1\) and take those \(n \leq x\) such that the fractional part \(\{n\epsilon\} \lt p\text{.}\) (This is a way to take a sort of “random” subset of integers.)