Show that a circle inscribed in a square has a larger perimeter than any other ellipse inscribed in the square.

**Solution**

The first step is to show that if an ellipse touches all four sides of a square then its axes must lie along the diagonals of the square (in other words, it is symmetrically placed).

Let the ellipse have centre O, semi-major axis a and semi-minor axis b. Let the line A through O contain the ellipse's major axis and the line B through O contain its minor axis. Let S be the square in which the ellipse is inscribed. Let k be the similarity operation which leaves A invariant and expands by a factor a/b perpendicular to A. (So given a general point P, with X be the foot of the perpendicular from P to A, P goes to the point P' on the ray XP such that XP'/XP = a/b.)

k takes the ellipse into the circle centre O, radius a. It also takes parallel lines to parallel lines and touching curves to touching curves, so it takes the square S to a parallelogram whose sides all touch the circle. But that means the parallelogram must be a rhombus (with equal sides). But a line segment making an angle θ to the line A is taken by k to a line segment whose length is longer by a factor √(1 + ( (a/b)^{2} - 1) sin^{2}θ), which is a strictly monotonically increasing function of θ. Since the equal sides of the square are taken to the equal sides of the rhombus, they must make equal angles to the line A. Hence they must be at 45^{o} to it and hence the line A must lie along a diagonal of the square.

The second step is to show that the ellipse with the largest perimeter is the circle. Recall that the perimeter of an ellipse is not an elementary function (it is a complete elliptical integral of the second kind). So it is not clear how to attack the problem. An analytic approach could evidently get somewhat messy.

But to pursue that a little, the tangent at the point (X, Y) of the ellipse has equation xX/a^{2} + yY/b^{2} = 1. If this is at 45^{o} (with X and Y both positive) then we have Xb^{2} = Ya^{2}. The corresponding tangent at (X, -Y) is will evidently meet it on the x-axis at the point ( √(a^{2} + b^{2}), 0), so the side of the square has length √(2a^{2} + 2b^{2}).

The perimeter is 4a ∫_{0}^{π/2} √( 1 - c cos^{2}θ) dθ, where c = (1 - (b/a)^{2}) (*). The easiest way to see this is to use the parametric equation of the ellipse x = a cos θ, y = b sin θ. Then if s represents arc length and ' differentiation wrt θ, we have s' = √( (x')^{2} + (y')^{2}) and (*) follows almost immediately.

Now a/√(a^{2} + b^{2}) = 1/√(2 - c), so we have to maximise f(c) = 1/√(2 - c) ∫_{0}^{π/2} √( 1 - c cos^{2}θ) dθ over the range [0, 1], with c = 0 corresponding to b = a or the circle.

Putting the factor 1/√(2 - c) inside the integral, the integrand becomes I(c, θ) = √( (1 - c cos^{2}θ) / (2 - c) ). We note that at θ = π/4 this is always √(1/2) irrespective of c. But for θ < π/4, the integrand is a decreasing function of c and for θ > π/4 it is an increasing function of c. So we cannot simply argue that the integrand is maximised by taking c = 0 and hence the integral also.

However, a slight elaboration of that argument does work. We can split the integral at π/4. Then for the range π/4 to π/2 we can make the substituion φ = π/2 - θ to get back to an integral over 0 to π/4 with integrand J(c, φ) = √( (1 - c sin^{2}φ) / (2 - c) ). So now we have to maximise the integral over 0 to π/4 of I(c, θ) + J(c, θ). But now it is true that c = 0 maximises the integrand at every point of the range.

We just have to square twice. So I(c, θ) + J(c, θ) ≤ √2 is equivalent to (1 - c cos^{2}θ) + (1 - c sin^{2}θ) + 2 √( (1 - c cos^{2}θ)(1 - c sin^{2}θ) ) <= 2(2 - c), or √( (1 - c cos^{2}θ)(1 - c sin^{2}θ) ) <= 1 - c/2. Squaring again, that is equivalent to 1 - c + c sin^{2}θ cos^{2}θ ≤ 1 - c + c^{2}/4 or c^{2}sin 2θ ≤ c^{2}. But the last relation is certainly true with equality iff c = 0 or θ = π/4. Hence for the integral we have equality iff c = 0.

*Comment. This is much harder if you happen to write down the perimeter in a form that does not involve trigonometric functions. Then the range does not divide so nicely - the breakpoint is at 1/√2 in the range 0 to 1. It is then not at all obvious how to get an inequality on the integral from an inequality on the integrand. I say this with some feeling, because I spent a long time stuck on this question precisely for this reason. I assumed I was expected to derive the result from some result about elliptic integrals (and did indeed prove it from considering the power series).*

© John Scholes

jscholes@kalva.demon.co.uk

27 Jan 2001