Loading [MathJax]/jax/output/HTML-CSS/jax.js

Saturday, October 24, 2015

Convex Functions and Jensen's Inequality (Tuklas Vol. 17, No. 5 - October 24, 2015)

CONVEX FUNCTIONS AND JENSEN'S INEQUALITY

J. L. W. V. Jensen had the following to say about the beauty of convex functions [4]:
It seems to me that the notion of convex functions is just as fundamental as positive function[s] or increasing function[s]. If [I] am not mistaken in this, the notion ought to find its place in elementary expositions of the theory of real functions.
Convex functions play numerous roles in different fields in pure and applied mathematics, most notably in geometry, real analysis, probability theory, and nonlinear optimization. The modern-day appreciation and depth of the theory of convex functions is heavily attributed to Jensen, known for his inequality on convex functions that has become a stable discussion in any material on the theory of convexity. Jensen's Inequality has since then taken on various forms---a discrete form, a form which uses integrals, and a form using probabilities, among others. In relation to Jensen's statement on convex function, Steele refers to convexity as the ``third pillar'' of mathematical inequalities, along with positivity and monotonicity. [5]

The aim of this article is to provide the reader with initial insights on the theory of convex functions and the intuition and applications of Jensen's Inequality. As we shall see later, convex functions and Jensen's Inequality has earned a special spot as an indispensable technique in dissecting Olympiad-level inequalities. One of the most fundamental results related to Jensen's Inequality is that it can lead to what is called the generalized Arithmetic Mean-Geometric Mean Inequality. For a more detailed discussion of convex functions, the reader is encouraged to look at the references used for this article.

Before convex functions are introduced, we must first define what a convex set is. A set S is a convex set if the line segment connecting any two points in S is contained entirely in S. Mathematically, the set S is convex if for any x,yS and for any θ[0,1], the number θx+(1θ)y, called the convex combination of x and y, is a member of the set S.
Figure 1: The two-dimensional sets above are examples of convex and non-
convex sets. The leftmost figure is a convex set, while the center and right-
most figures are not.

Convex functions, on the other hand, are functions defined on a convex set. A function f is said to be convex on the interval [a,b] if for any x,y[a,b] (with x<y) and θ[0,1],f(θx+(1θ)y)θf(x)+(1θ)f(y).Graphically, this definition means that if we take any two points A and B on the graph of a convex function f, the line segment AB should be above the graph y=f(x). This characterization also means that the graph of convex functions are U-shaped or bowl-shaped on the interval [a,b]. Speaking in the language of averages, for a convex function, the function value of the weighted average of x and y is less than or equal to the weighted average of the function values at x and y.

Figure 2: The graph of a convex function.

Calculus is also used to determine whether a function is convex. A function is convex on an interval [a,b] if all tangent lines to the graph y=f(x) on the interval lie below the graph of the function. Furthermore, a function is convex if its second derivative is nonnegative for all x[a,b].

Examples of convex functions include the exponential function f(x)=eax for any aR, powers of absolute values f(x)=|x|p for p1, and the maximum function f(x)=max{x1,x2,,xn} for real numbers x1,x2,,xn.

There is a very nice quality of convex functions that makes this entire class of functions an attraction in the field of optimization. Notice that if a function f is convex on an interval [a,b], then its minimum value on [a,b] is unique. A convex function also attains its maximum value at the endpoints of the interval considered; that is, if f is convex on [a,b], then the maximum value of f is either f(a) or f(b). Furthermore, any local minimum is a global one; a strictly convex function (where the inequality is strictly less than) admits at most one minimum. In usual optimization, one may be looking at several local minima, but with (strictly) convex functions, we are guaranteed that there is only one minimum.

The most important result related to convex functions is Jensen's Inequality. This inequality states that if the real-valued function f:[a,b]R is convex, x1,x2,,xn are numbers from an interval [a,b], and p1,p2,,pn are numbers in [0,1] whose sum is 1 (i.e. ni=1pi=1), then it holds that f(p1x1+p2x2++pnxn)p1f(x1)+p2f(x2)++pnf(xn).Speaking in terms of weighted averages, Jensen's Inequality means that the function evaluated at the weighted average of the xi's is less than or equal to the weighted average of the functional values at the xi's. Notice also that if n=2, then the inequality reduces to the definition of convexity introduced earlier. Equality is attained when x1=x2==xn.

The power of convex functions and Jensen's Inequality lies in the problem-solver's ability to spot a convex function in the given problem. Once the appropriate convex function has been identified, then the results above may be used to complete the solution to the problem. This step, however, is sometimes the most difficult when attempting to resolve an inequality problem.

To demonstrate the power of convex functions and Jensen's Inequality, we shall prove the generalized Arithmetic Mean-Geometric Mean (AM-GM) Inequality yp11×yp22××ypnnpiy1+p2y2++pnyn,where the yi's are nonnegative numbers and the pi's are numbers in the interval [0,1] that sum up to 1.

It turns out that one only has to resort to the exponential function f(x)=ex, which we note is a convex function on the entire real number line. (We can verify its convexity based on its graph or using the defintion for convexity). Thus, for any x1,x2,,xnR and pi's satisfying the condition above, it holds that ep1x1+p2x2++pnxnp1ex1+p2ex2++pnexn.Using the laws of exponents, we may rewrite the above equation as (ex1)p1×(ex2)p2××(exn)pnp1ex1+p2ex2++pnexn.Let yi=exi for all i=1,2,,n. Then the previous equation can be written as yp11×yp22××ypnnp1y1+p2y2++pnyn.Thus, the generalized AM-GM Inequality has been proven.

Another problem in which convexity and Jensen's Inequality can be used is the following [5]:
In an equilateral triangle with area A, the product of any two sides is equal to 4A/3. Show that there exist two sides whose lengths have a product that is greater than or equal to 4A/A.
To solve this, one may find useful some formulas that give the area of a triangle given side lengths and measures of interior angles. In particular, we note that if a, b, and c are side lengths of a triangle and α, β, and γ are the angles opposite a, b, and c, respectively, thenA=12absinγ=12acsinβ=12bcsinα.The rest of the solution is left to the reader; as before, to use Jensen's Inequality, we must first be able to identify a convex function from the given problem. What do you think is the convex function we can use?

In the theory of inequalities, there are three pillars---positivity, monotonicity, and convexity. As we have shown in the foregoing exposition, convex functions significantly recur in the study of various mathematical relationships between numbers and functions. Its far-reaching beauty, with the aid of Jensen's Inequality, will continue to grace various applications in the optimization of functions and other fields involving a mastery of mathematical inequalities.

ABOUT THE AUTHOR:
Len Patrick Garces is an Instructor at the Ateneo de Manila University. He obtained his Master of Applied Mathematics major in Mathematical Finance degree at the Ateneo de Manila University in 2015.

REFERENCES:
[1] Boyd, S. & L. Vanderberghe. (2009). Convex Optimization. Cambridge University Press.
[2] Bautista, E. P & I. J. L. Garces. (2010). Mathematical Excursions: A Problem-Solving Primer for Trainers and Olympiad Enthusiasts. C&E Publishing, Inc.
[3] Manfrino, R. B, J. A. G. Ortega & R. V. Delgado. (2005). Inequalities: A Mathematical Olympiad Approach. Birkhauser.
[4] Niculescu, C. P. & L. E. Persson. (2004). Convex Functions and their Applications: A Contemporary Approach. Springer.
[5] Steele, J. M. (2004). The Cauchy-Schwarz Master Class: An Introduction to the Art of Mathematical Inequalities. Cambridge University Press.

OLYMPIAD CORNER
Team Selection Test for the 55th IMO, Bulgaria

Problem:  Find the least positive real number α with the following property: if the weight of a finite number of pumpkins is 1 ton and the weight of every pumpkin is not more than α tons then the pumpkins can be distributed in 50 boxes (some of the boxes may remain empty) such that there are no more than α tons of pumpkins in every box.

Solution: 
Claim: the real number α that satisfies the above conditions is α=251.

Suppose that α<251, and let k0 be a nonnegative integer such that151×2kα<151×2k1Consider 51×2k pumpkins, each having a weight of 151×2k tons. Since there are 50 boxes, and 50<51×2k for any nonnegative integer k, then by the pigeonhole principle, there exists a box with at least two pumpkins, with combined weight of at least 151×2k1>α. This is a contradiction.

We now show that α=251. Suppose that we have a total of m pumpkins, and we place a pumpkin in each of the m empty boxes. Take two of the lightest boxes; if the combined weight is at most 251, we transfer all the pumpkins into one of these boxes and remove the other. When the operation terminates, let n be the number of boxes remaining. Suppose x1x2xn represent the weights (in tons) of pumpkins in the n boxes. Note that based on the procedure, xi251 for all i=1,2,...,n, which means we have distributed m pumpkins into n boxes such that there is no more than 251 tons of pumpkins in every box. Moreover, since x1+x2>251 and x1x2, then x2>151. Thereforex1+x2++xn>251+(n2)151But since the sum of weights is $1$, then251+(n2)151<1n<51Hence we have distributed the pumpinks in no more than 50 boxes such that weight in each box does not exceed 251.

SOLUTIONS
(for October 10, 2015)
  1. Determine the values of x such that 2x+3x4x+6x9x1. (Taken from 101 Problems in Algebra by Andreescu and Feng)
    (Solved by Jarrett Ian G. Lim [Philippine Academy of Sakya] and Farrell Eldrian Wu [MGC New Life Christian Academy]; partial credit for Joyce Heidi Ong [Chiang Kai Shek College], Steven Reyes [Saint Jude Catholic School], and Madeline Tee [Jubilee Christian Academy])

    SOLUTION: 
    Note that, for any value of x,2x3x+4x6x+9x+1=12[(4x26x+9x)+(4x22x+1)+(9x23x+1)]=12[(2x3x)2+(2x1)2+(3x1)2]0,because we just have a sum of squares of real numbers. This means that2x3x+4x6x+9x+1012x+3x4x+6x9xfor all x. Therefore, the solution set is R.
  2. Let n, a, and b be positive integers. Prove thatgcd(na1,nb1)=ngcd(a,b)1.(First Stage, Moscow Mathematical Olympiad, 1995)
    (Solved by Jarrett Ian G. Lim [Philippine Academy of Sakya], Joyce Heidi Ong [Chiang Kai Shek College], and Farrell Eldrian Wu [MGC New Life Christian Academy]; partial credit for Madeline Tee [Jubilee Christian Academy])

    SOLUTION: 
    What we will show is that the expression on the left will divide the expression on the right, and vice-versa.

    First note that gcd(a,b) must divide both a and b. Also, for any expression nx1, if y|x, thennx1=(ny1)(nxy+nx2y++1).Since gcd(a,b) divides a and b, then ngcd(a,b)1 must also divide both na1 and nb1, that is, ngcd(a,b)1 is a common factor of na1 and nb1. Consequently, ngcd(a,b)1 must divide gcd(na1,nb1).

    We now work with the second part, which is to prove that gcd(na1,nb1) divides ngcd(a,b)1. Recall that if gcd(a,b) denotes the greatest common divisor of a and b, then there exists x, y such that axby=gcd(a,b). Specifically, we can choose x and y to be positive. Now, following a similar argument as in the first part, we now know that na1 will divide nax1 and nb1 will divide nby1.

    This means that gcd(na1,nb1) divides nax1 and nby1. So gcd(na1,nb1) divides (nax1)(nby1).

    Note that(nax1)(nby1)=nby(naxby1)=nby(ngcd(a,b)1).Again, gcd(na1,nb1) divides (nax1)(nby1), and since na1 and nb1 are both 1(modn) while nby0(modn), we have gcd(nby,gcd(na1,nb1))=1. This means that gcd(na1,nb1) divides ngcd(a,b)1.
  3. Let a and b, with ab be roots of x23x50=0. Determine the value of a32014b2+2015.
    (Solved by Jarrett Ian G. Lim [Philippine Academy of Sakya], Joyce Heidi Ong [Chiang Kai Shek College], and Farrell Eldrian Wu [MGC New Life Christian Academy]; partial credit for Madeline Tee [Jubilee Christian Academy])

    SOLUTION: 
    This item has two solutions.

    The first is the more tedious one, where we actually get the values of a and b via the quadratic formula, and substitute them into the expression a32014b2+2015, but this is tedious.

    The second involves a more intricate approach. First, note that by Vieta's Theorem, we have a+b=3 and ab=50.

    LetA=a32014b2+2015andB=b32014a2+2015.Adding A and B, we get A+B=(a3+b3)2014(a2+b2)+2030The first expression can be factored as A+B=(a+b)(a2ab+b2)2014(a2+b2)+2030=(a+b)(a2+2ab+b23ab)2014(a2+2ab+b22ab)+2030=(a+b)[(a+b)23ab]2014[(a+b)22ab]+2030=3[32+150]2014(32+100)+2030=217,109.Similarly,AB=(a3b3)+2014(a2b2)=(ab)[(a+b)2ab+2014(a+b)].Now, where will we get the values of those expressions? We go back to the original equation which reads x23x50=0.

    It can be shown that the sum of the roots is 3, and the product of the roots is 50. Since ab, then ab is nonnegative, soab=+(ab)2=(a+b)24abThis means thata+b=3ab=50ab=209.This means thatAB=209[9+50+2014(3)]=6101209.Solving for A from A+B=217,109 and AB=6,101209, we getA=6,101209217,1092.
  4. Four spheres have radii 2, 2, 3, and 3 respectively. Each sphere is tangent to three others. There is another sphere which is tangent to all these four spheres. Determine the radius of this sphere. (China, 1995)
    (Solved by Jarrett Ian G. Lim [Philippine Academy of Sakya] and Farrell Eldrian Wu [MGC New Life Christian Academy]; partial credit for Madeline L. Tee [Jubilee Christian Academy])

    SOLUTION: 
    Let A and B be the centers of the two spheres of radius 2, and C and D the centers of the two spheres of radius 3. Let E and r be the center and radius of the sphere tangent to all others.

    In addition, let M and N be the midpoints of AB and CD, respectively.

    First, note that AC=AD=5, and N is the midpoint of CD. This means that AN is perpendicular to CD. So we can use Pythagorean Theorem on ANC to show thatAC2=AN2+NC252=AN2+32AN=4.Now, MN is perpendicular to AB, so we can again use Pythagorean Theorem on AMN and see thatAN2=AM2+MN242=22+MN2MN=12.Now we focus our attention to the sphere with center E. It must be noted that, for the sphere with center at E to be tangent to all four spheres, E must lie on the perpendicular bisecting planes of both AB and CD, which intersect at the line in which MN is a segment of. This means that E lies on segment MN.

    Moreover, EMA is a right triangle with AM=2 and AE=r+2. We now haveAE2=AM2+ME2(r+2)2=22+ME2ME=r2+4r.Similarly, working with EN, we will get EN=r2+6r. Since E lies on segment MN, we haveMN=ME+EN12=r2+4r+r2+6r(12r2+6r)2=(r2+4r)212212r2+72r+r2+6r=r2+4r2r+12=212r2+72r(r+6)2=(12r2+72r)2r2+12r+36=12r2+72r0=11r2+60r36r=611,6.This means that the radius of the sphere tangent to all others is 611.
  5. Let P1P2P12 be a regular dodecagon. Prove that P1P5, P4P8, and P3P6 are concurrent (they intersect at the same point). (23rd Putnam, 1963)
    (Partial credit for Jarrett Ian G. Lim [Philippine Academy of Sakya], Joyce Heidi Ong [Chiang Kai Shek College], Madeline Tee [Jubilee Christian Academy], Farrell Eldrian Wu [MGC New Life Christian Academy])

    SOLUTION: 
    Consider the added points Q1,Q2,,Q6 such that Q1Q2Q6 is a regular hexagon of the same length as the original dodecagon. This can be done by forming equilateral triangles P1P2Q1, P3P4Q2, and so on until P11P12Q6.



    Since the measure of one interior angle of a dodecagon is 150, then we can say that P3P2Q1=P5P4Q2==P1P12Q6=90. As a consequence, we have formed six squares in the process.

    We first focus our attention to the isosceles triangle P1Q1Q2. Here, note that the vertex angle is P1Q1Q2, which measures 150. This means that Q1Q2P1=Q1P1Q2=15.

    Since Q2P5 is the diagonal of square P4P5Q3Q2, we have P5Q2Q3=45. Of course we also have Q1Q2Q3=120.

    This means thatP1P2Q1+Q1Q2Q3+P5Q2Q3=180,which means that P1Q2P5 form a line segment. Similarly, we can say that P4Q3P8 form a line segment as well.

    Consequently, P1P5 and P4P8 intersect at the center of square P4P5Q3Q2.

    By our construction, P3P4Q2 and P5Q3P6 are symmetric with respect to the center of square P4P5Q3Q2. As such, P3P6 serves as the perpendicular bisector of P4Q2 and P5Q3. This means that indeed, P3P6 must pass through the center of square P4P5Q3Q2 as well.

    So P1P5, P4P8, and P3P6 intersect at the same point.
  6. For each permutation a1,a2,,a2014 of the integers 1,2,,2014, form the sum1007i=1|a2i1a2i|.Find the average value of all these sums. (American Invitational Mathematics Examination, 1996)
    (Solved by Jarrett Ian G. Lim [Philippine Academy of Sakya] and Farrell Eldrian Wu [MGC New Life Christian Academy]; partial credit for Madeline Tee [Jubilee Christian Academy])

    SOLUTION: 
    We first focus on the average value of A=|a1a2|. Since we are just taking permutations, then we can just take the average value of A, because this average value will be the same for all |a2i1a2i|, so the average that we want is just 1007 times the average value of A.

    Now, if a1=x, x{1,2,,2014}, then the average value of A is just(x1)+(x2)++1+1+2++(2014x)2013=12013[x(x1)2+(2014x)(2015x)2]=x22015x+1007(2015)2013.Now we need to take the sum of all possible average values when we vary the value of a1. This means that we have120142014x=1x22015x+1007(2015)2013=1201412013[2014(2015)(2031)62015(2014)(2015)2+1007(2014)(2015)]=1201412013[2014(2015)(2029)632015(2014)(2015)6+32014(2014)(2015)6]=20153.This means that the average value of all possible sums is just (1007)(2015)3.

Saturday, October 10, 2015

Sigma Graph Coloring (Tuklas Vol. 17, No. 4 - October 10, 2015)

SIGMA GRAPH COLORING

Figure 1: Graph G with 3 vertices and 3 edges

Consider the figure above. This object is called a graph, and it can be used to represent a wide variety of things. For instance, it can be used to represent a road network or a map of social relationships. A graph consists of a set of points called vertices, where some pairs of vertices are joined by a line called an edge. We say that the two vertices v1 and v2 are adjacent if there is an edge e that connects v1 and v2.

For any vertex v in G, the neighbor of v is the vertex adjacent to v. Looking back at the previous illustration, the vertex v1 in G has neighbors v2 and v3.

Mathematicians like to play around with graphs, because the properties that can be derived are very interesting. For instance, a vertex coloring of a graph G refers to an assignment of colors to the vertices in G. We use positive integers 1,2,,n as colors to be assigned to the vertices in a graph. Why does it have to be positive integers? It is because we are interested in the number of colors to be used. A color sum of a vertex v is the sum of the colors of the neighbors of v. We write the color sum of v as σ (sigma).

Moreover, a vertex coloring is a sigma coloring if for any two neighboring vertices, their color sums are not equal. In fact, a graph G has sigma k-coloring if we can assign at most k colors to the vertices in G.

In the figure below, consider a graph with a sigma 3-coloring. We choose the colors 2, 5, and 9 to label the vertices in G.

Figure 2: Sigma 3-coloring of G

Indeed, as shown above, any two neighboring vertices in G have distinct color sums. Therefore, G has sigma 3-coloring.

Notice that the following figure shows sigma colorings of the graph H with 6 vertices and 6 edges.

Figure 3: Examples of sigma colorings

We note that any graph with n vertices has a sigma k-coloring where, 1kn. However, what if we want to determine the smallest number of colors needed in a sigma coloring of a graph? In other words, we want to find the smallest value of k in which a sigma k-coloring exists. The minimum value of k is called the sigma chromatic number of a graph. Now, we recall the graph H above. Suppose we assign all the vertices in H with only one color, say, 7.

Figure 4: Not a sigma coloring of H

The illustration above shows that H has no sigma 1-coloring because any two neighboring vertices have equal color sum. Since H is sigma 2-colorable (as shown in (d) above), the sigma chromatic number of H is 2.

Mathematicians have been fascinated by sigma graph coloring not just because of the math involved, but also because of the real life applications. Some of these involve club scheduling problems and hospital planning. Can you think of an application of sigma graph coloring in daily life?

ABOUT THE AUTHOR:
Maria Czarina T. Lagura is a graduate assistant at the Ateneo de Manila University. She is currently taking her Master's degree at the same university.

REFERENCES:
Chartrand, G. and P. Zhang. Chromatic Graph Theory, Chapman & Hall/CRC Press, Boca Raton, FL (2009).
Chartrand, G., F. Okamoto, P. Zhang. "The Sigma Chromatic Number of a Graph." Graph and Combinatorics, 26:755-773 (2010).

OLYMPIAD CORNER
from the 51st International Mathematical Olympiad, 2010 (Shortlist)

Problem:  Given six positive numbers a, b, c, d, e, f such that a<b<c<d<e<f. Let a+c+e=S and b+d+f=T. Prove that 2ST>3(S+T)(S(bd+bf+df)+T(ac+ae+ce))

Solution: 
Let α=ac+ce+ae and β=bd+bf+df. Moreover, let s=c+e, hence a=Ss.

Note that(cb)(cd)+(ef)(ed)+(ef)(cb)<0,since each summand is negative. We can rewrite this as(bd+bf+df)(ac+ce+ae)<(c+e)(b+d+face)βα<s(TS)and we haveSβ+Tα=S(βα)+(S+T)α<Ss(TS)+(S+T)(ce+as)Since (ce)20, then (c+e)24ce.

Hence,Sβ+Tα<Ss(TS)+(S+T)(s24+(Ss)s)=s(2ST34(S+T)s)Using the AM-GM Inequality, we have34(S+T)(Sβ+Tα)<34(S+T)s(2ST34(S+T)s)34(S+T)s+2ST34(S+T)s2=STTherefore2ST>3(S+T)(S(bd+bf+df)+T(ac+ae+ce)).

PROBLEMS
  1. Determine the values of x such that 2x+3x4x+6x9x1.
  2. Let n, a, and b be positive integers. Prove thatgcd(na1,nb1)=ngcd(a,b)1.
  3. Let a and b, with ab be roots of x23x50=0. Determine the value of a32014b2+2015.
  4. Four spheres have radii 2, 2, 3, and 3 respectively. Each sphere is tangent to three others. There is another sphere which is tangent to all these four spheres. Determine the radius of this sphere.
  5. Let P1P2P12 be a regular dodecagon. Prove that P1P5, P4P8, and P3P6 are concurrent (they intersect at the same point).
  6. For each permutation a1,a2,,a2014 of the integers 1,2,,2014, form the sum1007i=1|a2i1a2i|.Find the average value of all these sums.
We welcome readers to submit solutions to the problems posed below for publication consideration. Solutions may ONLY be submitted online via vantonio1992@gmail.com. Solutions must be preceded by the solver's name, school affiliation and year level. The deadline for submission is 12:30 PM October 24, 2015.

SOLUTIONS
(for October 3, 2015)
  1. Find all values of x, y, and z such that x4+y4+z44xyz=1.
    (Solved by Jarrett Ian G. Lim [Philippine Academy of Sakya], Joyce Heidi Ong [Chiang Kai Shek College], Ryan Jericho Sy [Chang Kai Shek College], Farrell Eldrian Wu [MGC New Life Christian Academy]; partial credit for Madeline L. Tee [Jubilee Christian Academy])

    SOLUTION: 
    We have1=x4+y4+z44xyz0=x4+y4+z44xyz+1=(x42x2y2+y4)+(z42z2+1)4xyz+2x2y2+2z2=(x42x2y2+y4)+(z42z2+1)+2(x2y22xyz+z2)0=(x2y2)2+(z21)2+2(xyz)2.We have a sum of squares that is equal to 0. This means that each of the terms must be equal to 0. From the second term, we havez=±1,while from the first and the third we get x=±y and xy=z. This means that if z=1, we have the solutions (±1,±1,1), while for z=1, we have (±1,1,1).
  2. Find the number of ways to color 2015 points on a circle with 3 colors such that any two neighboring points have distinct colors.
    (Partial credit for Jarrett Ian G. Lim [Philippine Academy of Sakya], Joyce Heidi Ong [Chiang Kai Shek College], Farrell Eldrian Wu [MGC New Life Christian Academy])

    SOLUTION: 
    Let us solve the problem for a general number of points n. Define an to be the number of possible colorings for n points on the circle.

    Note that if n=1, then a1=3.

    If n=2, then the first point will have three color options, while the other will have one less option. We have 32=6 possible colorings, and consequently, a2=6.

    Now, if n=3, then the first point will have three options, its neighboring point (to the right) will have 2 options, while the remaining point will only have 1. This means that a3=321=6.

    On the other hand, if we have n+1 points. Consider the neighbors of the last added point. We have one color used for the last point. We have two cases.

    Consider the case when the two neighbors of this last added point has the same color. If its two neighbors share the same color, then the last added point will have two color options.

    Moreover, since the two neighbors have the same color, then we can just consider it as a single point, which means that, along with the rest of the points, the n points can be colored an1 ways. This means, in this first case, we have 2an1 colorings.

    On the other hand, if the two neighbors have different colors, then, there will only be one option for the last added point. Moreover, the rest of the n points can be colored an ways.

    This means that we have the recurrence relation (RR)an+1=an+2an1,where a1=2 and a2=a3=6. The closed form of this RR is given byan=2n+2(1)n.If we check, we havean+2an1=[2n+2(1)n]+2[2n1+2(1)n1]=2n+2(1)n+2n4(1)n=2n+1+2(1)n+1=an+1.Finally, we just need to obtain a2015=220152 colorings.
  3. Determine the minimum value of a+b+cba given that for all x, ax2+bx+c0 and a<b.
    (Partial credit for Joyce Heidi Ong [Chang Kai Shek College], Ryan Jericho Sy [Chang Kai Shek College], Farrell Eldrian Wu [MGC New Life Christian Academy])

    SOLUTION: 
    First, note that if a=0, then ax2+bx+c=bx+c. Since 0=a<b, then b is positive, thenbx+c0xcb.This means that the condition ax2+bx+c0 is only true when xcb.

    This means that a0.

    We now have0ax2+bx+c=a(x2+bax+b24a2)+4acb24a=a(x+b2a)2+4acb24a.If a<0, then we can solve for x such that the inequality is true, and certainly, the solution set of this inequality will not be R, in fact, the values will have to be in the interval[bb24ac2a,b+b24ac2a].This means that a>0.

    Now, if we want this expression to to always be greater than 0, then the right side of the expression must be nonnegative as well. We must have 4acb20. This means that cb24a.

    Since we are trying to minimize a+b+cba, we can minimize c first. So we just getc=b24a. Now,a+b+cba=a+b+b24aba=4a2+4ab+b24a(ba)=(9a2+6ab6a2)+(b22ab+a2)4a(ba)=9a2+6a(ba)+(ba)24a(ba)=9a4(ba)+32+(ba)4a.Note that ba and a are both positive. By AM-GM, we have9a4(ba)+(ba)4a29a4(ba)(ba)4a=32.This means that the minimum value of a+b+cba is 32+32=3. This can be achieved when a=1 and b=4, and c=4. Note that in this case, we have x2+4x+4=(x+2)20, so the condition is still satisfied.

Saturday, October 3, 2015

The Latin Square (Tuklas Vol. 17, No. 3 - October 3, 2015)

THE LATIN SQUARE

     
     
     

Try filling the table above with numbers from the set {1,2,3} such that no row or column contains the same number. If you have played Sudoku, you might be familiar with solving puzzles like this. These square puzzles, called Latin squares, have been around for a long time, even predating the game of Sudoku.

What is a Latin square? A Latin square of order n is an n×n array of n symbols such that each symbol appears exactly once in each row and once in each column. If you were able to solve the above square, the result is a Latin square of order 3. Here are some examples of Latin squares:

DACB
BCAD
CDBA
ABDC
,   
123456
214365
351624
462531
546213
635142

Figure 1: Latin squares of order 4 and 6.

Why is it called a “Latin” square? Though Latin squares had already existed for centuries, it was Leonhard Euler who formally introduced and developed the concept in 1783 in his mathematical paper entitled "nouveau espece de carres magiques". He used Latin characters as symbols, hence the term Latin square. Of course nowadays, one can use any other symbol to form a Latin square and not just Latin characters.

One of the most interesting problems involving a Latin square is the search for its orthogonal mate. It was again Euler who developed the definition of orthogonal Latin squares. Two Latin squares are said to be orthogonal if when superimposed (joined together), the ordered pairs in the resulting array are all distinct. When two Latin squares are orthogonal to each other, we call them orthogonal mates. Figure 2 shows two orthogonal Latin squares and their join. Notice that the elements of their join are all distinct.

A=
1234
2143
3412
4321
, B=
1234
4321
2143
3412
 
(A,B)=
(1,1)(2,2)(3,3)(4,4)
(2,4)(1,3)(4,2)(3,1)
(3,2)(4,1)(1,4)(2,3)
(4,3)(3,4)(2,1)(1,2)

Figure 2: 2 orthogonal Latin squares, denoted by A and B, alongside their join.

A classic problem that involves orthogonal Latin squares is the problem posted by Euler involving 36 officers from six ranks and six regiments. He claimed that it was impossible to arrange six regiments, each with six ranks (a colonel, a lieutenant-colonel, a major, a captain, a lieutenant, and a sub-lieutenant) in a 6×6 array such that no row or column duplicates a rank or a regiment. Thus, a Latin square of order 6 does not have an orthogonal mate. Notice also that a Latin square of order 2 does not have an orthogonal mate either (see Figure 3).

AB
BA
,   
BA
AB

Figure 3: The two Latin squares of order 2.

With this, Euler conjectured that no orthogonal Latin squares of order 6 exist, nor does one exist whenever n2mod4. This remained as a conjecture for more than 100 years until 1960 when Bose, Shrikhande and Parker disproved it.

A Latin square has so many important characteristics and it has many applications in science, engineering, statistics and most importantly, in mathematics. In abstract algebra, Cayley tables of quasigroups and the addition table of integers modulo n form a Latin square. In statistics, the Latin square design is used to control (or eliminate) two sources of the variation in an experiment. In coding theory, a pair of orthogonal Latin squares is used to design new techniques for what we call error correcting codes. In games, Sudoku puzzles and magic square puzzles are just two of most famous games that is related to Latin squares.

There are still so many things to discover about Latin squares - from construction, existence of orthogonal mate, up to the discovery of new applications. For instance, if we define a transversal of a Latin square as a set of entries which includes exactly one entry from each row and column and one of each symbol, one may try to solve this conjecture: Every Latin square of odd order has a transversal. Enjoy!

ABOUT THE AUTHOR:
Pejie Santillan is a graduate assistant at the Ateneo de Manila University. He is currently taking his Master's degree at the same university.

REFERENCES:
- Colbourn, C. J. and J. H. Dinitz (eds). The CRC Handbook of Combinatorial Designs, CRC Press, Boca Raton (1996).
- Arshaduzzaman, Md. "Connections between Latin Squares and Geometries." IOSR Journal of Mathematics, Vol. 9, Issue 5 (Jan 2014). Retrieved from http://iosrjournals.org/iosr-jm/papers/Vol9-issue5/C0951419.pdf.
- Waheed, Ali Makki Sagheer Makarim. Error-Correcting Code using Latin Square, Abdul-Jabbar. College of Computer, University of Anbar.

OLYMPIAD CORNER
from the Italian Mathematical Olympiad, 2014

Problem:  Prove that there exists a positive integer that can be written as a sum of 2015 distinct 2014th powers of positive integers x1<x2<<x2015 in at least two ways.

Solution: 
Let N be a positive integer greater than or equal to 2015. The total number of increasing sequences x1<x2<<x2015 such that all the terms are less than or equal to N is given by (N2015). For each of these sequences, we find the sum x20141+x20142++x20142015. Since x2014iN2014 for any i=1,2,...,2015, it follows that the sum will be at most 2015N2014. Therefore we have (N2015) sums, and each sum is less than or equal to 2015N2014.

We claim that (N2015)>2015N2014 for some N. Note that (N2015)=N(N1)(N2)(N2014)2015!Also considerP(N)=N(N1)(N2)(N2014)2015!2015N2014Since P(N) is a 2015th degree polynomial with a positive leading coefficient, its graph will just conitinue to move up as N goes to infinity. This means there exists an N large enough so that P(N)>0(N2015)>2015N2014.

Since there are more sums compared to the possible values for the sum, then by the pigeonhole principle, there exists at least two sums that have the same value.

PROBLEMS
  1. Find all values of x, y, and z such that x4+y4+z44xyz=1.
  2. Find the number of ways to color 2015 points on a circle with 3 colors such that any two neighboring points have distinct colors.
  3. Determine the minimum value of a+b+cba given that for all x, ax2+bx+c0 and a<b.
We welcome readers to submit solutions to the problems posed below for publication consideration. Solutions may ONLY be submitted online via vantonio1992@gmail.com. Solutions must be preceded by the solver's name, school affiliation and year level. The deadline for submission is 12:30 PM October 10, 2015.

SOLUTIONS
(for September 26, 2015)
  1. Let ABCD be a rhombus with A=120 and P is any point on the plane of the rhombus. Prove that PA+PC>BD2.(Taken from Mathematical Olympiad Challenges (2nd Edition) by Andreescu and Gelca)
    (solved by Jarrett Ian G. Lim [Philippine Academy of Sakya], Carissa Elaine Varon [Falcon School], and Steven Reyes [Saint Jude Catholic School]; partial credit for Joyce Heidi Ong [Chiang Kai Shek College], Madeline L. Tee [Chiang Kai Shek College], and Trisha T. Huang [Chiang Kai Shek College])

    SOLUTION: 
    First we prove that given an equilateral triangle XYZ and a point P not on the circumcircle of this triangle, then we can construct a triangle with side lengths PX, PY, and PZ. Moreover, if P lies on the circumcircle, then one of the three lengths is equal to the other two. This is known as Pompeiu's Theorem.

    We rotate XYZ 60 clockwise around X. As seen in the figure below, Y will rotate to Z, while Z and P are images of Z and P, respectively.


    Note that PZ is the image of PY. In addition, because of the rotation, PPX is an equilateral triangle, so PP is actually equal to PX. This means that we have PPZ as our desired triangle.

    If, on the other hand, P is on the circumcircle, then we have the following figure.


    Note that if P is on arc XZ, then ZPX is 120. Because PPX is equilateral, then XPP=60, which means that ZPX and XPP form a linear pair, so Z, P, and P are collinear.

    If P is on arc XYZ, then ZPX=60, and PPX is also 60 because PPX is equilateral. This means that Z, P, and P are collinear.

    This means that if P is on arc XZ, then ZP=ZP+PPYP=ZP+XP.If P is on arc XY, then ZP=ZP+PPZP=YP+XP.And finally, if P is on arc YZ, then PP=PZ+ZPXP=ZP+YP.Now we go to the problem. Since A=120, then B must be 60. This means that ABC is isosceles with vertex angle 60, which implies that ABC is equilateral.

    If P is not on the circumcircle of ABC, then a triangle exists with sides PA, PB, and PC. This means that, by triangle inequality, PA+PC>PB. On the other hand, if P is on the circumcircle of ABC, then PA+PC=PB. In general, we have PA+PCPB.

    Similarly, since ACD is also equilateral, we can have PA+PCPD. Combining the two inequalities, we have 2(PA+PC)PB+PD.Considering PBD, PB+PD>BD. This means that 2(PA+PC)>BDPA+PC>BD2.

  2. Determine, with proof, the values of a, b, and c, with a<b<c such that 2015=a2+b2c2. (Modified problem from 104 Number Theory Problems by Andreescu, Andrica, and Zuming)
    (solved by Jarrett Ian G. Lim [Philippine Academy of Sakya], Steven Reyes [Saint Jude Catholic School]; partial credit for Joyce Heidi Ong [Chiang Kai Shek College], Ryan Jericho Sy [Chiang Kai Shek College], and Madeline L. Tee [Chiang Kai Shek College])

    SOLUTION: 
    We prove this for a general value of n and divide the solution into cases depending on the parity of n.

    If n is even, then n=2k for some k. Note that if n=0, then we have0=a2+b2c2c2=a2+b2,which means any Pythagorean Triple will hold. On the other hand, if n=2, then2=52+112122.Now, if n4, that is, if k>1, then5k1>4k1and1<k3k+1<4k3k<4k1.So now we have candidates a=3k, b=4k1, and c=5k1. It can be verified that(3k)2+(4k1)2(5k1)2=2k=n.Now we consider the case when n is odd, then it can be written in the form n=2k+1. First, if n=1, then1=42+72823=42+62725=42+52627=62+142152.Now, if k>3 (that is, n>7), then, first,5k4>4k4. In addition,k>34k>3k+34k4>3k1.This means that our candidates are a=3k1, b=4k4, and c=5k4. It can be verified that(3k1)2+(4k4)2(5k4)2=2k+1=n.In that case, if n=2015=2(1007)+1, then we just have a=3020, b=4024, and c=5031.
  3. Consider 2015 nonnegative real numbers a1,a2,,a2015 whose sum is 1. Define M to be M=max1kak+ak+1+ak+2.As ak varies, M varies as well. Find the minimum value of M. (Canadian Mathematical Olympiad, 1981)
    (partial credit for Jarrett Ian G. Lim [Philippine Academy of Sakya], Joyce Heidi Ong [Chiang Kai Shek College], and Madeline L. Tee [Chiang Kai Shek College])

    SOLUTION: 
    Since all the numbers are nonnegative, then a1a1+a2a1+a2+a3 and a7a6+a7a5+a6+a7. This means thatM=max1k2009ak+ak+1+ak+2.=max1k2009{a1,a1+a2,a7,a6+a7,ak+ak+1+ak+2}.If we add all the 2013 quantities, we havesum=a1+(a1+a2)+(a1+a2+a3)+(a1+a2+a3+a4)+(a2009+a2010+a2011)+(a2009+a2010+a2011)+(a2009+a2010+a2011)=3(a1+a2++a2011)=3.We then take the average of these 2013 quantities, and we get 32013=1671. If all the quantities are less than 1671, then the average should be less than 1671. This means that at least one quantitity should be greater than or equal to 1671. This implies that M1671. This means that the minimum value for M is 1671. Specifically, this can be achieved by letting ak=1671 if k1mod3, and ak=0 otherwise.
ERRATA
Madeline L. Tee [Chang Kai Shek College] and Carissa Elaine Varon [Falcon School] were not properly recognized for their solutions to the questions from the previous weeks. The previous issues have been updated to rectify this error.