The Newton's method
Newton's method generates a sequence to find the root of a function starting from an initial guess . This initial guess should be close enough to the root for the convergence to be guaranteed. We construct the tangent of at and we find an approximation of by computing the root of the tangent. Repeating this iterative process we obtain the sequence .
Derivation of Newton's Method
editApproximating with a second order Taylor expansion around ,
with between and . Imposing and recalling that , with a little rearranging we obtain
Neglecting the last term, we find an approximation of which we shall call . We now have an iteration which can be used to find successively more precise approximations of :
Newton's method :
Convergence Analysis
editIt's clear from the derivation that the error of Newton's method is given by
Newton's method error formula:
From this we note that if the method converges, then the order of convergence is 2. On the other hand, the convergence of Newton's method depends on the initial guess .
The following theorem holds
Theorem
Assume that and are continuous in neighborhood of the root and that . Then, taken close enough to , the sequence , with , defined by the Newton's method converges to . Moreover the order of convergence is , as
Advantages and Disadvantages of the Newton-Raphson Method
editAdvantages of using Newton's method to approximate a root rest primarily in its rate of convergence. When the method converges, it does so quadratically. Also, the method is very simple to apply and has great local convergence.
The disadvantages of using this method are numerous. First of all, it is not guaranteed that Newton's method will converge if we select an that is too far from the exact root. Likewise, if our tangent line becomes parallel or almost parallel to the x-axis, we are not guaranteed convergence with the use of this method. Also, because we have two functions to evaluate with each iteration ( and , this method is computationally expensive. Another disadvantage is that we must have a functional representation of the derivative of our function, which is not always possible if we working only from given data.