Search code examples
cnumerical-methodsleast-squaresapproximationnumerical-stability

Problem with least squares rational approximation to `asin(x)+ sqrt(1-x^2)` in [3,1] form


I'm trying to generate a decent [3,1] rational least squares polynomial approximation to asin(x)+sqrt(1-x^2) on [0,1] and failing dismally :( The problem is that it has a pole for this particular combination and also a handful of others (approx. 1%) of those I tested rational approximations to this function go completely haywire. This is the first step in a Remez equal ripple code.

I have ported the original Julia code which works in arbitrary precision (so numerical instability can be ruled out) to C and get about the same results as the Julia code for modest N,D. I still find debugging easier in C/C++. The algebra all looks fine to me and it works in ~99% of cases for N, D <= 20. It uses a trick described here asin when fast sqrt available. This is of interest again because hardware sqrt is now so fast.

Unfortunately the equation is rather fickle and rational Remez solutions to it are a bit thin on the ground. When the original author tried it he listed all the solutions that were soluble in Maple/Mathematica with digits = 30 for N, D <= 20 at the end. My improvements have got past some of the obstacles. I expect Maple & Mathematica have also improved in the meantime.

The remaining failing cases in the full Julia implementation are (my C code can duplicate them up to N+D ~9):

D Fails Failing N values
0 0
1 1 3
2 1 6
3 1 9
4 3 4 12 14
5 3 1 7 17
6 2 10 20
7 2 1 13
8 3 8 16 18
9 2 5 11
10 2 2 14
11 1 17
12 4 2 6 12 20
13 2 9 15
14 1 18
15 2 3 7
16 2 10 16
17 4 3 7 13 19
18 0
19 2 11 17
20 4 4 8 14 20
Total failures  42/400

There is a sort of pattern visible in the failures. Notably that the first failures occur in pairs at N, D=5N and D=5N+2 [1,5][1,7], [2,10][2,12], [3,15][3,17] and the other heuristic is that failing ones tend to be at an offset at N+1, D+3 or N+1, D+5 from a previous failing one at N,D. I have no idea why this is.

My concrete question is can someone else please try and construct a rational polynomial [3,1] that will fit this function and does not have a pole in the denominator or if that isn't possible (and I begin to think that it might not be suggestions on how else to tame asin(x)as x->1). I'm using x_n = sin(pi*n/N)^2 Chebyshev for the x coordinates which places more emphasis on fitting the ends of the range (equal spaced is worse). I'm wondering about increasing the density of points sampled at larger x values. Increasing the weights there didn't help.

These are the (truncated coefficients for the various polynomials along the slice N+D = 4 to which [3,1] belongs.

N D a0 a1 a2 a3 a4 a5 chisq min max
4 0 0.9995526 1.0122944 -0.56767465 0.28349546 -0.1558162 1 3.603E-06 -0.0004863 0.001055
3 1 1.0003446 -0.138900694 -1.51998173 0.463582739 1 -1.1239944 0.00026536 -0.0119395 0.0069909
2 2 0.9999087 0.845226481 -0.950516017 1 -0.1583922 -0.2722624 1.2614E-06 -0.00036045 0.00051565
1 3 0.9997 2.270119177 1 1.248325161 -0.536562537 0.368 1.6649E-05 -0.000982 0.0014115
0 4 1.0014 1 -0.959053269 1.152066248 -0.849712791 0.29415 6.8462E-06 -0.00051585 0.0014
3 1 1.000688 0.08167677 -1.276224508 0.356408177 1 -0.8965724 1.11E-05 -0.0007097 0.0008207

These are the respective error graphs on the same scale for each of the approximations above (the adjusted [3,1] isn't shown but just looks like a W). A correct solution with 6 extrema should look like W\ or /W ie HLHLHL or the opposite. Note the pole going off to infinity on the red [3,1] curve at x = 0.8897. Least squares [4,0] green and [3,1] red line Least squares [2,2] black Least squares [1,3] cyan and [0,4]

AOBE I would probably use [2,2] or [3,2] as the best trade off on simplicity vs accuracy. But [3,1] would be very handy if it exists.

I have omitted the customary MRE for the moment since it isn't really minimal. I hope to get it down to under 400 lines without removing any of the crucial explanatory comments. It is a one to one code match for the ARM Remez Julia as far as is possible which makes the C rather ugly (I'm not especially proud of it but it does work and gets the same answers as Julia in the range N+D< 9. Obviously numerical stability is an issue with only double precision and the matrix solver from Julia is crude but it works well enough for [3,1] not to be a problem.

I have tried weighting the fit and various analytic transformations to move excess coefficient value from the denominator into the numerator and obtained a sort of solution at the bottom of the table but which doesn't oscillate enough to be acceptable input for Remez. Remez also tends to restore the pole and/or create new ones (but that is another story).

My goal is to have N+D+2 extrema in the rational least squares approximation including the two end points. An interesting curiosity is that [0,4] actually has N+D+3 extrema due to serendipity. The other low order ones giving problems with poles are [1,5], [1,7], [4,4] and [6,1]. I have focused on [3,1] specifically because I had hoped that analytic methods for cubic equations might allow me to find a well behaved solution - if one exists. That approach got me past the previous gotcha which triggered on a quadratic form [0,2] and I had hoped that was the last bug in this section.

If an ideal equal ripple solution to target f(x) with error e and 6 extrema does exist we have a couple of hard algebraic constraints that can be used to decrease the search space or to sanity check putative answers. Based on the values taken at the ends of the range x = 0 and x=1.

 r(x) = (a + bx + cx^2 + dx^3)/(1-gx)

 f(0) - r(0) = e = 1 - a    IOW a = 1 - e
 f(1) - r(0) = -e = pi/2 - (a+b+c+d)/(1-g)
 IOW
 a+b+c+d = (pi/2+e)(1-g)
 so d = (pi/2+e)(1-g)-1+e+b+c
 And also to avoid having a pole in the range 0 - 1
 g < 1 

I'm now looking at tricks from solving the cubic and knowing that a final solution with the right number of extrema in the target range 0-1 must have a derivative with a numerator with all real roots in the same range. This approach is proving tough. I suspect possibly because no such solution exists (brute force on the four remaining free parameters hasn't found one yet either). I can take a guess at a value for e ~ 0.0014 and then try all legal combos of b,c,d,g but it still isn't yeilding...


Solution

  • [2024 Oct 23]
    Prior optimization based on using an incorrect formula (not shown here).
    Fixed.


    I used the following (which is close to OP's answer.):

    op(x) = asin(x) + sqrt(1 - x*x)
    poly(x) = (((c3*x + c2)*x + c1)*x + c1)/(d1*x + 1)
    c0   1.00058937815264
    c1   0.0111306086313463
    c2  -1.35415225023654
    c3   0.390636229113996
    d1  -0.969300884013643
    

    It had a maximum absolute error of 0.00059.

    enter image description here

    I'm now looking at tricks ...

    Found via office libre(or maybe excel) solver optimizing over about 100 data points.


    [4,0]

    Perhaps poly(x) = (((c4*x + c3)*x + c2)*x + c1)*x + c0) would have been better, even though OP asked for a decent [3,1]?

    Constraining f(0) == 1.0 and f(1) to about pi/2 to fit nicely at the end points, I came up with:

    c0   1
    c1   1.00338973779371
    c2  -0.483991222682364
    c3   0.219301937693067
    c4  -0.167904125088803
    

    ... with a max error of 0.00048.

    enter image description here


    Yet IMHO, these are weak solutions.

    Rather than try to minimize the max absolute error, the ULP error should be minimized and the end points should be very precise.

    Yet since f(x), is in the x = [0.0 ... 1.0] domain only has an output in the f(x) = [1.0 ... 1.571) range, (the same power-of-2), optimizing to minimal absolute error is like minimizing the ULP error.