Search code examples
roptimizationminimizationnlm

nlm with multiple variables in R


I am trying to use nlm() to minimize the SSE in a function. I'm having trouble with they syntax and getting nlm() to provide estimates for both variables. Eventually t_1, t_2, and t_3 will be values pulled from a data.frame but I just assigned them numbers until I could get the nlm() to work. I have attempted solutions from these threads there and there, with no luck:

t_1 <- 1.91
t_2 <- 3.23
t_3 <- 4.20

fun <- function(s,y){
  (10 - s*(t_1-y + y*exp(-t_1/y)))^2+
    (20 - s*(t_2-y + y*exp(-t_2/y)))^2+
    (30 - s*(t_3-y + y*exp(-t_3/y)))^2
}

## Testing the function
fun(9.57,1.13)
[1] 0.9342627

I have tried multiple approaches for my nlm syntax. With 2 varaibles I believe I have to insert an array for p but when I've tried that it hasn't worked. None of these solutions below have worked:

# Attempt 1
p = array(c( 1,0), dim=c(2,1) ) 
ans <- nlm(fun, p)

# Attempt 2
ans <- nlm(fun, c( 0.1, 0.1))

# The first two return a "Error in f(x, ...) : argument "y" is missing, with no default"
# Attempt 3 returns a "invalid function value in 'nlm' optimizer" error

ans <- nlm(fun, c( 0.1, 0.1), y = c(1,1))

I'm sure there are several errors in my code but I'm not sure where. This task is more complex than I've attempted before as I'm relatively new to R.


Solution

  • If you look closely to the nlm function. It asks only one argument. One solution is :

    fun <- function(x){
      s <- x[1]
      y <- x[2]
      (10 - s*(t_1-y + y*exp(-t_1/y)))^2+
        (20 - s*(t_2-y + y*exp(-t_2/y)))^2+
        (30 - s*(t_3-y + y*exp(-t_3/y)))^2
    }
    
    p <- array(c(0.4, 0.4), dim = c(2, 1))
    # p <- c(0.4, 0.4)
    ans <- nlm(f = fun, p = p)
    

    Both vector or array work however you can't give two arguments like you did.

    EDIT

    In numerical optimization the initial point is really important. I advise you to use optim function which is less sensitive of misspecification of initial point.

    One idea is to do like this, you make a grid of many initial points and you select the one which give you the best result :

    initialisation <- expand.grid(seq(1, 3, 0.5),
                                  seq(1, 3, 0.5))
    res <- data.frame(optim = rep(0, nrow(initialisation)),
                      nlm = rep(0, nrow(initialisation)))
    
    for(i in 1:nrow(initialisation)){
      res[i, 1] <- optim(as.numeric(initialisation[i, ]), fun)$value
      res[i, 2] <- try(nlm(f = fun, p = as.numeric(initialisation[i, ]))$minimum, silent = T)
    }
    res
    

    I insist that with the example above the optim function is realy more stable. I advise you to use it if you have no other constrains.

    You can check function parameters thanks to ?nlm.

    I hope it helps.

    EDIT 2

    fun <- function(x){
      s <- x[1]
      y <- x[2]
      (10 - s*(t_1-y + y*exp (-t_1/y)))^2+
        (20 - s*(t_2-y + y*exp(-t_2/y)))^2+
        (30 - s*(t_3-y + y*exp(-t_3/y)))^2
    }
    

    I choose this initial point because it seems nearer to the optimal one.

    p <- c(10, 1)
    
    ans <- nlm(f = fun, p = p)
    

    You can obtain your two parameters like this : s is :

    s <- ans$estimate[1]
    

    y is :

    y <- ans$estimate[2]
    

    You also have the optimal value which is :

    ans$minimum :
    0.9337047
    fun(c(s, y)) :
    0.9337047
    

    My second post, the edit is just their to alight the fact that optimisation with nlm function is a bit tricky because you need ta carefully choose initial value.

    The optim also an optimisation function for R is more stable as in the example I give with many initialization points.

    expand.grid function is useful to obtain a grid like this :

    initialisation <- expand.grid(s = seq(2, 3, 0.5),
                                    y = seq(2, 3, 0.5))
    
    initialisation :
    
       s   y
    1 2.0 2.0
    2 2.5 2.0
    3 3.0 2.0
    4 2.0 2.5
    5 2.5 2.5
    6 3.0 2.5
    7 2.0 3.0
    8 2.5 3.0
    9 3.0 3.0
    

    res data.frame gives you the minimum obtain with different initials values. You see that the first initials values give you no good result for nlm but relatively stable one for optim.

    res <- data.frame(optim = rep(0, nrow(initialisation)),
                      nlm = rep(0, nrow(initialisation)))
    
    for(i in 1:nrow(initialisation)){
      res[i, 1] <- optim(as.numeric(initialisation[i, ]), fun)$value
      res[i, 2] <- if(is.numeric(try(nlm(f = fun, p = as.numeric(initialisation[i, ]))$minimum, silent = T)) == T){
        round(nlm(f = fun, p = as.numeric(initialisation[i, ]))$minimum, 8)
      }else{
        NA
      }
    }
    

    try function is just their to avoid the loop to break. The if is to put NA at the right place.

    res :
    optim         nlm
    1 0.9337094        <NA>
    2 0.9337058  0.93370468
    3 0.9337054        <NA>
    4 0.9337101  0.93370468
    5 0.9337125 61.18166446
    6 0.9337057  0.93370468
    7 0.9337120  0.93370468
    8 0.9337080  0.93370468
    9 0.9337114  0.93370468
    

    When there is NA values it meens that nlm doesn't work well because of initialization. I advise you to choose optim if you don't need really precise optimisation because of its stability.

    To an extensive discussion on optim vs nlm, you may have a look their. In your specific case optim seems to be a better choice. I don't know if we could generalise a bit.