A similar question was asked here but it didn't receive any answers.
I have a function that I need to minimize. My function has 3 parameters, x, y, and w
. Given w
, I need to find the optimal pair of x and y
that minimizes my function. However, x and y
are restricted in the interval (-0.5 ,0.5]
I tried to use the optim
function and it works when I don't have restrictions. However, when I apply the restrictions and use the L-BFGS-B method, I get an error. I cant figure out why I am getting this error.
Below is an example of my function and some of the methods I have tried. This is the function that Im trying to minimize:
fn <- function(par, w) {
x <- par[1]
y <- par[2]
p1 <- w^x * (w + 1) * beta(x + 1, w + 1 - x)
p2 <- w^y * (w + 1) * beta(y + 1, w + 1 - y)
p3 <- w^(2 * x) * (w + 1) * (beta(2 * x + 1, w + 1 - 2 * x) - (w + 1) * (beta(x + 1, w + 1 - x)^2))
p4 <- w^(2 * y) * (w + 1) * (beta(2 * y + 1, w + 1 - 2 * y) - (w + 1) * (beta(y + 1, w + 1 - y)^2))
p5 <- w^(y + x) * (w + 1) * (beta(y + x + 1, w + 1 - y - x) - (w + 1) * beta(y + 1, w + 1 - y) * beta(x + 1, w + 1 - x))
p6 <- (y^2) * p3 * p1^(2 * y - 2) * p2^(-2 * x)
p7 <- y * x * p1^(2 * y - 1) * p2^(-2 * x - 1) * p5
p8 <- (x^2) * p4 * p1^(2 * y) * p2^(-2 * x - 2)
mu <- (p1^y) / (p2^x)
sigma <- p6 - 2 * p7 + p8
v <- (gamma(x + 1)^y / gamma(y + 1)^x)
quant <- ceiling((qnorm(0.95)*sqrt(sigma)/(v-mu))^2)
return(quant)
}
Here is my issue:
# Here is an example of it working when there are no restrictions
oo = optim(par = c(0.1, 0.2), fn, w = 0.1)
oo$par
# In this example, I apply restrictions and it returns an error
oo1 = optim(par = c(0.1, 0.2), fn, w = 0.1, lower = c(-0.49, -0.49), upper = c(0.5, 0.5), method="L-BFGS-B")
oo1$par
In the second example, I apply restrictions and get the following error:
Error in optim(par = c(0.1, 0.2), fn, w = 0.1, lower = c(-0.49, -0.49), : L-BFGS-B needs finite values of 'fn'
Additionally, I also tried the optimx
package and it doesn't seem to work. It just returns NA
s. For example:
library(optimx)
opt <- optimx(par = c(0.1, 0.2), fn = fn, lower = c(-0.4, -0.4), upper = c(0.5, 0.5), w = 0.1)
opt
> opt
p1 p2 value fevals gevals niter convcode kkt1 kkt2 xtime
L-BFGS-B NA NA 8.988466e+307 NA NA NA 9999 NA NA 0.001
Does anyone have any idea as to why I cant minimize my function when I apply the restrictions? (and why the optimx
package returns NA
s?)
Here's a start. I added a line
cat(x,y, w, quant, "\n")
right before return(quant)
. Your first (no-bounds) run gave:
0.1 0.2 0.1 40
0.12 0.2 0.1 40
0.1 0.22 0.1 41
0.12 0.18 0.1 39
0.13 0.16 0.1 39
0.14 0.18 0.1 40
0.11 0.195 0.1 40
0.11 0.19 0.1 39
0.12 0.19 0.1 40
0.11 0.18 0.1 39
0.1125 0.1825 0.1 39
Your second (with bounds) gave:
0.1 0.2 0.1 40
0.101 0.2 0.1 40
0.099 0.2 0.1 39
0.1 0.201 0.1 40
0.1 0.199 0.1 39
-0.49 -0.49 0.1 NaN
in other words, R's attempt to evaluate the function at the lower bounds gave NaN
.
Poking around some more:
library(emdbook)
cc <- curve3d(fn(c(x,y), w = 0.1), xlim = c(-0.5,0.5), ylim = c(-0.5, 0.5),
sys3d = "image")
it looks like your function is giving non-finite values if x=y
or x=0
or y=0
? You might want to catch those cases and return a finite value ...
In fact, that leads to a workaround, which is to not make the lower or upper bounds exactly equal, to stop R trying to evaluate the function in the bad places:
oo1 = optim(par = c(0.1, 0.2), fn, w = 0.1,
lower = c(-0.49, -0.48),
upper = c(0.5, 0.49), method="L-BFGS-B")
It probably would have been possible to diagnose this by looking at the objective function and thinking hard about where it would have non-finite values.