I'm not sure if this is for StackOverflow or Programmers but since it's more leaning towards implementation, I'm asking it here.
I'm looking for an algorithm that could take n
inputs (all floats) and produce m
(all floats; m < n
) outputs. This system could then be trained using a sort of fitness score to learn the correlation between the inputs and the outputs.
What would be the best algorithm to use for such a purpose?
A little bit of context: I want to use machine learning instead of a self invented algorithm because I don't know the (full) correlation between the data, I do know if the outcome of the machine learning algorithm will be any good or not and train it from there.
I have a couple of variables to pass in like:
0-1
)0-1
)0-1
)0-1
)0-1
)0-1
)0-1
)The output should be:
0-1
)0-1
)I have very large data sets that can be processed, so ideally the algorithm suggested can also be persisted.
I have seen algorithms like Artificial Neural Networks but those don't allow for a fitness score as they need input and output coupled together. I can't provide that, I can only calculate the chance that those numbers would be correct (the fitness score – by design never >= 1
)
From the description it looks like a classical problem of reinforcement learning where you do have some agent performing actions (here defined as action+strength, but this is still an action) which changes some internal state of the agent and gets (at some point at least) a reward.
There are many methods to learn a good policy (rule selecting a particular action) from your environment, including (but not limited to):