Has anyone any idea where to start? For example, extract "two" from "I have two apples". I'm looking in the direction of NLP or QDA. Any leads for how to go about it would be appreciated.
You might be interested in Stanford NER system. It identifies numeric entities.
You can try it here: http://nlp.stanford.edu:8080/corenlp/