I know that the more data, the better, but what would be a reasonable amount of data to train SyntaxNet?
Based on some trial and error, I have arrived at the following minimums:
But please note that with these, I've only managed to get the steps in the NLP pipeline to run; I haven't actually managed to get anything usable out of it.