I would like to run LDA topic modeling on a 9 GB corpus. The plan is to train an LDA model with MALLET for 1000 iterations with 100 topics, optimizing hyperparameters every 10 iterations after a 200-iteration burn-in period. I am working on 64-bit Windows 8; the computer has 16 GB of RAM and an Intel® Core™ i7-4720HQ processor. Can anyone tell me how long I should expect this to take? Are we talking about hours or days? This is my first question here, so if I've left out any important information, please let me know.
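For reference, the setup described above corresponds roughly to the MALLET `train-topics` call assembled below. This is only a sketch: the input file name `corpus.mallet` and the launcher path `bin/mallet` are assumptions (on Windows the launcher is typically invoked from the MALLET directory as `bin\mallet`), and the corpus is assumed to have already been imported into MALLET's format.

```python
import subprocess


def mallet_train_command(input_file, num_topics=100, num_iterations=1000,
                         optimize_interval=10, optimize_burn_in=200,
                         mallet_bin="bin/mallet"):
    """Build the argument list for a MALLET train-topics run.

    input_file and mallet_bin are placeholders; adjust them to the local setup.
    """
    return [
        mallet_bin, "train-topics",
        "--input", input_file,
        "--num-topics", str(num_topics),
        "--num-iterations", str(num_iterations),
        "--optimize-interval", str(optimize_interval),
        "--optimize-burn-in", str(optimize_burn_in),
    ]


cmd = mallet_train_command("corpus.mallet")  # hypothetical file name
print(" ".join(cmd))

# To actually start training (requires a local MALLET installation):
# subprocess.run(cmd, check=True)
```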
In case anyone is interested: in the end I ran the topic modeling as described in the question, and it took almost two days to finish (1 day 20 hours).
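To make that figure easier to reuse, here is a small sketch that converts the reported wall-clock time into an average cost per iteration and scales it to other iteration counts. It assumes every iteration costs about the same, which is only an approximation (iterations with hyperparameter optimization may take longer), and the function names are made up for illustration.

```python
def seconds_per_iteration(days, hours, iterations):
    """Average wall-clock seconds per iteration for a finished run."""
    total_seconds = (days * 24 + hours) * 3600
    return total_seconds / iterations


def estimated_hours(sec_per_iter, iterations):
    """Rough runtime in hours, assuming a constant per-iteration cost."""
    return sec_per_iter * iterations / 3600


# The run reported above: 1 day 20 hours for 1000 iterations.
per_iter = seconds_per_iteration(days=1, hours=20, iterations=1000)
print(f"{per_iter:.1f} s per iteration")  # 158.4 s per iteration

# Rough estimate for a shorter run on the same corpus and machine.
print(f"{estimated_hours(per_iter, 500):.1f} h for 500 iterations")
```

The estimate applies only to a comparable corpus, topic count, and machine; timing a short trial run on your own data would give a better per-iteration figure.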