I need to split a line of text into separate columns in Excel. Here is the input that I get.
Input:
Output: CSV record with structure (Code, Sem/Year, Subject, Course, Exam Date, Time, Marks)
I have the data in different sets, from which the lines above are constructed. For example:
Grammar (this is an array / dictionary):
FYI,
My plan is to read the line word by word and try to match each word against the words in the arrays that I have. If a word matches, I assign it to the category it falls under and add it to the relevant column in Excel.
I know how to handle the data and everything else. What I don't know is the most efficient way to determine which category each word falls under.
Is there a lexical analysis expert who can share some thoughts on this?
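For illustration, the word-by-word matching could be sketched in Python roughly like this. The vocabularies in `GRAMMAR` and the function names are hypothetical placeholders, since the real arrays are not shown here. The idea is to invert the category arrays into a single word-to-category dictionary once, so each word needs only one lookup instead of a scan through every array.

```python
# Hypothetical vocabularies: placeholders for the real arrays, which are not shown.
GRAMMAR = {
    "Sem/Year": {"SEM-I", "SEM-II"},
    "Course": {"BSC", "BCOM"},
    "Subject": {"PHYSICS", "MATHS"},
}


def build_index(grammar):
    """Invert {category: words} into {word: category} for constant-time lookups."""
    index = {}
    for category, words in grammar.items():
        for word in words:
            index[word.upper()] = category
    return index


def categorize(line, index):
    """Group the whitespace-separated words of a line by category.

    Words that are not in the index are collected under "Unmatched".
    """
    row = {}
    for word in line.split():
        category = index.get(word.upper(), "Unmatched")
        row.setdefault(category, []).append(word)
    return row


INDEX = build_index(GRAMMAR)
```

Calling `categorize(line, INDEX)` returns a dictionary keyed by column name. This sketch assumes every term is a single word; multi-word terms would need a different tokenization step.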
You should use regular expressions to match a complicated text pattern like this.
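As a rough sketch of what that might look like in Python, the fields with a predictable shape (such as the exam date and time) can be pulled out with one pattern each. The date and time formats below are assumptions made for illustration, because the actual input line is not shown. The helper names are hypothetical too.

```python
import csv
import io
import re

# Assumed formats (not taken from the question): dd/mm/yyyy and h:mm AM/PM.
DATE_RE = re.compile(r"\b\d{2}/\d{2}/\d{4}\b")
TIME_RE = re.compile(r"\b\d{1,2}:\d{2}\s?(?:AM|PM)\b", re.IGNORECASE)


def extract_fields(line):
    """Return the first date and time found in the line, or "" if absent."""
    date = DATE_RE.search(line)
    time = TIME_RE.search(line)
    return {
        "Exam Date": date.group(0) if date else "",
        "Time": time.group(0) if time else "",
    }


def to_csv_row(fields, columns=("Exam Date", "Time")):
    """Format the chosen columns as one CSV line (without a trailing newline)."""
    buffer = io.StringIO()
    csv.writer(buffer).writerow([fields.get(column, "") for column in columns])
    return buffer.getvalue().strip()
```

The remaining columns (Code, Subject, Course, and so on) would need their own patterns or a vocabulary lookup, depending on how regular the real input turns out to be.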