Search code examples
excelms-accessexcel-formulanormalizing

MS Access 03 - Normalizing Data from an Excel spreadsheet


I have a question about a method to break something apart. I get this Excel spreadsheet that provides me with data that I need to do one report. Its pretty simple and straight forward, however there is one particular part of it that is giving me some grief.

In the Excel spreadsheet there is a column that lists "parties involved" in one column. It usually is about 12 people's names separated by commas, but also has orgID in parenthesis behind it:

joe smith (DIV32), john doe (DIV12), roger andrews (DIV14, DIV67, DIV01), etc

and I need to break these into individual columns so that they will be individual fields once I import them into access. I know how to "text to columns" in Excel, but this get screwed up when jon doe (DIV13, DIV54, etc), has more than one division.

Not sure how to do this in access but would love to know.

Anyone got either an excel formula, or access method for this please?


Solution

  • I assume you are importing this into your database. If not, it is probably easier if you do, even if it's temporary; and redone every time you have to run the report

    You are going to end up having three tables to represent this situation

    1. theParty (person)
    2. theOrganisation
    3. thePartyOrg

    theParty with have an ID column

    theOrganisation will have it's own ID column

    thePartyOrder will have it's own ID column, one for theParty, and one for theOrganisation

    Whenever you want to represent a Party as bring a member of and Organisation, you have to make sure the Party exists/is created; the Organisation exists/is created, and hen create an entry in the thePartyOrg table which points at both.

    Because this connection information just stored as text within a single column, it's probably a easiest to first read it all into a staging table and then parse that column in VBA