I have an excel spreadsheet with 15 columns, one of which is EmailAddress and then 100,000+ records..
In my data i know that there are many duplicate email addresses.
Can someone tell me how can i extract a distinct list where each record is represented only once by emailaddress?
Alternately, if i import the data into SQL, how can i remove records that are duplicated by emailaddress but leave one record if 4 are found...
Thx..
In simpler tasks I would suggest the use of openrowset. Of course you could combine that with other queries following that, to do the filtering, but in this case it would be great to use DTS for MSSQL2000 and SSIS for later versions.
How?
the question is complex, and has 2 sub-questions that must have been answered previously.
and you could read more about SSIS https://stackoverflow.com/search?q=SSIS+Excel