I’ve got some raw data that exports into an excel file with some funky formatting.
The data requires merging of different sets of columns but the number of columns in each set can change every week.
Data contains free-text entries that can contain html tables. Excel spreads the columns of the html tables into adjacent cells within the dataset. Thus when loading into Power Query, the additional cells are given their own columns but with no header.
The number of columns in an html table is dynamic .
There are multiple free-text variables in the dataset that can contain html tables.
Use power query editor to merge these columns into one but be able to handle any number of columns dynamically.
Example of how it looks like in Excel:
Example how it looks like Power Query:
Here is a pbi file that contains my work thus far and what the data should ultimately look like.
The column merging steps related to this post however are in the query labeled ‘Mock Data’.
Mock RAW QMS for COE.xlsx (9.4 KB) RAW QMS Mock for COE.pbix (41.7 KB)