The Ten Pitfalls of the Data Wrangler (aka the Wielder of M/Power Query)

February 21, 2017 at 1:34 pm

Such an awesome post Gil! I’ve been following this on your blog, and you’ve systematically made my queries much more bulletproof. List.Union and Liat.Transform combo is POWERFUL.

Thanks again for sharing your M expertise!

February 21, 2017 at 9:49 pm

Thank you Chris. I am glad you found it useful. Would you like to share which scenario / data challenge you were able to resolve with List.Union and List.Transform?

February 21, 2017 at 9:03 pm

Thanks for sharing. As you mention in your article, it is not difficult to author robust queries– The only challenge is being prepared to do it. This post is a great way to prepare.

February 21, 2017 at 9:55 pm

Thank you sherlock spreadsheet. Looking forward to hearing about your experience in creating robust queries.

February 22, 2017 at 7:53 am

I was going to say…I think I’ve seen these before…on DataChant.com!

February 22, 2017 at 8:59 am

Wow. I dropped a project I was working on a few weeks ago due to refresh errors (column name changes in source data). I’ll be dusting that back off now.

February 26, 2017 at 12:14 pm

Good luck, Geary. Wishing you zero refresh errors.

February 22, 2017 at 9:36 am

Awesome post! I had the changed type problem a few times and it took a few minutes for me to figure out that it was Power BI changing the type not me. Thanks! Now I feel like I have the keys to the city – or the rules for creating very useful templates.

Great job!

February 26, 2017 at 12:17 pm

Thank you Colleen. Loved the metaphor.

February 22, 2017 at 10:15 am

Ten alligators! My Pitfall guy has never had to swing over more than three at a time. Very daunting.

Thank you for this post. I know I’ll be referring back to it and the underlying posts quite a bit in the future!

March 1, 2017 at 7:04 am

Hope you’ll find it useful

February 22, 2017 at 11:26 am

Great post. It would have saved me a lot of headaches in the last few months and I’m sure that it will help me in the future, especiallt #7 and #10 that were still a mistery to me. Thanks.

March 5, 2017 at 2:29 pm

Thank you FranzV, Sorry I didn’t publish it earler 🙂

February 22, 2017 at 11:28 am

Excellent post Gil. Thank you. I already felt pretty powerful with Power Query. Now that has been multiplied!

March 1, 2017 at 7:06 am

Thank you Ken, Hope you enjoy the ride with your new powers.

February 23, 2017 at 12:05 pm

Fantastic, Gil! Thanks for sharing all these tips with us. So helpful.

February 26, 2017 at 12:21 pm

Thank you Dory.

February 24, 2017 at 11:39 am

Thanks a Lot. One of the best PQ related post!!

February 26, 2017 at 12:23 pm

Thank you AshT. This series was so time-consuming. I am glad that the outcome is appreciated.

February 27, 2017 at 6:32 am

A major pitfall not covered here which affects me is the Path to Source files, e.g.

= Csv.Document(File.Contents(“E:\OneDrive\Company\Sub Folder\SO_Info.csv”),[Delimiter=”,”, Columns=6, Encoding=1252, QuoteStyle=QuoteStyle.None])

Why can we not use the Current folder, rather than using the Full File Path??

March 1, 2017 at 7:08 am

You are right. Perhaps I should start a new series on callboration pitfalls.

March 1, 2017 at 11:45 am

Regarding Pitfall #4. Could a function be used to perform an action on every column OR on each column in turn using so that no matter what each Column is named the number could be used. In my case an action such as FillDown.

March 5, 2017 at 2:28 pm

To fill down multiple columns, there is no need to perform iteration. Assuming you have the offset (a zero-based index of the first column), and the count, you can apply this code:
FilledDownColumn = List.Range(Table.ColumnNames(Source), offset, count),
FilledDown = Table.FillDown(Source, FilledDownColums)

March 6, 2017 at 4:17 am

Many thanks for the Tip. I have used your code, but nested together to form:
#”Filled Down” = Table.FillDown(#”Previous Line”,List.Range(Table.ColumnNames(#”Previous Line”), 1))
Looking forward to future Tips on your site.

March 6, 2017 at 2:53 pm

Great article. Some of these I have run into and figured out on my own; others i didn’t know about and great to have a heads up on. Thanks for writing it up.

April 27, 2017 at 3:35 pm

One I found useful in a planning tool that I created was the following, which checked if the Promo column exists, and created a null one if it did not.

#”Column check” = if not(Table.HasColumns(#”Renamed Columns”, “Promo”)) then Table.AddColumn(#”Renamed Columns”,”Promo”, each null) else #”Renamed Columns”

Since was possible that a sales person had no promotions, which after the depivot of their data, removing all the empty cells and repivoting to desired format

June 15, 2017 at 1:21 am

HI Gill

Thanks for sharing these tips.

I used #9 to bring in all the columns when the columns can change. I couldn’t get your DataChant #9 solution (the one at the bottom of the post) to work using the List.Union – no doubt my fault in missing something,

But I did get the exception code to work above to work – i just removed one of the columns i didn’t need, but it then it brings in all the other columns even if new ones are added – perfect.

Thanks again.

Neale

February 22, 2018 at 1:32 pm

Very helpful; thank you!

November 2, 2023 at 9:06 pm

Terrific!

November 2, 2023 at 9:07 pm

Terrific!

The Ten Pitfalls of the Data Wrangler (aka the Wielder of M/Power Query)

The Root cause: Preview

Pitfall #1 – Your formula bar is inactive

Pitfall #2 – Auto-generated Changed Types step

Pitfall #3 – Include vs. Exclude Filters

Interested in Learning How to Do this Kind of Thing?

Pitfall #4 – Column Reordering

Pitfall #5 – Removing and Selecting Column

Pitfall #6 – Column Renaming

Pitfall #7 – Split Columns By Delimiter

Pitfall #8 – Merge Columns

Pitfall #9 – Expand Table Columns

Pitfall #10 – Remove Duplicates and Lookup Tables

Conclusions

Cancel reply