Dax: Back to the basics

August 21, 2018 at 11:19 am

Thanks so much for this post! This is a concept I continue to struggle to fully understand. I was following all the way down until the last example. I still can't fully wrap my mind around why, in the second example, Table1[Person] = "Ben" removed/replaced the existing filter (Table1[Person] = "Anne"), whereas in the third example, FILTER ( Table1, Table1[Color] = "Green" ) did not remove the existing Table1[Color] = "Red" filter but instead created an "and" filter context.

August 23, 2018 at 3:14 am

This is one of those things that you just have to "know" about DAX. The second example is:
CALCULATE ( SUM ( Table1[Amount] ), ALL ( Table1[Type] ), Table1[Person] = "Ben" )
A simple expression such as Table1[Person] = "Ben" is called a Boolean expression: it is either true or false. But keep in mind that it is just DAX shorthand that the engine internally rewrites to:
CALCULATE ( SUM ( Table1[Amount] ), ALL ( Table1[Type] ), FILTER ( ALL ( Table1[Person] ), Table1[Person] = "Ben" ) )
The ALL in this case ignores any filters on the Person column and therefore returns all of its values, so for my highlighted row the filter 'Anne' is effectively replaced with 'Ben'. The same happens for every row of the visual: whatever Person is on the row gets replaced with 'Ben'. To be technically precise, even 'Ben' in the visual is replaced with 'Ben' from the DAX code, but since they are the same value the result is as expected. Compare that to my third example:
CALCULATE ( SUM ( Table1[Amount] ), FILTER ( Table1, Table1[Color] = "Green" ) )
In this case the filter argument to CALCULATE is a full table expression, and the most important part is that it does not use the ALL function. FILTER therefore iterates over Table1 as evaluated under the "old" filter context, meaning it is already pre-filtered down to rows with 'Anne', 'Red', 'Bike'. FILTER then narrows this pre-filtered table to [Color] = "Green". Since no rows in the pre-filtered table are 'Green', FILTER returns an empty table to CALCULATE, so CALCULATE returns BLANK. So the difference between the two examples is the hidden ALL. By trying to shorten the code, the DAX developers made it a bit harder to understand what is going on. And this is not the only place you'll run into hidden functions. Make sense? Clear as mud?
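For anyone who wants to try this side by side, here is a minimal sketch of the three forms as separate measures. The measure names are made up for illustration; Table1 and its columns are the ones from the post. Each definition would be entered as its own measure.

```dax
-- Shorthand: a Boolean filter argument (hypothetical measure name).
Ben Shorthand =
CALCULATE ( SUM ( Table1[Amount] ), Table1[Person] = "Ben" )

-- The expanded form described above. The ALL is what replaces
-- any existing filter on Table1[Person].
Ben Expanded =
CALCULATE (
    SUM ( Table1[Amount] ),
    FILTER ( ALL ( Table1[Person] ), Table1[Person] = "Ben" )
)

-- FILTER over the table itself: no ALL, so Table1 arrives already
-- restricted by the current filter context and is only narrowed further.
Green Within Context =
CALCULATE (
    SUM ( Table1[Amount] ),
    FILTER ( Table1, Table1[Color] = "Green" )
)
```

Placed in the same visual, the first two measures should show identical values on every row, while the third should be blank on any row whose color is not Green.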

August 23, 2018 at 1:25 pm

That was the missing piece! I think I finally understand …thank you!!!

August 21, 2018 at 12:12 pm

Good explanation!

August 21, 2018 at 4:29 pm

Wow! Brilliant, Matthew, I learned something new. 🙂 Thank you very much.

August 21, 2018 at 9:18 pm

Great explanation, in particular around the effect of using FILTER or not in CALCULATE. It is also very timely for me, as I was starting to troubleshoot two formulas that were producing incorrect results due to this exact aspect: I had not used FILTER. Thanks.

August 22, 2018 at 5:15 pm

Yes, that was something I considered while drafting the post: how far down the rabbit hole should I go? There are many nuances in DAX, and a Boolean expression used as a filter argument to CALCULATE is one of them. You have to remember that Table[Color] = "Red" is internally expanded by the DAX engine to: FILTER ( ALL ( Table[Color] ), Table[Color] = "Red" ). This quasi-hidden use of ALL has tripped up a lot of people, including me early on.

August 22, 2018 at 9:57 am

Could you please provide a scenario that demonstrates how step 2, "Move Row context to Filter context", works? Thanks.

August 23, 2018 at 3:33 am

The concept of Row Context really needs to be its own post. I don't think I can adequately explain it in the comments section. But IMO it is a vitally important concept to understand, because it can rear its head in unexpected places and produce results that at first glance appear to make no sense. I see that happen all the time. If PPPro permits, I'll work on a post about it. Stay tuned(?).

August 23, 2018 at 6:42 am

I'll certainly keep following your forthcoming posts. Thanks in advance.

August 22, 2018 at 11:09 am

The Filter Context is just a replica of the Criteria Range of an Advanced Filter (which in turn emulates the WHERE clause of a SQL query).
Now the only thing to teach in CALCULATE to an Excel user is the blocking semantics of CALCULATE, wherein CALCULATE wins in case of conflicts (two filters on the same field, one from the pivot and one from CALCULATE).
The next is the concept of context transition, wherein a row context is converted to a filter context when CALCULATE is used where a row context exists, such as in a calculated column.
And finally, the measure [mSales] defined as =SUM(DATA[ACTUALS]) is actually CALCULATE(SUM(DATA[ACTUALS])) wherever it is referenced.
So when a measure name is used on the right-hand side of an equals operator in the filter argument of the FILTER function, context transition happens.
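A minimal sketch of those last two points, under some stated assumptions: only DATA[ACTUALS] and [mSales] come from the comment above, while the DATA[Customer] column, the other names, and the threshold of 1000 are hypothetical. Each definition would be entered separately (one measure, two calculated columns on DATA, and one more measure).

```dax
-- Measure from the comment above.
mSales = SUM ( DATA[ACTUALS] )

-- Calculated column on DATA (hypothetical name).
-- There is a row context here but no filter context, so SUM sees
-- every row of DATA and returns the grand total on each row.
Total No Transition = SUM ( DATA[ACTUALS] )

-- Calculated column on DATA (hypothetical name).
-- CALCULATE triggers context transition: the current row's column
-- values become filters, so only the matching rows are summed.
Total With Transition = CALCULATE ( SUM ( DATA[ACTUALS] ) )

-- Measure (hypothetical name, column, and threshold).
-- FILTER iterates the customers; referencing [mSales] implies CALCULATE,
-- so each customer in turn becomes the filter for the sum.
Big Customers =
COUNTROWS (
    FILTER ( VALUES ( DATA[Customer] ), [mSales] > 1000 )
)
```

If [mSales] in the last measure were swapped for a bare SUM ( DATA[ACTUALS] ), there would be no context transition and every customer would be compared against the same total.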

August 23, 2018 at 1:53 pm

"Now the only thing to teach in CALCULATE to an Excel user is the blocking semantics of CALCULATE, wherein CALCULATE wins in case of conflicts (two filters on the same field, one from the pivot and one from CALCULATE)." This is what I was trying to convey in my post, but perhaps I missed the mark a bit. To be more precise, it is not CALCULATE itself that performs blocking semantics (or is "overwrite" now the preferred terminology?), but rather some other function used as a filter argument to CALCULATE. For example, CALCULATE ( SUM ( Table1[Amount] ), ALL ( Table1[Color] ) ) does perform blocking on the column Table1[Color] due to the use of ALL, not because the column was referenced in CALCULATE. However, if we do CALCULATE ( SUM ( Table1[Amount] ), FILTER ( VALUES ( Table1[Color] ), Table1[Color] = "Green" ) ), no blocking semantics are performed, because neither FILTER nor VALUES performs blocking.
And yes, row context transition is another vital aspect of DAX that people need to understand. I skipped it in this post because it is too big a topic to mash together with this one. And yes, when a measure is used on the right side of an equals operator, context transition may happen depending on the circumstances. But context transition also happens without an explicit CALCULATE via table functions such as FIRSTDATE, LASTNONBLANK, etc. It's the combination of an iterator and CALCULATE or such a table function that triggers context transition. But good point.
Thanks for commenting.
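To make the blocking comparison concrete, here is a rough sketch of the two expressions from this reply written as measures. The measure names are invented for illustration; Table1 and its columns are from the post.

```dax
-- ALL removes whatever filter the visual has placed on Table1[Color];
-- filters on other columns (Person, Type) are left in place.
Amount All Colors =
CALCULATE ( SUM ( Table1[Amount] ), ALL ( Table1[Color] ) )

-- VALUES returns only the colors visible in the current filter context,
-- so FILTER can only keep "Green" if it is already visible.
Amount Green If Visible =
CALCULATE (
    SUM ( Table1[Amount] ),
    FILTER ( VALUES ( Table1[Color] ), Table1[Color] = "Green" )
)
```

On a row of the visual filtered to Red, the first measure should return the amount across all colors, while the second should return blank, because VALUES only hands FILTER the value Red.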

August 22, 2018 at 7:22 pm

Two questions on this excellent article:
1) For the "Sum of Amount No Type Ben Only" measure you say: "This measure reads 'Sum the values according to the Filter context box, but remove any filters on the column Table1[Type] and only keeps the rows where the column Table1[Person] = "Ben".'"

But it doesn't do that, i.e. keep rows where the column is Ben. It *replaces* everything in the Person column with Ben. So in the first row, it says "OK, this is Anne, Blue, and Bike, but I'll replace Anne with Ben, and get all types", so it adds up using Ben and Blue. Right? Or am I not understanding what you are saying?

2) Related to #1, why would you write a measure like this? After seeing your example, I am having trouble figuring out where I'd want that behavior. Your article would now make me more inclined to write:

Total Ben Only =
CALCULATE (
    [Total Amount],
    ALL ( 'Test Data'[Type] ),
    FILTER (
        'Test Data',
        'Test Data'[Person] = "Ben"
    )
)

This gives blank values for all of Anne’s and Charlie’s, and fills in Ben’s. Much like your other FILTER() example.

Am I misunderstanding something, and can you give an example of where you’d want the results of your Ben Only measure?
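One possible alternative for the "blank unless the row is Ben" behavior, offered only as a sketch: DAX has a KEEPFILTERS function that asks CALCULATE to intersect a filter argument with the existing filter context rather than overwrite it. It is not mentioned in the post, and the measure name below is hypothetical; [Total Amount] and 'Test Data' are the names from the question above.

```dax
-- Hypothetical measure name.
-- KEEPFILTERS intersects the Person = "Ben" filter with whatever the
-- visual already filters on Person, instead of replacing it.
Total Ben Only Keepfilters =
CALCULATE (
    [Total Amount],
    ALL ( 'Test Data'[Type] ),
    KEEPFILTERS ( 'Test Data'[Person] = "Ben" )
)
```

On a row for Anne, the intersection of Anne and Ben is empty, so the measure should come back blank; on a row for Ben it should return Ben's amount across all types.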

August 23, 2018 at 2:50 am

1) After reviewing your comment and my post, I can see your point that it was perhaps a poor choice of words on my part. A Boolean expression used as a filter argument, like Table1[Person] = "Ben", is just DAX shorthand that the DAX engine internally rewrites to: FILTER ( ALL ( Table1[Person] ), Table1[Person] = "Ben" ). The ALL returns all values of the Person column in Table1, and FILTER then iterates over that list of values and keeps only 'Ben'. So you are correct that it effectively replaces 'Anne' with 'Ben' in your scenario, and it will sum and display the Amount column for all rows where Person = "Ben" and Color = "Blue", even though the pivot shows 'Anne', 'Blue', and 'Bike' for that row.
2) The intent of the examples was to explain the algorithmic logic of the DAX engine, not to solve any particular real-world problem. As I was writing the post I also thought of many other examples to include that would shed light on other nuances of DAX, but I came to the realization that the post would be too long, so I "settled" for three.
OK?

August 23, 2018 at 10:32 am

Thanks Matthew. I didn't mean to challenge with "why would you do this"; I was just wondering, if this is how a raw filter in CALCULATE works, when I would use one, because I am having a difficult time coming up with a reason. The results of the FILTER() version are what I would always want, I think.

Again, I appreciate the article. I can never read enough about the nuances of filter context and CALCULATE(). 🙂

August 23, 2018 at 1:47 pm

No problem. I didn’t feel challenged at all.

And “always” is a long time… I use the shorthand notation all the time.

Thanks for commenting.

August 24, 2018 at 3:03 pm

If I understand Edh's question correctly, it's about the use of a raw CALCULATE filter like this:
CALCULATE ( SUM ( Table1[Amount] ), Table1[Person] = "Ben" )
instead of the FILTER version like this:
CALCULATE ( SUM ( Table1[Amount] ), FILTER ( ALL ( Table1[Person] ), Table1[Person] = "Ben" ) )
If I've not misunderstood the issue, then according to Rob Collie's PP book (2nd Edition), it has to do with performance, as well as some safety in keeping early users of DAX out of trouble. If you can use the simpler "shorthand" of the Boolean version, then apparently it will be faster (if there is any difference). The equivalence is shown on page 244 of Rob's book, and the performance concern is mentioned on page 172. Maybe this helps in understanding why you might want to use the shorthand or raw Boolean version of the expression.

BTW, excellent post. The visual model was very helpful.

August 24, 2018 at 7:53 pm

Thanks Renny. I have that book at home so I’ll check out the details.

August 25, 2018 at 5:03 am

Errr... sorry, but no. There is no performance difference between the two measures you have listed, because the query plans are the same. The DAX engine always rewrites the former into the latter. The only advantages are code that is more compact and easier to read, and fewer keystrokes for the author. Thanks.

September 18, 2018 at 9:24 am

"DAX is simple, but it is not easy" (Alberto Ferrari). Perfect example of that right here. Nice job Matt.

February 20, 2019 at 6:02 pm

Hi. Thank you.
Your answer to Matthew Brice's question really helped me understand the context transition of the CALCULATE() function.
It helps me a lot in my monthly reports, with very easy visualizations, as I work as a foreman in a production calculation department at an Algerian oil company (SONATRACH). Thanks again.

August 6, 2019 at 3:04 pm

Thanks for the post: good examples with explanations. Thanks also to Elizabeth; the question you asked is exactly the one I wanted to ask.
