Power BI (DAX and M) vs. R: A Summer of Perspective

August 23, 2017 at 3:36 pm

Nice article!
I like to add my 3 cents 🙂

1) R vs M: Although they are both functional languages (and at least in my rookie eyes have lot in common: https://www.thebiccountant.com/2015/11/11/power-query-power-bi-are-ideal-learning-paths-from-excel-to-r/ ), the query editor in which M runs in Excel and Power BI makes M a direct language as well: After you’ve finished your expression, you can see the results immediately. For me this was essential to get my head around M at all, and I believe that this make a huge difference for many folks, who like me are lacking a formal technical education.

2) R & M always produce one result, while DAX-measures (I think you agree if we just ignore calculated columns here) produce … “elements” with the determination to produce multiple results. So one formula has the potential to show the correct results in exponentially increasing cases/coordinates of your 2-dimensional reports. Super power, super effective! But they need to be evaluated in an “intelligent environment” like pivot-tables or cubefunctions in Excel or the grid in PowerBI (where the context is provided and the compilation takes place).

3) While R & M can only produce one result, this result can have different characters: It can either be an end-result (like a nice R-chart) or an intermediate data-dump in form of a flat table. People have used and cherished ordinary pivot tables long enough to be tempted to use implicit measures (or quick measures) as the start of their data discovery. So the environment that DAX cannot live without (or at least cannot show its strengths without) also sets free some M-agic for simpler approaches 🙂

So my mental image adds M as a flat table with an arrow to a pivot-table to your R-pyramid and the DAX-cube here 🙂

August 24, 2017 at 11:22 am

Thanks Imke, very cool! So you *ARE* using M today to feed modified data sets into R?

August 24, 2017 at 5:21 pm

Hi Rob,
yes, I use R-functions if I cannot implement things in M. The cool thing is that you can encapsulate the R-code into a custom M-function where you can present a nice function-dialogue to the user.
And if we had the R-integration in Excel as well, I would do much more with it, but this holds me back considerably here at the moment.

August 25, 2017 at 10:56 am

Imke can you give us an example of when/how you incorporate some R code into an M wrapper? I’m intrigued, and it sounds from some other comments below that some folks would LOVE this, and don’t yet know that it’s possible.

August 25, 2017 at 10:58 am

Fresh from the press: Sharing some tips & tricks around R-scripts in the editor here 🙂

August 25, 2017 at 11:00 am

There is a link as well: https://www.thebiccountant.com/2017/08/25/tips-and-tricks-for-r-scripts-in-the-query-editor-in-power-bi/

August 23, 2017 at 9:21 pm

Rob, another good post. I learnt a lot from this and now see a priority for developing my data analytics skills. First “DAX”, close second “M” and third for polish, “R”.

August 23, 2017 at 10:49 pm

I’ve never looked at Power BI and R as competing against each other. Yes, they have some similarities but their strengths are very different which you pointed out in the article. I use both on an almost daily basis and sometimes together. My favorite so far was building a dynamic ggplot heat map in Power BI with only a few lines of code and it looked great.

August 24, 2017 at 11:28 am

In terms of perception, I think R and Power BI kinda *do* compete, because people rarely get to the bottom of the capabilities of a technology until *after* they have chosen to adopt said technology.

For instance, I’m a DAX man, through and through. Until recently I haven’t even started to *investigate* R – cuz hey, DAX is killing it for me.

There’s similarly a whole generation of students now coming up through the ranks being taught that “Data = R.” So there’s a perception gap on both sides.

Now that I’m getting into it a bit, I’m seeing that R isn’t a replacement for what we do in DAX – I’m positive that if I had to choose one over the other, I’d choose DAX. But since we *can* use both, I want to make sure we are leveraging them both properly. (And people on staff here at PowerPivotPro are already quite well-versed in R – it’s just ME, the “Old Man,” who’s catching up on the strategic front).

Lastly, I always use the word “versus” in these article titles to indicate “compare” – not “compete.” 🙂

August 24, 2017 at 12:34 am

It’s not a dax vs m vs r game. It’s how can I make sense of this data as quickly as possible.

August 24, 2017 at 11:29 am

Repeating part of my comment above: I always use the word “versus” in these article titles to indicate “compare” – not “compete.” 🙂

August 24, 2017 at 6:56 am

DAX (as expressed in Power BI) can leverage data transformations using R Script steps embedded in Power BI Queries (M), and also Power BI’s “R script visuals” can leverage data prepared in Power BI Queries and DAX transformations. So we probably all should learn more of all 3 to use the best combination of tools for the job.

August 24, 2017 at 10:50 am

A lot of your readers will probably have seen Chandoo’s recent post on creating a statistical chart using R.

https://chandoo.org/wp/2017/08/17/visualize-salary-increases-jitter-plot/

7 lines of R code… This is a tool we need to learn! Based on the variety of chart options in ggplot2 it doesn’t seem an appropriate use of time for the Power BI team to try and duplicate every visual. But on the other hand, and to Rob’s point, can R read a multi-table data model – or result set – to generate such a chart?

So while Rob’s distinction may be appropriate today, both sets of tools are going to evolve. It may be easier for R to create a tabular library than for PowerBI to develop an R shell, but we won’t care as long as it makes it easy to display the visuals. The “Sweet Spot” features in the above table are (deliberately?) distinct between business and engineering objectives. Manufacturing may thus be the initial point of integration between the two. Time to study up on what sort of data extracts MRP and MRO systems typically offer?

August 24, 2017 at 11:15 am

Rob,

Thoroughly enjoyed the insights AND the dialog!

August 24, 2017 at 11:46 am

Thank you for the clarity…totally useful

August 24, 2017 at 12:10 pm

I cannot agree or disagree more. I have used DAX for last 5 years and used R for couple of years. Both have their own strengths and weaknesses. One cannot replace the other. Like Jimmy Glenn mentioned – ‘It’s not a dax vs m vs r game. It’s how can I make sense of this data as quickly as possible.’ M can be good to retrieve data quickly from variety of sources/connections and R is great for visualization (Power BI or Excel don’t come anywhere close, period). An end user will pick his/her option based on how fast he/she can get to the end result. I think on a long run there has to be a synergy between DAX-M-R. For someone who can get this synergy working well will get the quickest results (provided they need the high degree of analytical complexity that R can offer)

We also have to keep in mind the segment of people who come to R or have picked R – most of them have already made a choice between R and Python (There’s a very small overlapping end user base between the two). So, people who have picked R, have streamlined their aspirations – they like the speed of R, the vast ocean of packages for flexibility and finally visualization. More importantly they do not belong in production environment (some overlap with DAX end users – “quick and dirty”) or they may not be creating scalable enterpise level ML or AI applications.

I think this niche segment of end users is the common base between DAX-M-R (or MR. DAX) that makes this an exciting zone. A nice venn-diagram is needed (@Rob Collie).

Thank you for a wonderful post and even more wonderful comments.

August 24, 2017 at 12:14 pm

Venn Diagram – I like it! Trouble is, I already spend more time on the visuals in each article than I do on the writing – by a lot, actually. I’ll come back to this topic a lot in the future, I suspect. Plenty of chances for new visuals 🙂

August 24, 2017 at 1:32 pm

Not a comment about the methods, but what a trip down memory lane seeing a control chart. I used to be in QA during the growth of SPC in the Auto Industry in the 80’s, and the amount of CPK calculations and control charts that I used in my job to track and adjust production on the fly or verify new equipment functionality… Just happy to see it still in use.

August 24, 2017 at 2:12 pm

This is one of THE COOLEST comments I’ve ever read. Thank you for sharing. I’m kinda infatuated with the idea of control charts now actually, there’s gonna be more on this 🙂

August 24, 2017 at 2:16 pm

I am an enthousiastic Power BI user. A year ago I bought a booklet about R because someone told me that R was “the” free solution for all statistics. Used on universities. I bought a simple book: R for Everyone from Jared P. Lander. My impression was that R looks like MS DOS and Power BI looks like Windows 10.

August 24, 2017 at 4:50 pm

Hi, Rob, I want to highlight something. I became a big fan/supporter of DAX following your blog and when I read your book I thought “I’m becoming a Data Jedi”, and I did. I achieved more productive than the rest of my team and that encouraged them to learn the language too.

I did a lot of awesome projects with that acquired knowledge, and then you wrote a post talking about us ExcelPros and Data Scientists (2012). The things you listed we didn’t do, pushed me to pursue that knowledge (via R) and even started a business after acquiring some of them.

Now I can confirm that in reality R, DAX and M aren’t competing. Sometimes I use R to make a highly specified analysis that was reproducible and other times I found myself making PowerBI reports because the consumer needed to slice and dice data. At the end I think that through development we are going to find a sweet spot where:

* The PowerBi environment will benefit for some of the high-level statistical functionality found in R, via tidy data output of statistical objects.
* The R environment will benefit from the logic that can be infused into data through the DAX/M engines, making more easier to slice and dice data, something very important to the EDA stage when producing high-level statistical analysis.

So, you’re doing well in putting this thought in public discussion and taking an interest in this topic. I think it’s where everything is heading towards.

August 25, 2017 at 12:55 pm

Wow wow WOW. So, so good. Thrilled to be a part of career and personal growth stories like this. (And let us know if you’d like to collaborate a bit.)

August 24, 2017 at 11:43 pm

One of the greatest features of R (and Python… which some of us prefer to R!) is the ability to add libraries on the fly.

DAX is a very controlled language – you don’t see new libraries being added in to enhance the language the way that new libraries are added in. And for specific or scientific calculations, you still need to create your own custom M functions.

Now… if we could use Python Pandas libraries in M – well then we’d have something!!

August 25, 2017 at 9:09 am

Cannot agree more with the comment above !
We definitely need to be able to use libraries in M.
Libraries for data prep’ and analysis are like custom visuals for data visualisation. They extand your power without any additional effort.

August 27, 2017 at 1:24 pm

I’ve set up the infrastructure for it here: https://www.thebiccountant.com/2017/08/27/how-to-create-and-use-r-function-library-in-power-bi/
Create an M (or R-) library for PowerBI or Power Query as a record. This can automatically be filled with code from GitHub.
Now it’s just up to the community to fill it with content 🙂

August 25, 2017 at 12:57 pm

On its face I don’t see much need for custom libraries in DAX. I mean, I’d love some date intelligence functions that deal with custom calendars I suppose. But can you give me some other examples of custom DAX libraries you’d like to see? (I’m genuinely curious rather than trying to “call your bluff” or otherwise dismiss you.)

August 25, 2017 at 3:32 pm

Excel has a pretty robust set of financial functions; that would be somewhere to start from my perspective.
Trying to do something like a depreciation calculator requires an in depth understanding of ListGenerate in M in order to determine what the value of an asset is at a particular point in time. Totally do-able, but definitely an impediment for your casual PowerBI user.

August 28, 2017 at 10:34 am

Great article and lots of very insightful comments. I will say that R does handle multi-table datasets easily, using one of many R packages like dplyr.
To me, both DAX and R are similar in that they can quite often give the “Wow!” factor to the analyses they can do. They are different, but very complementary when diving into data.

August 29, 2017 at 4:32 pm

Well, I have to say, this is one of the most exciting posts I have seen on PPPro. I have been converting all our reports from a previous reporting tool to PowerPivot and came across this stuff about control charts through a reference given to me by the illustrious Fred Kaffenberger (yes, your Fred Kaffenberger). Fred steered me towards a video from the 2016 Microsoft Data Summit which featured a PowerPivot implementation for Tyco. In that video was mentioned a lady by the name of Stacey Barr. I looked her up and whoa, talk about Control Charts! Stacey is part of a very modern movement which involves the use of these charts (which she likes to call Smart Charts). These kinds of charts have been used in manufacturing for decades as part of Quality Control. But people like Stacy discovered that they can be used for all sorts of business processes to measure performance and they have now become part of the Evidenced-based Management Movement. They are referred to as Process Behavior Charts.

Well, dilemma for me. Should I continue with use of PowerPivot or get involved with Process Behavior Charts as our primary reporting mechanism (I do a lot of the corporate reporting for our company).. In fact, I didn’t think we would be ceasing our use of PowerPivot, but I was curious as to whether or not I could use PowerPivot to produce Process Behavior Charts. As I was much of a DAX newbie as I was with respect to using these charts I asked Fred K to assist with creating a PowerPivot report for a Process Behavior Chart.

So, very cool to see this post!

If you are really interested in learning about modern uses of Process Behavior Charts in business, I’d strongly recommend reading Donald Wheeler’s book Making Sense of Data, as well as: Understanding Variation: the key to managing chaos. Stacey Bar’s latest book Prove It! is also great.

August 29, 2017 at 5:34 pm

The forename of MR. DAX is BERT. This Excel-toolkit provides an awesome R-Integration. DAX, M, Excel functions, VBA and R Code/Graphics, the complete business analytics workflow in one program (Excel): https://bert-toolkit.com/

August 30, 2017 at 7:46 pm

Rob,

First of all – great post!

I think a natural way of extending the topic is to do a comparation between Python and M. As you have mentioned, M is very powerful in terms of tranforming the dataset, while fit well with the power of Python with its data wrangling library like pandas etc.

Personally I have been using M for 2 years building corporate level dashboard, and recently get into Python and found it fascinating to performing certain task, particularly its massive library.

Tom

September 2, 2017 at 1:43 am

Excellent post agreed. Control Charts are a big deal to those of us that are in the process improvement (PI) space, Lean Six Sigma (LSS) work specifically. I have been using control charts for well over a decade and the idea of finding them thru DAX would be most excellent since now I use a wonderful tool called QI Macros, but I have to go out of PP and into the excel data so I have learned a bit on using PQ to get good data tables for both PQ and my PI work. I wish I was better at PQ and PP and am working on it.

So you will find a rather large space available for new recruits into the PP and PBI world if control charts and then some other LSS tools can be easily developed with DAX. As an aside I believe Tableau has R integrated into it maybe tree or so years ago?

Carl
Lean Six Sigma Black Belt
PP, PQ,PBI Padawan

September 2, 2017 at 10:51 am

Epic post! ROB thank you so much for the time and energy put on these posts! I just wanted to say that i think R and Power Pivot/ DAX serve different analysis purposes. For instance, you cant do machine learning algorithms with DAX while you can do machine learning processes with R. Also, i don’t agree when you say, somewhere in the post that R is used more to analyze static data. In R, you can perfectly set a predefined algorithm to forecast time series data (yeah alright i know, you can do MA’s algorithms with DAX but you cant do ARIMA or SARIMA algorithms with it witch are more used in real world cases, imo) and run that algorithm in a scheduled windows task. You also have R server now, for SQL server and make R to analyze dynamic data from a SQL database for instance. 🙂

September 7, 2017 at 10:12 pm

the way i see it is , R can be used as a stop gate until a particular functionality is implemented in PowerBI, there is no reason M and DAX can not evolve to solved all the problems solved now by R

November 16, 2017 at 5:00 am

Just a couple of comments on the article and on some of the comments above:

1) R is a functional programming language at heart and is therefore very different from C or Java in spirit (that’s why C or Java programmers find R weird). Claiming (as done in the article) that R is procedural while DAX is functional betrays a lack of understanding of what R is.

2) R is widely used in academia and industry for problems having to do with statistical analysis, predictive modelling or machine learning, and the flexibility and power of analyses you can do with R surpasses anything you can do in DAX/powerBI by a LONG shot. Of course DAX could evolve to solve all the problems solved by R/python for data analysis, but that’s very unlikely to happen since R has a huge knowledge base, a huge hold on academia (where new statistical tools and algorithms are likely to be first implemented), and most importantly it is open source, which means it will always evolve faster than a proprietary tool/language. I mean, just look at what you can do with R in relation to, say, time series forecasting:
https://cran.r-project.org/web/views/TimeSeries.html

There’s no way powerBI or DAX could ever do that (they don’t have the skills, know how or the community to develop these solutions and keep up to date with recent developments), and to be honest they shouldn’t; they should focus on what they do best, i.e. fast data visualization for BI problems, and implement extensive support for R in powerBI for anything relating to statistical analysis, predictive modelling and so forth. Indeed, the people at Microsoft (who are smart) have invested a lot in R with the acquisition of revolution analytics and the ever increasing support for R in powerBI, SQL server and so forth is a smart move and the way to go in my opinion.

3) I’ve seen above a claim that R is not used in production. This is false; I personally use R in production and many big companies (Allianz, etc) do the same. Hell, I even use powerBI in production, for the development of proprietary dashboards…

4) Again, the claim that R can only deal with static datasets and doesn’t do well with dynamic slicing is false. If anything, R can be better than DAX/powerBI at this because through shiny (https://shiny.rstudio.com/) one can develop proper web applications (not dashboards, web apps) which can not only dynamically slice/visualize data, but even do prediction on parameters provided by the user at runtime (powerBI is trying to do this via its what-if parameters, which is great, but it’s no way close to what you can do with R shiny). In the end, R is so flexible and customizable that the sky is the limit

5) Just to make it clear, I love powerBI. I use it basically every day and it’s excellent for what it’s supposed to do. I just want wanted to point out what I think are misrepresentations of R vs PowerBI above. In the end, however, if I had to suggest what tool to learn to do data analysis for someone who’s just starting, I’d say go with R (or python) rather than DAX/powerBI, as there’s nothing you can do in DAX/powerBI that you cannot do in R/python but there’s a lot you can do in R/Python which you cannot do (or it would be very painful to do) with DAX/powerBI.

July 2, 2018 at 9:00 pm

Good comments spiritus87; however, when recommending a data analysis tool (as you do in point #5 when you recommend R), it’s important to keep in mind that “data analysis” means different things to different people. As a strategic finance and BI consultant (and financial analyst), data analysis in my domain means calculating cash flows, volume-cost-profit, project ROI, DuPont ROE, CapEx/Opex spend, variance vs expectation etc. And equally as important as the initial calculations is the ability to (1) dynamically shift parameters at the speed of thought for new insights and (2) present everything in an instant cognition manner (via interactive reports and dashboards) for dynamic, self-service exploration by end users–I was unaware that these are (apparently) R’s strengths.

Now, recently, as a one-off, I wanted to determine if an international equity mutual fund had a stronger relationship with a domestic index or an international index. So I ran a quick multi-variable regression in Excel with the mutual fund as the response and the two indexes as predictors and, voila, I had my answer with no fuss and no R. Now, this kind of thing comes up maybe once a year for a financial analyst and can easily be handled in Excel. Indeed, not everyone equates data analysis with statistics. Moreover, I use PQ/MDX, DAX, and Power Pivot for Excel every single day to aggregate vast amounts of data, I’m orders of magnitude more productive because of them, and I would almost literally be unable to function without them at this point.

Just another perspective.

August 13, 2018 at 5:23 am

DAX and R are both full functional languages but DAX has two huge drawbacks:
1) it is cursor reference based instead then matrix/vector based like R
2) functions cannot be piped like in R (using dplyr::)

January 29, 2019 at 11:55 pm

I have been using Power Bi PRO for a year but when I get near 3 million lines of data it takes hours some times to match data and refresh data. Is R faster for larger data sets? I have never used R but am looking for a faster solution for larger data. Someone said Tableau, another said R and another said SQL – we are a small consulting company that do Supply Chain analysis so we don’t have a lot of money.

March 2, 2019 at 11:01 am

Hi Rob, came across this post and agree with you RE control charts. It would be great to have a custom visual. Here’s a link to what I’ve done in DAX: https://harborview.com/process-behavior-chart. This design is meant to monitor the process in real-time. Red events are indicators of an event causing an excursion from expected variability and should be investigated. If a data point is purple it means 3 out of 4 values are closer to the upper or lower limit and are a leading indicator of a loss of control, a yellow data point means 8 values in a row have been on the same side of the mean (like getting 8 heads or tails in a row) and is a sign that the process has likely shifted and if the cause is not investigated and understood, will likely fail in the future.

Power BI (DAX and M) vs. R: A Summer of Perspective

Zoe Helps Me Answer a Long-Simmering Question

The “Exit Interview” on R vs. DAX and M…

Rob’s (Tentative) Conclusions

“Hey, what About M?”

Cancel reply