Power BI Antipatterns #12: Model case sensitivity

If you’re accustomed to performing merges in Power Query, you probably know that M is case-sensitive by default. In M, values should match exactly for a merge to work unless you do a fuzzy merge. In this blog post, I’m discussing case sensitivity in data models.

Here’s the basic antipattern that I see from time to time:

ProductBKeyForJoin = UPPER('Product'[SKU])

It’s a calculated column created specifically for a relationship by using the UPPER or LOWER functions in DAX. Why is this a bad pattern? Because relationships in Power BI aren’t case-sensitive. Therefore, you create a copy of a column and use precious resources for no good reason.

What’s more, when Power BI loads data, it’ll minimize the different cases. Here’s an illustration of what I mean — use the Enter data feature in Power Query and create the following table:

Column1
A
a

If you enable column distribution in Power Query Editor, you’ll see that there are two distinct values:

Note: what’s the difference between distinct and unique? Distinct is the number of different values; unique is the number of values that appear exactly once.

Once you’ll load this table, the Data view will show just one distinct value:

Again, relationships also won’t care about the case, as long as the values match, so no need to create copies of your columns with some consistent case.

To be continued…

  • Jes Hansen

    Little known fact; DAX is case-insensitive and accent-sensitive.
    EVALUATE
    {“a” = “A”, “a” = “á”, “á” = “Á”}