From the course: Power BI Weekly

Clustering values in Dataflows

- [Instructor] Fuzzy Matching is one of my favorite features in Power BI. We might also see it presented in Power BI as the cluster values algorithm. The data scientists at Microsoft wrote the Fuzzy Matching algorithm for us behind the scenes, and we can connect our own data to it as a function in Power BI. At a high level, Fuzzy Matching uses Natural Language Processing, often called NLP, and clustering to help us correct issues such as spelling mistakes. If you'd like to learn more about Fuzzy Matching by playing around with an algorithm yourself, you can check out options like the fuzzywuzzy library in Python. There's a lot to this algorithm that happens behind the scenes, but here's how we can think about how it works at a high level. Let's say we have a sample set of different words. We can see that some of these words are spelled correctly and others incorrectly. Natural Language…
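
To make the idea of clustering misspelled values concrete, here is a minimal Python sketch. It is not the algorithm Power BI uses, which the transcript does not describe in detail. It substitutes the standard library's `difflib.SequenceMatcher` as a simple string-similarity score, and the sample words, the `0.8` threshold, and the function names `cluster_values` and `canonical_map` are all assumptions made for illustration.

```python
from collections import Counter
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Case-insensitive similarity score between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def cluster_values(values, threshold=0.8):
    """Greedily group values whose similarity to a cluster's first
    member meets the threshold (an assumed cutoff, not Power BI's)."""
    clusters = []
    for value in values:
        for cluster in clusters:
            if similarity(value, cluster[0]) >= threshold:
                cluster.append(value)
                break
        else:
            # No existing cluster was close enough, so start a new one.
            clusters.append([value])
    return clusters


def canonical_map(values, threshold=0.8):
    """Map every value to the most frequent spelling in its cluster."""
    mapping = {}
    for cluster in cluster_values(values, threshold):
        canonical = Counter(cluster).most_common(1)[0][0]
        for value in cluster:
            mapping[value] = canonical
    return mapping


# Hypothetical sample data: a mix of correct and misspelled city names.
words = ["Seattle", "Seatle", "Boston", "Seattel",
         "Bostn", "Seattle", "Boston", "Chicago"]

mapping = canonical_map(words)
cleaned = [mapping[w] for w in words]
print(cleaned)
```

Each value is compared with the first member of every existing cluster, so the result depends on input order. That is a deliberate simplification to keep the sketch short; picking the most frequent spelling as the canonical one is likewise just one reasonable choice.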
