In addition, suppose that we have the same data as before and want to create a list of the states that appear in our dataset. Among the values, we have Hawaii, Hawai, and Howaii. We don't want all three values in our final list; we want a single state: Hawaii. If we try to deduplicate the data with the Unique rows step, we will still have three values, because that step only treats identical values as duplicates. The solution is to fix the values with a fuzzy search algorithm first, and only then do the deduplication. This doesn't differ much from the previous solution:
- Open the transformation you just created and save it under a different name.
- Run a preview of the Fuzzy match step. In the preview window, click on the title of the match column to ...
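
Outside of the tool, the idea of fixing the values first and deduplicating afterwards can be illustrated with a short Python sketch. This is only an illustration under assumptions: the reference list `REFERENCE_STATES`, the helper names, and the `cutoff` value are hypothetical, and the standard library's `difflib` is used as a stand-in similarity measure, not as the algorithm applied by the Fuzzy match step.

```python
from difflib import get_close_matches

# Hypothetical reference list of valid state names (an assumption for this sketch).
REFERENCE_STATES = ["Hawaii", "Alaska", "Texas"]


def normalize_state(value, reference=REFERENCE_STATES, cutoff=0.8):
    """Return the closest reference name, or the value unchanged if none is close enough."""
    matches = get_close_matches(value, reference, n=1, cutoff=cutoff)
    return matches[0] if matches else value


def unique_states(values):
    """Fix each value with the fuzzy lookup first, then deduplicate the results."""
    normalized = (normalize_state(v) for v in values)
    return sorted(set(normalized))


raw = ["Hawaii", "Hawai", "Howaii", "Texas"]
print(unique_states(raw))  # ['Hawaii', 'Texas']
```

Deduplicating `raw` directly would keep all three spellings of Hawaii, since they are different strings. The `cutoff` threshold is a judgment call: too low and distinct names may be merged, too high and misspellings slip through.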