We primarily work on data warehouses, where data quality issues are very common. In this step-by-step SQL tutorial I will show how you can deal with certain types of data quality issues and resolve them using the SQL SOUNDEX function. But before we start, I will explain when you can use it and when you should NOT use it.
Microsoft's SOUNDEX page explains the function, but as always in a very technical way, without much guidance on when you should use it and when you shouldn't. So let's start from the very beginning.
What is the SOUNDEX function?
SQL Server SOUNDEX is a function that converts a string (a word) into a four-character code: a letter followed by three digits. The code represents the "sound" of the word, so you can apply the function to another word and see whether the two "sound" the same. If the four-character codes are the same, then SOUNDEX treats the words as sounding the same.
Here is a simple SQL query example:
SELECT SOUNDEX('Katie') --returns K300
SELECT SOUNDEX('Emil') --returns E540
As you can see, Katie and Emil produce different codes, so they don't sound the same. Let's look at another example.
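For a case where the codes do match, here is a minimal sketch. The two spellings below are picked purely for illustration and do not come from a real data set. It shows two variants of the same surname producing the same code, plus a direct comparison of the two codes.

```sql
-- Illustrative names only: two spellings of the same surname
SELECT SOUNDEX('Smith') --returns S530
SELECT SOUNDEX('Smyth') --returns S530

-- Compare the two codes directly
SELECT CASE
         WHEN SOUNDEX('Smith') = SOUNDEX('Smyth') THEN 'sounds the same'
         ELSE 'sounds different'
       END AS comparison --returns 'sounds the same'
```

Because the vowels and the letter "y" are ignored after the first character, both spellings reduce to S530. This is the kind of match that can help when the same name has been entered with slightly different spellings.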