Data Cleaning
Data Cleaning is the process of obtaining data by Finguine from various sources and organizing it by fixing mistakes, inconsistencies, duplicate entries, and other issues. Once data collected by Finguine, data is examined for "integrity" and quality in several domains, such as:
- Data Accuracy
- Completeness
- Consistency
- Validity
- Uniformity
In the data cleaning process, the system can perform the following tasks:
- Special Character Removal
- Leading and Trailing Space Removal
- Alpha Only Columns – Conversion of Zeros to NULLs
- Numeric Columns – Conversion of NULLs to Zeros
- Case Conversion (Upper, Lower, and Proper)
- Email Cleaner (account, sub-domain, domain, and country)
To understand the data cleaning processes, the application displays the successful completion of data cleaning, and the resultant data is stored under different categories as listed below:
- Rejected Account Holder: This file gives the list of rejected records where the Account Holder ID is missing.
- Valid Account Holders: This file contains the data of valid Account Holders where the Unit ID is present.
Upon successful completion of the data cleaning process, the system will display the message to the user "Data cleaning process is completed successfully".
