Data Quality Management

Cancelled Posted Jun 11, 2014 Paid on delivery

The aim of my project is reference data automation. I have data in SQL Server (Database 1 and Database 2). The data is not in a form that we can access directly, so first I had to analyse the database tables. I then categorized the data into valid and invalid and sent it to the SMEs for review.

After that, I started grouping the data into the corresponding groups and sub groups in a separate table called the Group table. The same steps were followed for both databases, and the result was again sent for review to check whether the grouping was done correctly. I then collected the distinct records (reference data ids, groups and sub groups) from both databases and inserted them into the Global table.

If a new database such as Database 3 arrives in future, there is no need to repeat the analysis by hand: I have created a stored procedure that does the data categorization, the grouping and the maintenance of the Global table.
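The categorize-then-merge flow described above can be sketched roughly as follows. This is only an illustration in plain Python, not the actual SQL Server implementation: the sample rows, the names `categorize` and `build_global`, and the rule that a blank or missing code value counts as invalid are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical sample rows: (data_id, code_value, group, sub_group).
DATABASE_1 = [
    (1, "Male", "Gender", "Person"),
    (2, "Female", "Gender", "Person"),
    (3, None, "Gender", "Person"),    # missing code value, treated as invalid
]
DATABASE_2 = [
    (1, "M", "Gender", "Person"),
    (2, "F", "Gender", "Person"),
    (4, "  ", "Country", "Address"),  # blank code value, treated as invalid
]


def categorize(rows):
    """Split rows into (valid, invalid). Assumed rule: the code value must be non-blank."""
    valid, invalid = [], []
    for row in rows:
        code = row[1]
        if code is not None and code.strip():
            valid.append(row)
        else:
            invalid.append(row)
    return valid, invalid


def build_global(sources):
    """Collect distinct (data_id, group, sub_group) keys and the sources that contain them."""
    global_table = defaultdict(set)
    for name, rows in sources.items():
        valid, _invalid = categorize(rows)
        for data_id, _code, group, sub_group in valid:
            global_table[(data_id, group, sub_group)].add(name)
    return dict(global_table)


global_table = build_global({"database 1": DATABASE_1, "database 2": DATABASE_2})
for key in sorted(global_table):
    print(key, sorted(global_table[key]))
```

In this sketch the invalid rows are simply set aside; in the process described above they would go to the SMEs for review before anything is loaded.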

FEATURES :

Triggers are created on the Global table so that every update is propagated to all group tables, instead of going to each group table and updating it manually.

A stored procedure takes as a parameter the source table on which we want to do the analysis.
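Both features can be illustrated with a small sketch. It uses Python's built-in sqlite3 module as a stand-in for SQL Server, so the trigger syntax differs from T-SQL, and a Python function plays the role of the stored procedure. The table names (`global_table`, `group_gender`, `database_1`), the column names and the function `analyze_source` are hypothetical, and only one group table is shown for brevity.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE global_table (data_id INTEGER PRIMARY KEY, group_name TEXT, sub_group TEXT);
CREATE TABLE group_gender (data_id INTEGER PRIMARY KEY, group_name TEXT, sub_group TEXT);

-- Trigger: an update to a row of the global table is pushed to the group table.
CREATE TRIGGER trg_global_update AFTER UPDATE ON global_table
BEGIN
    UPDATE group_gender
       SET group_name = NEW.group_name, sub_group = NEW.sub_group
     WHERE data_id = NEW.data_id;
END;

-- Hypothetical source table with one valid and one invalid row.
CREATE TABLE database_1 (data_id INTEGER, code_value TEXT, group_name TEXT, sub_group TEXT);
INSERT INTO database_1 VALUES (1, 'Male', 'Gender', 'Person'), (2, NULL, 'Gender', 'Person');
""")


def analyze_source(conn, source_table):
    """Load the valid, distinct records of source_table into the global and group tables."""
    # Table names cannot be bound as SQL parameters, so check the name against the catalog.
    known = {row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table'")}
    if source_table not in known:
        raise ValueError(f"unknown source table: {source_table}")
    rows = conn.execute(
        f'SELECT DISTINCT data_id, group_name, sub_group FROM "{source_table}" '
        "WHERE code_value IS NOT NULL AND TRIM(code_value) <> ''"
    ).fetchall()
    with conn:  # commit both inserts together
        conn.executemany("INSERT OR IGNORE INTO global_table VALUES (?, ?, ?)", rows)
        conn.executemany("INSERT OR IGNORE INTO group_gender VALUES (?, ?, ?)", rows)
    return len(rows)


loaded = analyze_source(conn, "database_1")
# A single update on the global table; the trigger carries it to the group table.
conn.execute("UPDATE global_table SET sub_group = 'Individual' WHERE data_id = 1")
group_rows = conn.execute("SELECT * FROM group_gender").fetchall()
print(loaded, group_rows)
```

Checking the table name against the catalog before building the query is one way to keep a table-name parameter safe; a T-SQL procedure using dynamic SQL would need a comparable check.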

Example :

Data id = 1 has code value 'Male' in Database 1

Data id = 1 has code value 'M' in Database 2

The two database tables are inconsistent, but the meaning is the same. I have fixed this problem with the steps above.
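One way to picture this kind of reconciliation is a lookup that maps each variant to a single canonical code value. The mapping `CANONICAL` and the function `normalize` below are assumptions made for illustration; the posting does not say how the equivalence between 'Male' and 'M' is actually stored.

```python
# Hypothetical synonym map, as might be agreed with the reviewers:
# every variant points to one canonical code value.
CANONICAL = {"male": "Male", "m": "Male", "female": "Female", "f": "Female"}


def normalize(code_value):
    """Return the canonical code value, or None if the value is not recognised."""
    if code_value is None:
        return None
    return CANONICAL.get(code_value.strip().lower())


db1 = {1: "Male"}  # data id -> code value in Database 1
db2 = {1: "M"}     # data id -> code value in Database 2

for data_id in sorted(db1.keys() & db2.keys()):
    a, b = normalize(db1[data_id]), normalize(db2[data_id])
    print(data_id, a, b, "consistent" if a == b else "conflict")
```

Values that the map does not recognise come back as None, so they can be routed to review rather than silently merged.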

Finally

The Group table has the following columns:

[...] Name

[...] Group

[...] Id

[...] Flag

The Global table has the following columns:

[...] Name

[...] Group

[...] Id

[...] Name (database 1, database 2, database 3, etc.)

[...] (count of code values from database 1, database 2, database 3, etc.)

[...] 1 (flag showing whether the data id exists, Y/N)

[...] 2 (flag showing whether the data id exists, Y/N)

Microsoft SQL Server

Project ID: #6055452

About the project

Remote project Active Jun 11, 2014