FetchRewards

Challenge 1: Diagram a New Structured Relational Data Model

The Relational Data Model consisted of four tables such as Users, Brands, Receipts and RewardsReceiptsItemList.

Challenge 2: Queries for the questions from business stakeholders

The queries/answers to the questions are in the file with the title “SQL_Queries.pdf”

Challenge 3: Data Quality Issues in the Data

The data quality issues and solutions to overcome these issues are discussed in the Jupyter Notebook with the title “FetchRewards.ipynb” Here, the data is converted from JSON to CSV format for the easier execution of data analysis part and for the data analysis and exploration purpose, Python programming language is used.

Some of the data quality issues faced during the exploration of the files (Users, Brands, Receipts):

Null/NA Values present
Duplicate records – there are multiple instances where duplicate IDs with respective attributes are duplicated
Wrong formatting of the data – Date is stored in the 13-digit number instead of standard format of timestamp, Title names are written in the wrong format

Through data cleaning and formatting processes, data quality issues from the data have been removed. For these processes, several python functions, and libraries (Pandas, NumPy) are used.

Also, many records from rewardsReceiptItemList possess barcode as ‘4011’ which is nothing but ‘Item not found’ case. As cost of these products are getting added to the Total Spent, it is not correct to remove these products directly during data cleaning process.

Addition to this, this column has many items barcodes whose values are not matching with the barcodes from the Brands table. So, it is difficult to track them down. Therefore, the join between the tables Brands and RewardsReceiptsItemList is with brand name -> description and not with the barcode.

Challenge 4: Communicate with stakeholders

The email to business people/ Stakeholders is in the file with the title "Email.pdf"

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
CSVs		CSVs
JSON		JSON
ERDiagram.JPG		ERDiagram.JPG
ERDiagram.pdf		ERDiagram.pdf
Email.pdf		Email.pdf
FetchRewards.ipynb		FetchRewards.ipynb
Question.pdf		Question.pdf
README.md		README.md
SQL_Queries.pdf		SQL_Queries.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FetchRewards

Challenge 1: Diagram a New Structured Relational Data Model

Challenge 2: Queries for the questions from business stakeholders

Challenge 3: Data Quality Issues in the Data

Challenge 4: Communicate with stakeholders

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FetchRewards

Challenge 1: Diagram a New Structured Relational Data Model

Challenge 2: Queries for the questions from business stakeholders

Challenge 3: Data Quality Issues in the Data

Challenge 4: Communicate with stakeholders

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages