Coursera | Introduction to Data Analytics(IBM) | Final Assignment
其他鏈接:
假裝有筆記(有時間寫出來)
Coursera | Introduction to Data Analytics(IBM) | Quiz答案
Coursera | Introduction to Data Analytics | Final Assignment
- Final Assignment: Data Analysis in Action
- Task
- My submission
- Rubric(評分標準/參考答案)
這個assignment也不難,仔細點沒有問題 😃
Final Assignment: Data Analysis in Action
又是一個peer review的assignment,加油~
Task
Using Data Analysis for Detecting Credit Card Fraud
Companies today are employing analytical techniques for the early detection of credit card frauds, a key factor in mitigating fraud damage. The most common type of credit card fraud does not involve the physical stealing of the card, but that of credit card credentials, which are then used for online purchases.
Imagine that you have been hired as a Data Analyst to work in the Credit Card Division of a bank. And your first assignment is to join your team in using data analysis for the early detection and mitigation of credit card fraud.
In order to prescribe a way forward, that is, suggest what should be done in order for fraud to get detected early on, you need to understand what a fraudulent transaction looks like. And for that you need to start by looking at historical data.
Here is a sample data set that captures the credit card transaction details for a few users.
Descriptive techniques of analysis, that is, techniques that help you gain an understanding of what happened, include the identification of patterns and anomalies in data. Anomalies signify a variation in a pattern that seems uncharacteristic, or, out of the ordinary. Anomalies may occur for perfectly valid and genuine reasons, but they do warrant an evaluation because they can be a sign of fraudulent activity.
Past studies have suggested that some of the common events that you may need to watch out for include:
- A change in frequency of orders placed, for example, a customer who typically places a couple of orders a month, suddenly makes numerous transactions within a short span of time, sometimes within minutes of the previous order.
Orders that are significantly higher than a user’s average transaction. - Bulk orders of the same item with slight variations such as color or size—especially if this is atypical of the user’s transaction history.
- A sudden change in delivery preference, for example, a change from home or office delivery address to in-store, warehouse, or PO Box delivery.
- A mismatched IP Address, or an IP Address that is not from the general location or area of the billing address.
Before you can analyze the data for patterns and anomalies, you need to:
- Identify and gather all data points that can be of relevance to your use case. For example, the card holder’s details, transaction details, delivery details, location, and network are some of the data points that could be explored.
- Clean the data. You need to identify and fix issues in the data that can lead to false or incomplete findings, such as missing data values and incorrect data. You may also need to standardize data formats in some cases, for example, the date fields.
Finally, when you arrive at the findings, you will create appropriate visualizations that communicate your findings to your audience. The graph below samples one such visualization that you would use to capture a trend hidden in the sample data set shared earlier on in the case study.
In the next section you will be asked to answer the following 5 (five) questions based on this case study:
My submission
我自己的答案,僅供參考~ 歡迎指出問題與討論 😃
Rubric(評分標準/參考答案)
進行peer review的時候可以看到評分標準,也包括參考答案~
總結
以上是生活随笔為你收集整理的Coursera | Introduction to Data Analytics(IBM) | Final Assignment的全部內容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: 七月算法机器学习笔记2--机器学习中的数
- 下一篇: mysql execute 存储过程_M