This project is mostly based on issues with data quality
Description
This project is mostly based on issues with data quality: see more carefully chapter 2 Data and Sampling Distributions from 2actical Statistics for Data Scientists Cover in the project the following:
Explain figures 2.5 and 2.6 from the text book Practical Statistics for Data Scientists chapter 2 Data and Sampling Distributions .
Discuss the following:
- How can you tell if the data is an outlier or if it is something important?
- Which data is the noise and how is the noise different from outliers?
When there are missing values, explain the pros and cons of the following strategies:
- Elimination of Data Objects
- Estimation of Missing Values
- What are the limitations of analyzing real data with missing values and why is it impossible to really know such data?
Have a similar assignment? "Place an order for your assignment and have exceptional work written by our team of experts, guaranteeing you A results."