New England College Data Mining and Statistical Modeling Case Study Project

Description

Final Case Analysis

Data for this can be found here:

https://www.epa.gov/compliance-and-fuel-economy-data/data-cars-used-testing-fuel-economy

Please use 2014-Present. You will have to merge the files yourself.

Definitions:

https://www.epa.gov/sites/default/files/2016-07/documents/test-car-list-definitions.pdf

You should begin the Final Case Analysis this week. It is due in Week 13, Sunday 11:59 PM EST.

Introduction:

The Environmental Protection Agency (EPA) is responsible for regulating pollutant emissions from all automobiles that run on American roads. You are asked to analyze the data released by the EPA over more than a decade, specifically for three time periods: 2010-12, 2014-16, and 2018-20. This case analysis has several objectives, one of which is to test for and learn about possible changes in the amount of pollution emitted by vehicles over time. You are also asked to analyze similarities between vehicles across the three time periods and to determine empirically whether certain vehicles became more (or less) polluting over the period of study.
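As a starting point for comparing the three periods, each record can be tagged with the period its year falls in. The following is a minimal sketch in base R, not a required method: the function name `assign_period` is hypothetical, and it assumes you already have a numeric vector of years for your records. Years outside the three periods are labeled `NA`.

```r
# Hypothetical helper: map a vector of years to the three study periods
# named in the brief. Years outside those periods become NA.
assign_period <- function(year) {
  period <- rep(NA_character_, length(year))
  period[year %in% 2010:2012] <- "2010-12"
  period[year %in% 2014:2016] <- "2014-16"
  period[year %in% 2018:2020] <- "2018-20"
  # Return a factor so the periods keep their chronological order
  # in tables and plots.
  factor(period, levels = c("2010-12", "2014-16", "2018-20"))
}

# Example call on a few illustrative years.
periods <- assign_period(c(2010, 2013, 2015, 2020))
```

Returning a factor with fixed levels keeps the periods in chronological order when they are later used for grouping or plotting.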

You will analyze various aspects of vehicle-induced pollution using R programming. You are expected to submit your findings as a report. The report must be at least 20 pages long, with written descriptions and explanations of your findings for the questions asked below. Make sure to run all code in R Markdown and create a formal report with your remarks, comments, or explanations embedded within the document.

You are given nine years of individual EPA data in CSV format. The data files are not very large (each file is approximately 1 MB). Each yearly file contains thousands of vehicles along with their vital information and pollution testing records. Each file contains 42 columns, the details of which are given in the Data Dictionary document. Please note that the original data had more columns, and some of them were removed for consistency. The removed columns still appear in the data dictionary; ignore them when referring to it.
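Since the yearly files have to be merged by hand, one possible approach is to read each CSV and stack the rows into a single data frame. The sketch below uses base R only. The function name `merge_epa_files`, the added `source_year` column, and the assumption that each file name contains a four-digit year (for example `cars_2014.csv`) are illustrative choices, not part of the assignment.

```r
# Hypothetical helper: read several yearly CSV files and stack them.
# Assumes each file name contains a four-digit year, e.g. "cars_2014.csv";
# that naming scheme is an assumption, not something the brief specifies.
merge_epa_files <- function(paths) {
  frames <- lapply(paths, function(p) {
    df <- read.csv(p, stringsAsFactors = FALSE)
    # Pull the first four-digit run out of the file name to tag each row.
    m <- regmatches(basename(p), regexpr("[0-9]{4}", basename(p)))
    year <- if (length(m) == 1) as.integer(m) else NA_integer_
    df$source_year <- rep(year, nrow(df))
    df
  })
  # Keep only columns present in every file so rbind() does not fail
  # if one file carries a stray extra column.
  common <- Reduce(intersect, lapply(frames, names))
  merged <- do.call(rbind, lapply(frames, function(df) df[, common, drop = FALSE]))
  rownames(merged) <- NULL
  merged
}

# Example usage (the folder name "data" is a placeholder):
# files <- list.files("data", pattern = "\\.csv$", full.names = TRUE)
# epa <- merge_epa_files(files)
```

Restricting to the shared columns is a defensive choice. If all files really do have the same 42 columns, it changes nothing, and comparing `ncol(epa)` against the expected count is a quick check that no column was dropped.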

There are three sections to this case study: Merging and cleaning (20 points), Data Analysis (60 points), and Visualization (20 points), totaling 100 points.

Please note that all code assignments must be submitted as a screenshot with a slice of your desktop showing the timestamp.
