Description
Problem 1: (10+10+10+10+10+10+40 = 100 points)
WARNING: Please show your OWN work. Cheating is easily detected, and any such submission will not be graded.
For this question, please refer to the file called manufacturing defects.txt while reading the text below.
In each year from 2000 to 2019, the number of manufacturing defects at auto manufacturers was counted.
The data were collected from 14 different auto manufacturing companies. The number of defects for each company is given in the 14 columns following the year column. Assume that the number of manufacturing defects
per auto company per year is a random variable following a Poisson(λ) distribution and that the numbers of defects in different
companies or in different years are independent.
(Note: You should write code to carry out the calculations for each of the following subproblems. You are free to use
any programming language (Python, R, C, C++, Java) and its related libraries.)
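The layout described above (a year column followed by 14 company columns) could be loaded along these lines. This is only a sketch: it assumes whitespace- or comma-separated integers and an optional header row, none of which the assignment states, and `parse_defects` is a hypothetical helper name.

```python
from typing import Dict, List


def parse_defects(text: str) -> Dict[int, List[int]]:
    """Map each year to its list of per-company defect counts."""
    table: Dict[int, List[int]] = {}
    for line in text.splitlines():
        # Assumption: fields are separated by whitespace and/or commas.
        tokens = line.replace(",", " ").split()
        # Skip blank lines and any header row (first token is not a number).
        if not tokens or not tokens[0].isdigit():
            continue
        table[int(tokens[0])] = [int(t) for t in tokens[1:]]
    return table


# Hypothetical two-company excerpt, NOT taken from the real file.
sample = "Year C1 C2\n2000 0 2\n2001 1 0\n"
rows = parse_defects(sample)
print(rows)
```

For the real data, the text would come from the provided file instead, for example `parse_defects(open(path).read())` with `path` set to wherever the file is stored.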
(a) Give a table showing, for each number of defects (# of Defects), how many cases occur across all companies
between 2000 and 2019.
Hint: When you check the file you will see: # of Defects = {0, 1, 2, 3, 4}.
(b) Estimate λ from the given data.
(c) Extend Table 1 into Table 2 by adding the Poisson-predicted number of cases computed with the estimated λ.
(d) Draw a barplot of the actual cases (Table 2, column 2) and the predicted cases (Table 2, column 3) with
respect to # of defects. Include the figure in your submission.
| \# of Defects | \# of cases in all companies between the years |
|---|---|
| 0 | |
| 1 | |
| 2 | |
| 3 | |
| 4 | |

Table 1: Actual cases
| \# of Defects | \# of cases in all companies between the years | Predicted \# of cases in all companies between the years |
|---|---|---|
| 0 | | |
| 1 | | |
| 2 | | |
| 3 | | |
| 4 | | |

Table 2: Actual vs. Predicted Cases
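The predicted column of Table 2 follows from the Poisson probability mass function. As a sketch, assume λ is estimated by maximum likelihood, which for a Poisson sample is the sample mean; the assignment does not prescribe an estimator. The symbols N (total number of company-year cells) and n_k (actual number of cases with k defects) are introduced here for illustration only:

$$
\hat{\lambda} = \frac{1}{N}\sum_{k} k\, n_k,
\qquad
\text{Predicted}_k = N \cdot \frac{e^{-\hat{\lambda}}\,\hat{\lambda}^{k}}{k!},
\quad k = 0, 1, \dots, 4.
$$

With 14 companies over the 20 years from 2000 to 2019, N = 14 × 20 = 280, provided no entries are missing. Because the Poisson distribution also places some probability on k > 4, the predicted cases for k = 0 to 4 will sum to less than N.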
(e) According to the barplot in (d), does the Poisson distribution fit the data well? Compare the values of the
actual cases with the values of the Poisson-predicted cases, and comment on how well the distribution
performs.
(f) Based on your estimates above, give your opinion considering your barplot and Table 2. Which
company would you prefer to buy a car from? Why?
(g) Paste the code that you implemented for the subproblems above. Do not forget to comment
your code.
Example:
• The common code block for all subproblems
Paste here. Your code should read the file and compute anything else that the following subproblems
need.
• The code block for (a)
Paste here. Your code should compute the values in Table 1 column 2.
• The code block for (b)
Paste here. Your code should compute λ.
• The code block for (c)
Paste here. Your code should compute the values in Table 2 column 3.
• The code block for (d)
Paste here. Your code should draw the barplot.
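As one possible shape for the blocks for (a) to (c), the sketch below uses only the Python standard library. It assumes that `values` is a flat list of every company-year defect count (however it was loaded) and that λ is estimated by the sample mean, which is the maximum-likelihood estimate for a Poisson sample. The function names are hypothetical and the numbers are made up for illustration, not taken from the real file.

```python
import math
from collections import Counter
from typing import Dict, List


def count_cases(values: List[int]) -> Dict[int, int]:
    """Table 1, column 2: how many company-year cells show k defects."""
    counts = Counter(values)
    return {k: counts.get(k, 0) for k in range(max(values) + 1)}


def estimate_lambda(values: List[int]) -> float:
    """Sample mean, i.e. the maximum-likelihood estimate of the Poisson rate."""
    return sum(values) / len(values)


def predicted_cases(lam: float, n_cells: int, max_k: int) -> Dict[int, float]:
    """Table 2, column 3: n_cells * P(X = k) for k = 0..max_k."""
    return {
        k: n_cells * math.exp(-lam) * lam ** k / math.factorial(k)
        for k in range(max_k + 1)
    }


# Hypothetical flattened observations, NOT the real data.
values = [0, 1, 0, 2, 1, 0, 0, 1]
actual = count_cases(values)
lam = estimate_lambda(values)
predicted = predicted_cases(lam, len(values), max(actual))
for k in actual:
    print(k, actual[k], round(predicted[k], 2))
```

The `actual` and `predicted` dictionaries share the same keys, so they can be passed side by side to whichever plotting library is used for (d).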