Description
Problem 1 (100 points)
Homework 1 considers a Covid-19 dataset that is published on GitHub. Please download the dataset, in
whichever file format you prefer, from the links shown in Figure 1. The dataset is updated daily and
includes data on confirmed cases, deaths, hospitalizations, testing, and vaccinations, as well as other variables
of potential interest.
Figure 1: The complete dataset links
The dataset has the following basic columns:
• iso code: Short name of the country
• continent: The continent where the country exists
• location: The country name
• date: The date on which the values of the other variables were recorded.
You are responsible for implementing a program that reads the dataset from the file and computes
the answers to the following questions. Any programming language you prefer will be accepted.
Commenting every function you implement is a must. The answer to each question must be appended to a file
called "output{.csv, .txt}". The file contains the answers to the first 18 questions listed below. The answer to the
19th question will be written in this document.
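The read-then-append workflow described above could be sketched as follows. This is only an illustration in Python under stated assumptions, not a required design: the column names `iso_code` and `total_cases`, the inline sample rows, and the helper names `read_rows`, `count_countries`, and `append_answer` are all hypothetical stand-ins, and the real file has many more columns.

```python
import csv
import io

# Tiny made-up stand-in for the downloaded CSV (assumed column names).
SAMPLE_CSV = """iso_code,continent,location,date,total_cases
AFG,Asia,Afghanistan,2020-02-24,5
AFG,Asia,Afghanistan,2020-02-25,5
TUR,Asia,Turkey,2020-03-11,1
"""


def read_rows(source):
    """Return the dataset rows as a list of dicts keyed by column name."""
    return list(csv.DictReader(source))


def count_countries(rows):
    """Count the distinct values of the location column."""
    return len({row["location"] for row in rows})


def append_answer(path, question_number, lines):
    """Append one question's answer to the output file.

    Opening in "a" mode keeps the answers written for earlier questions.
    """
    with open(path, "a", encoding="utf-8") as out:
        out.write(f"Question {question_number}\n")
        for line in lines:
            out.write(line + "\n")


rows = read_rows(io.StringIO(SAMPLE_CSV))
print(count_countries(rows))
```

With the real data, `read_rows` would receive a file opened with `open(path, newline="", encoding="utf-8")` instead of the in-memory sample, and a call such as `append_answer("output.txt", 1, [str(count_countries(rows))])` would add the first answer to the output file.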
1. How many countries does the dataset contain?
2. What is the earliest date on which data were recorded for any country? Which country is it?
3. How many cases have been confirmed in each country so far? Print each country paired with its total cases.
4. How many deaths have been confirmed in each country so far? Print each country paired with its total deaths.
5. What are the average, minimum, maximum and variation values of the reproduction rates for each country?
Table 1: The format of the output for questions 5, 6, 7, 8, 9, 10, 12, and 13.
Country   minimum   maximum   average   variation
value     value     value     value     value
6. What are the average, minimum, maximum and variation values of the icu patients (intensive care unit
patients) for each country?
7. What are the average, minimum, maximum and variation values of the hosp patients (hospital patients)
for each country?
8. What are the average, minimum, maximum and variation values of the weekly icu (intensive care unit)
admissions for each country?
9. What are the average, minimum, maximum and variation values of the weekly hospital admissions for
each country?
10. What are the average, minimum, maximum and variation values of new tests per day for each country?
11. How many tests have been conducted in total in each country so far?
12. What are the average, minimum, maximum and variation values of the positive rates of the tests for each
country?
13. What are the average, minimum, maximum and variation values of the tests per case for each country?
14. How many people have received at least one vaccine dose in each country?
15. How many people are fully vaccinated in each country?
16. How many vaccinations have been administered in each country so far?
17. List the following information for each country: population, median age, number of people aged 65 or
older, number of people aged 70 or older, economic performance, death rates due to heart disease, diabetes
prevalence, number of female smokers, number of male smokers, handwashing facilities, hospital beds per
thousand people, life expectancy, and human development index.
Table 2: The format of the output for question 17
Country   population   median age   # of people aged 65 or older
value     value        value        value
18. Summarize all the results obtained for the first 17 questions (except question 2).
Table 3: The format of the output for question 18
Country   q#3     q#4     q#5 min   q#5 max   q#5 avg   q#5 var
value     value   value   value     value     value     value
19. Comment on the results based on your observations. Give your opinion on the reasons for increasing
infection rates, using examples from the results. Feel free to explain any situation that you observe.
The more observations you make, the more opportunities you will have in the second homework.
(Solution) (Write your observations here.)
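For the per-country statistics requested in questions 5 through 13, one possible sketch in Python is shown below. It rests on several assumptions that the assignment does not state: "variation" is read as the population variance, empty cells are skipped, and a country with no values for a column is omitted. The column name `reproduction_rate`, the sample rows, and the helper `column_stats` are hypothetical.

```python
import statistics
from collections import defaultdict


def column_stats(rows, column):
    """Return {country: (minimum, maximum, average, variance)} for one column.

    Empty or missing cells are skipped (an assumption); countries without
    any value for the column do not appear in the result.
    """
    values = defaultdict(list)
    for row in rows:
        cell = row.get(column)
        if cell:  # skips None and the empty string
            values[row["location"]].append(float(cell))
    return {
        country: (min(v), max(v), statistics.fmean(v), statistics.pvariance(v))
        for country, v in values.items()
    }


# Made-up rows in the shape that csv.DictReader would produce.
rows = [
    {"location": "A", "reproduction_rate": "1.0"},
    {"location": "A", "reproduction_rate": "3.0"},
    {"location": "A", "reproduction_rate": ""},
    {"location": "B", "reproduction_rate": "2.0"},
]

# Columns follow the order of Table 1: minimum, maximum, average, variation.
for country, (low, high, avg, var) in column_stats(rows, "reproduction_rate").items():
    print(country, low, high, avg, var)
```

If the sample variance is intended instead, `statistics.variance` could replace `statistics.pvariance`, but it needs at least two values per country.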