Description
1. WATEROIL. In the oil industry, water that mixes with crude oil during production and
transportation must be removed. Chemists have found that the oil can be extracted from the
water/oil mix electrically. Researchers at the University of Bergen (Norway) conducted a series of
experiments to study the factors that influence the voltage (y) required to separate the water
from the oil (Journal of colloid and interface science, Aug. 1995). The seven independent variables
investigated in the study are listed in the table. (Each variable was measured at two levels – a
“low” level and a “high” level.) Sixteen water/oil mixtures were prepared using different
combinations of independent variables; then each emulsion was exposed to a high electric field.
In addition, three mixtures were tested when all independent variables were set to 0. The
variables are given in the table below.
Experiment number
y: voltage (kw/cm)
x1: disperse phase volume (%)
x2: salinity (%)
x3: temperature (0C)
x4: time delay (hours)
x5: surfactant concentration (%)
x6: span:triton
x7: solid particles (%)
Use R to perform a regression analysis on the WATEROIL dataset (found on the D2L). Consider interaction
terms and second-order terms.
a. (10 pts.) Paste your final model into your submission (just the R output).
b. (10 pts.) Describe the model building process through which you generated this model.
c. (10 pts.) What significant second-order terms did you find, if any? Did you try all
second-order terms? Did you look at scatter plots to determine which second-order
terms to evaluate? Discuss the benefits and drawbacks of these two strategies.
d. (10 pts.) What significant interaction terms did you find, if any? Did you try all
combinations of interaction terms? Do you think that is an appropriate strategy? What
happens to the number of interaction terms as the number of independent terms
increases?
e. (10 pts.) Discuss your final model. Evaluate the t-tests, F-Test and adj-R2
accordingly.
Do you think this is a “good” model? Explain.