DSCI 552: Programming Assignment 4

$30.00

Category: You will Instantly receive a download link for .zip solution file upon Payment || To Order Original Work Click Custom Order?

Description

5/5 - (1 vote)

Part 1: Implementation [7 points]

[2 points] Implement the Perceptron Learning algorithm. Run it on the data file “classification.txt”
ignoring the 5th column. That is, consider only the first 4 columns in each row. The first 3 columns are
the coordinates of a point; and the 4th column is its classification label +1 or -1. Report your results
(weights and accuracy after the final iteration).

[1 point] Implement the Pocket algorithm and run it on the data file “classification.txt” ignoring the 4th
column. That is, consider only the first 3 columns and the 5th column in each row. The first 3 columns
are the coordinates of a point; and the 5th column is its classification label +1 or -1. Plot the number of
misclassified points against the number of iterations of the algorithm. Run up to 7000 iterations. Also,
report your results (weights and accuracy after the final iteration).

[3 points] Implement Logistic Regression and run it on the points in the data file “classification.txt”
ignoring the 4th column. That is, consider only the first 3 columns and the 5th column in each row. The
first 3 columns are the coordinates of a point; and the 5th column is its classification label +1 or -1. Use
the sigmoid function Ɵ(s) = es
/(1+es
). Run up to 7000 iterations. Report your results (weights and
accuracy after the final iteration).

[1 point] Implement Linear Regression and run it on the points in the data file “linear-regression.txt”.
The first 2 columns in each row represent the independent X and Y variables; and the 3rd column
represents the dependent Z variable. Report your results (weights after the final iteration).

You can write your programs in any programming language. However, you will have to implement the
algorithms yourself instead of using high-level library functions, except for solving a system of linear
equations. Please provide a description of the data structures you use, any code-level optimizations you
perform, any challenges you face, and of course, the requested outputs.

Part 2: Software Familiarization [Optional – No Credit]
Do your own research and find out about library functions that offer good implementations of linear
classification, linear regression, and logistic regression. Learn how to use them. Compare them against
your implementations and suggest some ideas for how you can improve your code. Describe all this in
your report.

Part 3: Applications [Optional – No Credit]
Do your own research and describe some interesting applications of linear classification, linear
regression, and logistic regression.

Submission Guidelines
In your report, please include the names of all group members and mention their individual contributions.

The maximum number of the members in a team is 2. The report should be in a PDF format. Your
submission should include the code as well as the report and is due before 3/10, 11:59pm in an archive
in a zip, tar.gz or tar.xz format. Only one submission is required for each group by one of the group
members. Please submit your homework on D2L (do NOT email the homework to the instructor or the
TA).