Description

5/5 - (6 votes)

1. Introduction
You will do a sequence of projects in this course. These sequence of
projects will give you practical experience with common attacks and
counter measures. To make the issues concrete, you will explore the
attacks and counter measures in the context of the zoobar web
application in some of the subsequent projects. Following is a list of
possible projects that might be assigned to you. Not all of these projects
will be assigned, as there are other projects related to other topics in the
class. But just to give you an idea of possible subsequent projects — here is
a possible list of projects. (Since multiple projects will depend on your
current project, you are strongly urged to do this project seriously
and completely).
Project 1: you will explore the base structure of the zoobar
web application, and use buffer overrun attacks to break its
security properties.
Project 2: you will improve zoobar web application by using privilege
separation, so that if one components is compromised, the adversary
doesn’t have control over the whole web application.
Project 3: you will build a program analysis tool (likely based on
symbolic execution) to find bugs in Python code such as the zoobar
web application.
Project 4: you will perform a security audit of two other student’s
solutions. You probably want to do a good job in project 2 and
project 3.
Project 5: you will improve the zoobar application against browser
attacks.
Project 6: you will extend the zoobar application to support javascript
user profiles in a secure manner.
2
Project 1 will introduce you to buffer overflow vulnerabilities, in the
context of a web server called zookws. The zookws web server is
running a simple python web application, zoobar, where users transfer
“zoobars” (credits) between each other. You will find buffer overflows in
the zookws web server code, write exploits for the buffer overflows to
inject code into the server, figure out how to bypass non-executable stack
protection, and finally look for other potential problems in the web server
implementation. Later projects look at other security aspects of the
zoobar and zookws infrastructure.
Each project may require you to learn a new programming language or
some other piece of infrastructure. For example, in this project you must
become intimate familiar with certain aspects of the C language, x86
assembly language, gdb, etc. The projects do so because that allows you
to understand attacks and defenses in realistic situations. Often you need
to understand certain parts of
this new infrastructure in detail; security weaknesses often show up in
corner cases, and so you need to understand the details to craft exploits
and design defenses for those corner cases. These two
factors (new infrastructure and details) can make the projects time
consuming. You should start early on the projects and work on
them daily for some limited time (each project has several exercises),
instead of trying to do all exercises in a single shot before the
deadline. You should also try to understand the necessary details, instead
of muddling your way through the exercises. If you don’t, the projects will
take a lot of time. If you get stuck on a detail, post a question on CS628
google group or see the TAs.
Several projects, including this project, ask you to design exploits. These
exploits are realistic enough that you might be able to use them for a real
attack, but you should not do so. The point of the designing exploits is to
teach you how to defend against them, not how to use them—attacking
computer systems is illegal and can get you into serious trouble. Please
Don’t do it.
2. Project infrastructure
Exploiting buffer overflows requires precise control over the execution
environment. A small change in the compiler, environment variables, or
the way the program is executed can result in slightly different memory
3
layout and code structure, thus requiring a different exploit. For this
reason, this project uses a VirtualBox virtual machine to run the
vulnerable web server code.
Once you have VirtualBox installed on your machine, you should
download the course VM image, provided with this project
assignment and unpack it on your computer. This virtual machine
contains an installation of Ubuntu 14.04.1 Linux, and the following
accounts have been created inside the VM. The name of the VM is vmCS628. The file containing the VM is provided along with this document
via http://security.cse.iitk.ac.in
Username Password Description
root CS628A You can use the root account to install new software packages
into the VM, if you find something missing, using apt-get
install pkgname.
httpd CyberSecurity The httpd account is used to execute the web server, and
contains the source code you will need for this project assignment,
in /home/httpd/lab.
You can either log into the virtual machine using its console, or you can
use ssh to log into the virtual machine over the (virtual) network. To
determine the virtual machine’s IP address, log in as root on the console
and run /sbin/ifconfig eth0 and find the iNET address.
For the rest of the document we assume this IP address is
192.168.134.128 for example. Each of you will get a different IP address
of course.
The files you will need for this project assignment is already preloaded
in the VM as ~httpd/lab
httpd@vm-628:~$ cd lab
httpd@vm-628:~/lab$
proceed with this project assignment, make sure you can compile the
zookws web server:
httpd@vm-628:~/lab$ make
4
zookld.c -c -o zookld.o -m32 -g -std=c99 -Wall
-Werror -D_GNU_SOURCE -fno-stack-protector
http.c -c -o http.o -m32 -g -std=c99 -Wall -Werror
-D_GNU_SOURCE -fno-stack-protector
zookld.o http.o -lcrypto -o zookld
zookfs.c -c -o zookfs.o -m32 -g -std=c99 -Wall
-Werror -D_GNU_SOURCE -fno-stack-protector
zookfs.o http.o -lcrypto -o zookfs
zookfs zookfs-nxstack
zookd.c -c -o zookd.o -m32 -g -std=c99 -Wall
-Werror -D_GNU_SOURCE -fno-stack-protector
zookd.o http.o -lcrypto -o zookd
zookd-nxstack
zookfs zookfs-exstack
execstack -s zookfs-exstack
zookd-exstack
execstack -s zookd-exstack
zookfs.c -c -o zookfs-withssp.o -m32 -g -std=c99
-Wall -Werror -D_GNU_SOURCE
http.c -c -o http-withssp.o -m32 -g -std=c99 -Wall
-Werror -D_GNU_SOURCE
5
zookfs-withssp.o http-withssp.o -lcrypto -o
zookfs-withssp
zookd.c -c -o zookd-withssp.o -m32 -g -std=c99
-Wall -Werror -D_GNU_SOURCE
zookd-withssp.o http-withssp.o -lcrypto -o
zookd-withssp
-c -o shellcode.o shellcode.S
-S -O binary -j .text shellcode.o shellcode.bin
run-shellcode.c -c -o run-shellcode.o -m32 -g
-std=c99 -Wall -Werror -D_GNU_SOURCE -fnostack-protector
run-shellcode.o -lcrypto -o run-shellcode
shellcode.o
httpd@vm-628:~/lab$
The server consists of the following components.
zookld, a launcher daemon that launches services configured in the
file zook.conf.
zookd, a dispatcher that routes HTTP requests to corresponding
services.
zookfs and other services that may serve static files or execute
dynamic scripts.
After zookld launches configured services, zookd listens on a port
(8080 by default) for incoming HTTP requests and reads the first line of
each request for dispatching. In this project, zookd is configured to
dispatch every request to the zookfs service, which reads the rest of
6
the request and generates a response from the requested file. Most
HTTP-related code is in http.c. Here is a tutorial of the HTTP protocol.
There are two versions of the web server you will be using:
zookld, zookd-exstack, zookfs-exstack, as configured
in the file zook-exstack.conf;
zookld, zookd-nxstack, zookfs-nxstack, as configured
in the file zook-nxstack.conf.
In the first one, the *-exstack binaries have an executable stack,
which makes it easier to inject executable code given a stack buffer
overflow vulnerability. The *-nxstack binaries in the second version
have a non-executable stack, and you will write exploits that bypass nonexecutable stacks later in this project assignment.
In order to run the web server in a predictable fashion—so that its stack
and memory layout is the same every time—you will use the cleanenv.sh script. This is the same way in which we will run the web server
during grading, so make sure all of your exploits work on this
configuration!
The reference binaries of zookws are provided in bin.tar.gz,
which we will use for
grading. Make sure your exploits work on those binaries.
Now, make sure you can run the zookws web server and
access the zoobar web application from a browser running on
your machine, as follows:
httpd@vm-628:~/lab$ /sbin/ifconfig eth0
eth0 Link encap:Ethernet HWaddr
00:0c:29:57:90:a1
7
inet addr: 198.168.134.128
Bcast:198.168.134.255
Mask:255.255.255.0
inet6 addr: fe80::20c:29ff:fe57:90a1/64
Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500
Metric:1
RX packets:149 errors:0 dropped:0
overruns:0 frame:0
TX packets:94 errors:0 dropped:0
overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:15235 (15.2 KB) TX bytes:12801
(12.8 KB)
Interrupt:19 Base address:0x2000
httpd@vm-628:~/lab$ ./clean-env.sh ./zookld zookexstack.conf
The /sbin/ifconfig command will give you the virtual machine’s
IP address. In this particular example you should go to
http://198.168.134.128:8080/ from a browser on your host machine.
If something doesn’t seem to be working, try to figure out what went
wrong, or contact the course TAs or the instructor before proceeding
further.
8
Part 1: Finding buffer overflows
In the first part of this project assignment, you will find buffer overflows in
the provided web server. Read Aleph One’s article, ” Smashing the Stack
for Fun and Profit”, as well as paper posted associated with this project
assignment titled ” Buffer Overflows: Attacks and Defenses for the
Vulnerability of the Decade”, to figure out how buffer overflows work.
Exercise 1. Study the web server’s code, and find examples of code
vulnerable to memory corruption through a buffer overflow. Write
down a description of each vulnerability in the file
/home/httpd/project/bugs.txt; use the format described
in that file. For each vulnerability, describe the buffer which may
overflow, how you would structure the input to the web server (i.e.,
the HTTP request) to overflow the buffer, and whether the
vulnerability can be prevented using stack canaries. Locate at least 5
different vulnerabilities.
You can use the command make check-bugs to check if your
bugs.txt file matches the required format, although the command
will not check whether the bugs you listed are actual bugs or whether
your analysis of them is correct.
Now, you will start developing exploits to take advantage of the buffer
overflows you have found above. We have provided template Python code
for an exploit in /home/httpd/project/exploittemplate.py, which issues an HTTP request. The exploit template
takes two arguments, the server name and port number, so you might run
it as follows to issue a request to zookws running on localhost:
httpd@vm-628:~/lab$ ./clean-env.sh ./zookld
zook-exstack.conf &
[1] 2676
httpd@vm-628:~/lab$ ./exploit-template.py
localhost 8080
HTTP request:
GET / HTTP/1.0
…
9
httpd@vm-628:~/lab$
You are free to use this template, or write your own exploit code from
scratch. Note, however, that if you choose to write your own exploit, the
exploit must run correctly inside the provided virtual machine. You will
find gdb useful in building your exploits. As zookws forks off many
processes, it can be difficult to debug the correct one. The easiest way to
do this is to run the web server ahead of time with clean-env.sh
and then attaching gdb to an already-running process with the -p flag.
To help find the right process for debugging, zookld prints out the
process IDs of the child processes that it spawns. You can also find the
PID of a process by using pgrep; for example, to attach to zookdexstack, start the server and, in another shell, run
httpd@vm-628:~/lab$ gdb -p $(pgrep zookdexstack)
…
0x4001d422 in kernel_vsyscall ()
(gdb) break your-breakpoint
Breakpoint 1 at 0x1234567: file zookd.c, line
999.
(gdb) continue
Continuing.
Keep in mind that a process being debugged by gdb will not get killed
even if you terminate the parent zookld process using ^C. If you are
having trouble restarting the web server, check for leftover processes
from the previous run, or be sure to exit gdb before restarting zookld.
When a process being debugged by gdb forks, by default gdb continues
to debug the parent process and does not attach to the child. Since
zookfs forks a child process to service each request, you may find it
helpful to have gdb attach to the child on fork, using the command set
follow-fork-mode child. We have added that command to
/home/httpd/lab/.gdbinit, which will take effect if you start
gdb in that directory.
For this and subsequent exercises, you may need to encode your attack
payload in different ways, depending on which vulnerability you are
10
exploiting. In some cases, you may need to make sure that your attack
payload is URL-encoded; that is, use + instead of space and %2b instead
of +. Here is a URL encoding reference and a handy conversion tool.
You can also use quoting functions in the python urllib module to URL
encode strings. In other cases, you may need to include binary values into
your payload. The Python struct module can help you do that. For
example, struct.pack(“<I”, x) will produce a 4-byte (32-bit) binary encoding of the integer x. Exercise 2. Pick two buffer overflows out of what you have found for later exercises (although you can change your mind later, if you find your choices are particularly difficult to exploit). The first must overwrite a return address on the stack, and the second must overwrite some other data structure that you will use to take over the control flow of the program. Write exploits that trigger them. You do not need to inject code or do anything other than corrupt memory past the end of the buffer, at this point. Verify that your exploit actually corrupts memory, by either checking the last few lines of dmesg | tail, using gdb, or observing that the web server crashes. Provide the code for the exploits in files called exploit-2a.py and exploit-2b.py, and indicate in answers.txt which buffer overflow each exploit triggers. If you believe some of the vulnerabilities you have identified in Exercise 1 cannot be exploited, choose a different vulnerability. You can check whether your exploits crash the server as follows: httpd@vm-628:~/lab$ make check-crash Part 2: Code injection In this part, you will use your buffer overflow exploits to inject code into the web server. The goal of the injected code will be to unlink (remove) a sensitive file on the server, namely /home/httpd/grades.txt. 11 Use the *-exstack binaries, since they have an executable stack that makes it easier to inject code. The zookws web server should be started as follows. httpd@vm-628:~/lab$ ./clean-env.sh ./zookld zook-exstack.conf We have provided Aleph One’s shell code for you to use in /home/httpd/lab/shellcode.S, along with Makefile rules that produce /home/httpd/lab/shellcode.bin, a compiled version of the shell code, when you run make. Aleph One’s exploit is intended to exploit setuid-root binaries, and thus it runs a shell. You will need to modify this shell code to instead unlink /home/httpd/grades.txt. To help you develop your shell code for this next exercise, we have provided a program called runshellcode that will run your binary shell code, as if you correctly jumped to its starting point. For example, running it on Aleph One’s shell code will cause the program to execve(“/bin/sh”), thereby giving you another shell prompt: httpd@vm-628:~/lab$ ./run-shellcode shellcode.bin $ When developing an exploit, you will have to think about what values are on the stack, so that you can modify them accordingly. For your reference, below you will find what the stack frame of some function foo looks like; here, foo has a local variable char buf[256]: 12 Note that the stack grows down in this figure, and memory addresses are increasing up. When you’re constructing an exploit, you will often need to know the addresses of specific stack locations, or specific functions, in a particular program. The easiest way to do this is to use gdb. For example, suppose you want to know the stack address of the pn[] array in the http_serve function in zookfs-exstack, and the address of its saved %ebp register on the stack. You can obtain them using gdb as follows: httpd@vm-6858:~/lab$ gdb -p $(pgrep zookfsexstack) … 0x40022416 in kernel_vsyscall () (gdb) break http_serve Breakpoint 1 at 0x8049415: file http.c, line 248. (gdb) continue Continuing. return address to foo’s caller …….. Stack Frame of foo’s caller ……… saved %ebp …………………. .. buf[255] …. …. …. buf[0] (4 bytes) %ebp –>
memory
addresses
increasing
up
stack
growing
down
13
Be sure to run gdb from the ~/lab directory, so that it picks up the set
follow-fork-mode child command from ~/lab/.gdbinit.
Now you can issue an HTTP request to the web server, so that it triggers
the breakpoint, and so that you can examine the stack of http_serve:
[New
proce
ss
1339]
[Swit
ching
to
proce
ss
1339
]
Breakpoint 1, http_serve (fd=3, name=0x8051014
“/”) at http.c:248
248 void (*handler)(int, const char *) =
http_serve_none;
(gdb) print &pn
$1 = (char (*)[1024]) 0xbfffd10c
(gdb) info registers
eax 0x3 3
ecx 0x400bdec0 1074519744
edx 0x6c6d74 7105908
ebx 0x804a38e 134521742
esp 0xbfffd0a0 0xbfffd0a0
ebp 0xbfffd518 0xbfffd518
esi 0x0 0
edi 0x0 0
eip 0x8049415 0x8049415
<http_serve+9>
eflags 0x200286 [ PF SF IF ID ]
cs 0x73 115
ss 0x7b 123
ds 0x7b 123
14
es 0x7b 123
fs 0x0 0
gs 0x33 51
(gdb)
From this, you can tell that, at least for this invocation of http_serve,
the pn[] buffer on the stack lives at address 0xbfffd10c, and the
value of %ebp (which points at the saved %ebp on the stack) is
0xbfffd518.
Now it’s your turn to develop an exploit.
Exercise 3. Starting from one of your exploits from Exercise 2,
construct an exploit that hijacks control flow of the web server and
unlinks /home/httpd/grades.txt. Save this exploit in a file
called exploit-3.py.
Explain in answers.txt whether or not the other buffer
overflow vulnerabilities you found in Exercise 1 can be exploited
in this manner.
Verify that your exploit works; you will need to re-create
/home/httpd/grades.txt after each successful exploit run.
Suggestion: first focus on obtaining control of the program counter.
Sketch out the stack layout that you expect the program to have at
the point when you overflow the buffer, and use gdb to verify that
your overflow data ends up where you expect it to. Step through the
execution of the function to the return instruction to make sure you
can control what address the program returns to. The next,
stepi, info reg, and disassemble commands in gdb
should prove helpful.
Once you can reliably hijack the control flow of the program,
find a suitable address that will contain the code you want to
execute, and focus on placing the correct code at that
address—e.g. a derivative of Aleph One’s shell code.
Note: SYS_unlink, the number of the unlink syscall, is 10 or
‘\n’ (newline). Why does this complicate matters? How can
you get around it?
You can check whether your exploit works as follows:
15
httpd@vm-628:~/lab$ make check-exstack
The test either prints “PASS” or fails. We will grade your exploits in this way.
If you use another name for the exploit script, change Makefile
accordingly.
The standard C compiler used on Linux, gcc, implements a version of
stack canaries (called SSP). You can explore whether GCC’s version of
stack canaries would or would not prevent a given vulnerability by using
the SSP-enabled versions of the web server binaries (zookd-withssp
and zookfs-withssp), by using the zook-withssp.conf config
file when starting zookld.
Run make prepare-submit-a. The resulting lab1ahandin.tar.gz file will be graded. Submit it via mookit
Part 3: Return-to-libc attacks
Many modern operating systems mark the stack non-executable in an
attempt to make it more difficult to exploit buffer overflows. In this part,
you will explore how this protection mechanism can be circumvented. Run
the web server configured with binaries that have a non-executable stack,
as follows.
httpd@vm-628:~/project$ ./clean-env.sh
./zookld zook-nxstack.conf
The key observation to exploiting buffer overflows with a non-executable
stack is that you still control the program counter, after a RET instruction
jumps to an address that you placed on the stack. Even though you
cannot jump to the address of the overflowed buffer (it will not be
executable), there’s usually enough code in the vulnerable server’s
address space to perform the operation you want. Thus, to bypass a nonexecutable stack, you need to first find the code you want to execute.
This is often a function in the standard library, called libc, such as
execl, system, or unlink. Then, you need to arrange for the stack
to look like a call to that function with the desired arguments, such as
system(“/bin/sh”). Finally, you need to arrange for the RET
instruction to jump to the function you found in the first step. This attack
is often called a return-to-libc attack. The document ‘return to
16
libc” provided with this project assignment, contains a more
detailed description of this style of attack
In the next exercise, you will need to understand the calling convention
for C functions. For your reference, consider the following simple C
program:
void
foo(int x, char *msg, int y)
{
/* … */
}
void bar (void) {
int a = 3;
foo(5, “Hello, world!”, 7);
}
The stack layout when bar invokes foo, just after the program counter
has switched to the beginning of foo, looks like this:
17
When foo starts running, the first thing it will do is save the %ebp register
on the stack, and set the %ebp register to point at this saved value on the
stack, so the stack frame will look like the one shown just above exercise 3.
Exercise 4. Starting from your two exploits in Exercise 2,
construct two exploits that take advantage of those
vulnerabilities to unlink /home/httpd/grades.txt when
run on the binaries that have a non-executable stack. Name
these new exploits exploit-4a.py and exploit-4b.py.
Although in principle you could use shellcode that’s not located on
the stack, for this exercise you should not inject any shellcode into
the vulnerable process. You should use a return-to-libc (or at least
a call-to-libc) attack where you vector control flow directly into
code that existed before your attack.
In answers.txt, explain whether or not the other buffer
overflow vulnerabilities you found in Exercise 1 can be
exploited in this same manner.
saved %ebp
…….
3
……
7
pointer to string
5
return address into
bar
%ebp ———–>
bar’s a ——>
%esp ———–>
(4 bytes)
(4 bytes)
(4 bytes)
—-> “Hello world!” (somewhere in memory)
(4 bytes)
(4 bytes)
(4 bytes)
18
You can test your exploits as follows:
httpd@vm-6858:~/lab$ make check-libc
The test either prints two “PASS” messages or fails. We will grade your
exploits in this way. If you use other names for the exploit scripts, change
Makefile accordingly.
Part 4: Fixing buffer overflows and other bugs
Now that you have figured out how to exploit buffer overflows, you will try
to find other kinds of vulnerabilities in the same code. As with many realworld applications, the “security” of our web server is not well-defined.
Thus, you will need to use your imagination to think of a plausible threat
model and policy for the web server.
Exercise 5. Look through the source code and try to find more
vulnerabilities that can allow an attacker to compromise the
security of the web server. Describe the attacks you have found in
answers.txt, along with an explanation of the limitations of
the attack, what an attacker can accomplish, why it works, and how
you might go about fixing or preventing it. You should ignore bugs
in zoobar’s code. They will be addressed in future labs.
One approach for finding vulnerabilities is to trace the flow of inputs
controlled by the attacker through the server code. At each point
that the attacker’s input is used, consider all the possible values the
attacker might have provided at that point, and what the attacker
can achieve in that manner.
You should find at least two vulnerabilities for this exercise.
Finally, you will explore fixing some of the vulnerabilities you have found in
this project assignment.
Exercise 6. For each buffer overflow vulnerability you have
found in Exercise 1, fix the web server’s code to prevent the
vulnerability in the first place. Do not rely on compile-time or
19
runtime mechanisms such as stack canaries, removing -fnostack-protector, baggy bounds checking, etc.
Run make prepare-submit. The resulting lab1-
handin.tar.gz file will be graded. So submit it via mookit.

Project 1 CS 628/628A: Computer System Security solved

Description

Related products

Project 2 CS 628/628A: Computer System Security solved

Project 3 CS 628/628A: Computer System Security solved