Performance test of the marshalling code, measured in machine clocks per iteration.
gcc -O3 -Wall marsh.c mt.c
yields 69 machine clocks per iteration on a G4 PowerPC and 120 on a Pentium III.
gcc -O3 -Wall Pmarsh.c Pmt.c
yields 36 machine clocks per iteration on a G4 PowerPC and 86 on a Pentium III.
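
These notes do not say how the clock counts were obtained. One way to arrive at a clocks-per-iteration figure, shown here as a minimal sketch and not as the method used in mt.c or Pmt.c, is to time a large number of calls with the standard clock() function and scale the result by the CPU frequency. The names marshal_u32, time_marshal and clocks_per_iteration are hypothetical, and the CPU frequency must be supplied by the caller.

```c
#include <assert.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical stand-in for the routine under test: store a 32-bit
   value in big-endian byte order. Not taken from marsh.c. */
void marshal_u32(uint32_t v, unsigned char out[4])
{
    out[0] = (unsigned char)(v >> 24);
    out[1] = (unsigned char)(v >> 16);
    out[2] = (unsigned char)(v >> 8);
    out[3] = (unsigned char)v;
}

/* Run the routine `iterations` times and return the CPU seconds used.
   Loop overhead is included in the measurement. */
double time_marshal(unsigned long iterations)
{
    unsigned char buf[4];
    volatile unsigned char sink = 0; /* keeps the loop from being optimized away */
    unsigned long i;
    clock_t start = clock();

    for (i = 0; i < iterations; i++) {
        marshal_u32((uint32_t)i, buf);
        sink ^= buf[3];
    }
    return (double)(clock() - start) / CLOCKS_PER_SEC;
}

/* Convert elapsed seconds into machine clocks per iteration.
   cpu_hz is an assumption supplied by the caller, not something measured here. */
double clocks_per_iteration(double seconds, double cpu_hz, unsigned long iterations)
{
    assert(iterations > 0);
    return seconds * cpu_hz / (double)iterations;
}
```

For a hypothetical 500 MHz machine, clocks_per_iteration(time_marshal(100000000UL), 500e6, 100000000UL) gives the estimate. clock() has coarse resolution, so the iteration count needs to be large enough for the run to last at least a second or so.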