From a5aade2fa9ff89f9f3c4a91261071299de0d0fa4 Mon Sep 17 00:00:00 2001 From: Mike Pall Date: Fri, 11 Feb 2011 01:21:46 +0100 Subject: FFI: Finish FFI docs. --- doc/ext_ffi_tutorial.html | 352 +++++++++++++++++++++++++++++++++++++++++++++- 1 file changed, 348 insertions(+), 4 deletions(-) (limited to 'doc/ext_ffi_tutorial.html') diff --git a/doc/ext_ffi_tutorial.html b/doc/ext_ffi_tutorial.html index 11e83339..c43b223b 100644 --- a/doc/ext_ffi_tutorial.html +++ b/doc/ext_ffi_tutorial.html @@ -8,6 +8,13 @@ +
@@ -33,8 +40,6 @@
  • ffi.* API
  • -64 bit Integers -
  • FFI Semantics
  • @@ -57,7 +62,14 @@
  • -TODO +This page is intended to give you an overview of the features of the FFI +library by presenting a few use cases and guidelines. +

    +

    +This page makes no attempt to explain all of the FFI library, though. +You'll want to have a look at the ffi.* API +function reference and the FFI +semantics to learn more.

    Loading the FFI Library

    @@ -76,7 +88,339 @@ of globals — you really need to use the local variable. The require function ensures the library is only loaded once.

    -

    TODO

    +

    Accessing Standard System Functions

    +

    +The following code explains how to access standard system functions. +We slowly print two lines of dots by sleeping for 10 milliseconds +after each dot: +

    +
    +local ffi = require("ffi")
    +ffi.cdef[[ //
    +void Sleep(int ms);
    +int poll(struct pollfd *fds, unsigned long nfds, int timeout);
    +]]
    +
    +local sleep
    +if ffi.os == "Windows" then --
    +  function sleep(s) --
    +    ffi.C.Sleep(s*1000) --
    +  end
    +else
    +  function sleep(s)
    +    ffi.C.poll(nil, 0, s*1000) --
    +  end
    +end
    +
    +for i=1,160 do
    +  io.write("."); io.flush()
    +  sleep(0.01) --
    +end
    +io.write("\n")
    +
    +

    +Here's the step-by-step explanation: +

    +

    + This defines the +C library functions we're going to use. The part inside the +double-brackets (in green) is just standard C syntax. You can +usually get this info from the C header files or the +documentation provided by each C library or C compiler. +

    +

    + The difficulty we're +facing here, is that there are different standards to choose from. +Windows has a simple Sleep() function. On other systems there +are a variety of functions available to achieve sub-second sleeps, but +with no clear consensus. Thankfully poll() can be used for +this task, too, and it's present on most non-Windows systems. The +check for ffi.os makes sure we use the Windows-specific +function only on Windows systems. +

    +

    + Here we're wrapping the +call to the C function in a Lua function. This isn't strictly +necessary, but it's helpful to deal with system-specific issues only +in one part of the code. The way we're wrapping it ensures the check +for the OS is only done during initialization and not for every call. +

    +

    + A more subtle point is +that we defined our sleep() function (for the sake of this +example) as taking the number of seconds, but accepting fractional +seconds. Multiplying this by 1000 gets us milliseconds, but that still +leaves it a Lua number, which is a floating-point value. Alas, the +Sleep() function only accepts an integer value. Luckily for +us, the FFI library automatically performs the conversion when calling +the function (truncating the FP value towards zero, like in C). +

    +

    +Some readers will notice that Sleep() is part of +KERNEL32.DLL and is also a stdcall function. So how +can this possibly work? The FFI library provides the ffi.C +default C library namespace, which allows calling functions from +the default set of libraries, like a C compiler would. Also, the +FFI library automatically detects stdcall functions, so you +don't need to declare them as such. +

    +

    + The poll() +function takes a couple more arguments we're not going to use. You can +simply use nil to pass a NULL pointer and 0 +for the nfds parameter. Please note that the +number 0 does not convert to a pointer value, +unlike in C++. You really have to pass pointers to pointer arguments +and numbers to number arguments. +

    +

    +The page on FFI semantics has all +of the gory details about +conversions between Lua +objects and C types. For the most part you don't have to deal +with this, as it's performed automatically and it's carefully designed +to bridge the semantic differences between Lua and C. +

    +

    + Now that we have defined +our own sleep() function, we can just call it from plain Lua +code. That wasn't so bad, huh? Turning these boring animated dots into +a fascinating best-selling game is left as an exercise for the reader. +:-) +

    + +

    Accessing the zlib Compression Library

    +

    +The following code shows how to access the zlib compression library from Lua code. +We'll define two convenience wrapper functions that take a string and +compress or uncompress it to another string: +

    +
    +local ffi = require("ffi")
    +ffi.cdef[[ //
    +unsigned long compressBound(unsigned long sourceLen);
    +int compress2(uint8_t *dest, unsigned long *destLen,
    +	      const uint8_t *source, unsigned long sourceLen, int level);
    +int uncompress(uint8_t *dest, unsigned long *destLen,
    +	       const uint8_t *source, unsigned long sourceLen);
    +]]
    +local zlib = ffi.load(ffi.os == "Windows" and "zlib1" or "z") --
    +
    +local function compress(txt)
    +  local n = zlib.compressBound(#txt) --
    +  local buf = ffi.new("uint8_t[?]", n)
    +  local buflen = ffi.new("unsigned long[1]", n) --
    +  local res = zlib.compress2(buf, buflen, txt, #txt, 9)
    +  assert(res == 0)
    +  return ffi.string(buf, buflen[0]) --
    +end
    +
    +local function uncompress(comp, n) --
    +  local buf = ffi.new("uint8_t[?]", n)
    +  local buflen = ffi.new("unsigned long[1]", n)
    +  local res = zlib.uncompress(buf, buflen, comp, #comp)
    +  assert(res == 0)
    +  return ffi.string(buf, buflen[0])
    +end
    +
    +-- Simple test code. --
    +local txt = string.rep("abcd", 1000)
    +print("Uncompressed size: ", #txt)
    +local c = compress(txt)
    +print("Compressed size: ", #c)
    +local txt2 = uncompress(c, #txt)
    +assert(txt2 == txt)
    +
    +

    +Here's the step-by-step explanation: +

    +

    + This defines some of the +C functions provided by zlib. For the sake of this example, some +type indirections have been reduced and it uses the pre-defined +fixed-size integer types, while still adhering to the zlib API/ABI. +

    +

    + This loads the zlib shared +library. On POSIX systems it's named libz.so and usually +comes pre-installed. Since ffi.load() automatically adds any +missing standard prefixes/suffixes, we can simply load the +"z" library. On Windows it's named zlib1.dll and +you'll have to download it first from the +» zlib site. The check for +ffi.os makes sure we pass the right name to +ffi.load(). +

    +

    + First, the maximum size of +the compression buffer is obtained by calling the +zlib.compressBound function with the length of the +uncompressed string. The next line allocates a byte buffer of this +size. The [?] in the type specification indicates a +variable-length array (VLA). The actual number of elements of this +array is given as the 2nd argument to ffi.new(). +

    +

    + This may look strange at +first, but have a look at the declaration of the compress2 +function from zlib: the destination length is defined as a pointer! +This is because you pass in the maximum buffer size and get back the +actual length that was used. +

    +

    +In C you'd pass in the address of a local variable +(&buflen). But since there's no address-of operator in +Lua, we'll just pass in a one-element array. Conveniently it can be +initialized with the maximum buffer size in one step. Calling the +actual zlib.compress2 function is then straightforward. +

    +

    + We want to return the +compressed data as a Lua string, so we'll use ffi.string(). +It needs a pointer to the start of the data and the actual length. The +length has been returned in the buflen array, so we'll just +get it from there. +

    +

    +Note that since the function returns now, the buf and +buflen variables will eventually be garbage collected. This +is fine, because ffi.string() has copied the contents to a +newly created (interned) Lua string. If you plan to call this function +lots of times, consider reusing the buffers and/or handing back the +results in buffers instead of strings. This will reduce the overhead +for garbage collection and string interning. +

    +

    + The uncompress +functions does the exact opposite of the compress function. +The compressed data doesn't include the size of the original string, +so this needs to be passed in. Otherwise no surprises here. +

    +

    + The code, that makes use +of the functions we just defined, is just plain Lua code. It doesn't +need to know anything about the LuaJIT FFI — the convenience +wrapper functions completely hide it. +

    +

    +One major advantage of the LuaJIT FFI is that you are now able to +write those wrappers in Lua. And at a fraction of the time it +would cost you to create an extra C module using the Lua/C API. +Many of the simpler C functions can probably be used directly +from your Lua code, without any wrappers. +

    +

    +Side note: the zlib API uses the long type for passing +lengths and sizes around. But all those zlib functions actually only +deal with 32 bit values. This is an unfortunate choice for a +public API, but may be explained by zlib's history — we'll just +have to deal with it. +

    +

    +First, you should know that a long is a 64 bit type e.g. +on POSIX/x64 systems, but a 32 bit type on Windows/x64 and on +32 bit systems. Thus a long result can be either a plain +Lua number or a boxed 64 bit integer cdata object, depending on +the target system. +

    +

    +Ok, so the ffi.* functions generally accept cdata objects +wherever you'd want to use a number. That's why we get a away with +passing n to ffi.string() above. But other Lua +library functions or modules don't know how to deal with this. So for +maximum portability one needs to use tonumber() on returned +long results before passing them on. Otherwise the +application might work on some systems, but would fail in a POSIX/x64 +environment. +

    + +

    Translating C Idioms

    +

    +Here's a list of common C idioms and their translation to the +LuaJIT FFI: +

    + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
    IdiomC codeLua code
    Pointer dereference
    int *p;
    x = *p;
    *p = y;
    x = p[0]
    p[0] = y
    Pointer indexing
    int i, *p;
    x = p[i];
    p[i+1] = y;
    x = p[i]
    p[i+1] = y
    Array indexing
    int i, a[];
    x = a[i];
    a[i+1] = y;
    x = a[i]
    a[i+1] = y
    struct/union dereference
    struct foo s;
    x = s.field;
    s.field = y;
    x = s.field
    s.field = y
    struct/union pointer deref.
    struct foo *sp;
    x = sp->field;
    sp->field = y;
    x = s.field
    s.field = y
    Pointer arithmetic
    int i, *p;
    x = p + i;
    y = p - i;
    x = p + i
    y = p - i
    Pointer difference
    int *p1, *p2;
    x = p1 - p2;x = p1 - p2
    Array element pointer
    int i, a[];
    x = &a[i];x = a+i
    Cast pointer to address
    int *p;
    x = (intptr_t)p;x = tonumber(
     ffi.cast("intptr_t",
              p))
    Functions with outargs
    void foo(int *inoutlen);
    int len = x;
    foo(&len);
    y = len;
    local len =
      ffi.new("int[1]", x)
    foo(len)
    y = len[0]
    Vararg conversions
    int printf(char *fmt, ...);
    printf("%g", 1.0);
    printf("%d", 1);
     
    printf("%g", 1);
    printf("%d",
      ffi.new("int", 1))
    + +

    To Cache or Not to Cache

    +

    +It's a common Lua idiom to cache library functions in local variables +or upvalues, e.g.: +

    +
    +local byte, char = string.byte, string.char
    +local function foo(x)
    +  return char(byte(x)+1)
    +end
    +
    +

    +This replaces several hash-table lookups with a (faster) direct use of +a local or an upvalue. This is less important with LuaJIT, since the +JIT compiler optimizes hash-table lookups a lot and is even able to +hoist most of them out of the inner loops. It can't eliminate +all of them, though, and it saves some typing for often-used +functions. So there's still a place for this, even with LuaJIT. +

    +

    +The situation is a bit different with C function calls via the +FFI library. The JIT compiler has special logic to eliminate all +of the lookup overhead for functions resolved from a +C library namespace! +Thus it's not helpful and actually counter-productive to cache +individual C functions like this: +

    +
    +local funca, funcb = ffi.C.funcb, ffi.C.funcb -- Not helpful!
    +local function foo(x, n)
    +  for i=1,n do funcb(funca(x, i), 1) end
    +end
    +
    +

    +This turns them into indirect calls and generates bigger and slower +machine code. Instead you'll want to cache the namespace itself and +rely on the JIT compiler to eliminate the lookups: +

    +
    +local C = ffi.C          -- Instead use this!
    +local function foo(x, n)
    +  for i=1,n do C.funcb(C.funca(x, i), 1) end
    +end
    +
    +

    +This generates both shorter and faster code. So don't cache +C functions, but do cache namespaces! Most often the +namespace is already in a local variable at an outer scope, e.g. from +local lib = ffi.load(...). Note that copying +it to a local variable in the function scope is unnecessary. +