diff options
| author | Denis Vlasenko <vda.linux@googlemail.com> | 2007-03-15 19:52:42 +0000 |
|---|---|---|
| committer | Denis Vlasenko <vda.linux@googlemail.com> | 2007-03-15 19:52:42 +0000 |
| commit | 4efeaee387eb9c20afcf5576802755a646adb776 (patch) | |
| tree | 306d8ad4c9a20e971356afc3b3a76abc15263458 /docs | |
| parent | 5e892bae51a1f4414006a70f37d87b339e943cba (diff) | |
| download | busybox-w32-4efeaee387eb9c20afcf5576802755a646adb776.tar.gz busybox-w32-4efeaee387eb9c20afcf5576802755a646adb776.tar.bz2 busybox-w32-4efeaee387eb9c20afcf5576802755a646adb776.zip | |
mkfs_minix: stop using lots of bss/data.
data -3500 bytes, code -300 bytes
keep_data_small: expand/fix
Diffstat (limited to 'docs')
| -rw-r--r-- | docs/keep_data_small.txt | 68 |
1 files changed, 52 insertions, 16 deletions
diff --git a/docs/keep_data_small.txt b/docs/keep_data_small.txt index ec13b4d3f..5137dffc6 100644 --- a/docs/keep_data_small.txt +++ b/docs/keep_data_small.txt | |||
| @@ -1,4 +1,4 @@ | |||
| 1 | Keeping data small | 1 | Keeping data small |
| 2 | 2 | ||
| 3 | When many applets are compiled into busybox, all rw data and | 3 | When many applets are compiled into busybox, all rw data and |
| 4 | bss for each applet are concatenated. Including those from libc, | 4 | bss for each applet are concatenated. Including those from libc, |
| @@ -10,6 +10,11 @@ On nommu it's probably bites the most, actually using real | |||
| 10 | RAM for rwdata and bss. On i386, bss is lazily allocated | 10 | RAM for rwdata and bss. On i386, bss is lazily allocated |
| 11 | by COWed zero pages. Not sure about rwdata - also COW? | 11 | by COWed zero pages. Not sure about rwdata - also COW? |
| 12 | 12 | ||
| 13 | In order to keep bbox NOMMU and small-mem systems friendly | ||
| 14 | we should avoid large global data in our applets, and should | ||
| 15 | minimize usage of libc functions which implicitly use | ||
| 16 | such structures in libc. | ||
| 17 | |||
| 13 | Small experiment measures "parasitic" bbox memory consumption. | 18 | Small experiment measures "parasitic" bbox memory consumption. |
| 14 | Here we start 1000 "busybox sleep 10" in parallel. | 19 | Here we start 1000 "busybox sleep 10" in parallel. |
| 15 | bbox binary is practically allyesconfig static one, | 20 | bbox binary is practically allyesconfig static one, |
| @@ -34,14 +39,10 @@ bash-3.2# nmeter '%t %c %b %m %p %[pn]' | |||
| 34 | 23:17:43 .......... 0 0 168M 0 147 | 39 | 23:17:43 .......... 0 0 168M 0 147 |
| 35 | 40 | ||
| 36 | This requires 55M of memory. Thus 1 trivial busybox applet | 41 | This requires 55M of memory. Thus 1 trivial busybox applet |
| 37 | takes 55k of userspace memory (nmeter doesn't account for kernel-side | 42 | takes 55k of memory. |
| 38 | allocations). Definitely can be improved. | ||
| 39 | 43 | ||
| 40 | Thus we should avoid large global data in our applets, | ||
| 41 | and should minimize usage of libc functions which implicitly use | ||
| 42 | such structures in libc. | ||
| 43 | 44 | ||
| 44 | Example 1 | 45 | Example 1 |
| 45 | 46 | ||
| 46 | One example how to reduce global data usage is in | 47 | One example how to reduce global data usage is in |
| 47 | archival/libunarchive/decompress_unzip.c: | 48 | archival/libunarchive/decompress_unzip.c: |
| @@ -57,12 +58,15 @@ archival/libunarchive/decompress_unzip.c: | |||
| 57 | #define STATE_IN_BSS 0 | 58 | #define STATE_IN_BSS 0 |
| 58 | #define STATE_IN_MALLOC 1 | 59 | #define STATE_IN_MALLOC 1 |
| 59 | 60 | ||
| 61 | (see the rest of the file to get the idea) | ||
| 62 | |||
| 60 | This example completely eliminates globals in that module. | 63 | This example completely eliminates globals in that module. |
| 61 | Required memory is allocated in inflate_gunzip() [its main module] | 64 | Required memory is allocated in inflate_gunzip() [its main module] |
| 62 | and then passed down to all subroutines which need to access 'globals' | 65 | and then passed down to all subroutines which need to access 'globals' |
| 63 | as a parameter. | 66 | as a parameter. |
| 64 | 67 | ||
| 65 | Example 2 | 68 | |
| 69 | Example 2 | ||
| 66 | 70 | ||
| 67 | In case you don't want to pass this additional parameter everywhere, | 71 | In case you don't want to pass this additional parameter everywhere, |
| 68 | take a look at archival/gzip.c. Here all global data is replaced by | 72 | take a look at archival/gzip.c. Here all global data is replaced by |
| @@ -70,7 +74,7 @@ single global pointer (ptr_to_globals) to allocated storage. | |||
| 70 | 74 | ||
| 71 | In order to not duplicate ptr_to_globals in every applet, you can | 75 | In order to not duplicate ptr_to_globals in every applet, you can |
| 72 | reuse single common one. It is defined in libbb/messages.c | 76 | reuse single common one. It is defined in libbb/messages.c |
| 73 | as struct globals *ptr_to_globals, but the struct globals is | 77 | as struct globals *const ptr_to_globals, but the struct globals is |
| 74 | NOT defined in libbb.h. You first define your own struct: | 78 | NOT defined in libbb.h. You first define your own struct: |
| 75 | 79 | ||
| 76 | struct globals { int a; char buf[1000]; }; | 80 | struct globals { int a; char buf[1000]; }; |
| @@ -79,13 +83,45 @@ and then declare that ptr_to_globals is a pointer to it: | |||
| 79 | 83 | ||
| 80 | #define G (*ptr_to_globals) | 84 | #define G (*ptr_to_globals) |
| 81 | 85 | ||
| 82 | Linker magic ensures that these two merge into single pointer object. | 86 | ptr_to_globals is declared as constant pointer. |
| 83 | Now initialize it in <applet>_main(): | 87 | This helps gcc understand that it won't change, resulting in noticeably |
| 88 | smaller code. In order to assign it, use PTR_TO_GLOBALS macro: | ||
| 89 | |||
| 90 | PTR_TO_GLOBALS = xzalloc(sizeof(G)); | ||
| 91 | |||
| 92 | Typically it is done in <applet>_main(). | ||
| 93 | |||
| 94 | Now you can reference "globals" by G.a, G.buf and so on, in any function. | ||
| 95 | |||
| 96 | |||
| 97 | bb_common_bufsiz1 | ||
| 98 | |||
| 99 | There is one big common buffer in bss - bb_common_bufsiz1. It is a much | ||
| 100 | earlier mechanism to reduce bss usage. Each applet can use it for | ||
| 101 | its needs. Library functions are prohibited from using it. | ||
| 102 | |||
| 103 | 'G.' trick can be done using bb_common_bufsiz1 instead of malloced buffer: | ||
| 104 | |||
| 105 | #define G (*(struct globals*)&bb_common_bufsiz1) | ||
| 106 | |||
| 107 | Be careful, though, and use it only if | ||
| 108 | sizeof(struct globals) <= sizeof(bb_common_bufsiz1). | ||
| 109 | |||
| 110 | |||
| 111 | Drawbacks | ||
| 112 | |||
| 113 | You have to initialize it by hand. xzalloc() can be helpful in clearing | ||
| 114 | allocated storage to 0, but anything more must be done by hand. | ||
| 115 | |||
| 116 | All global variables are prefixed by 'G.' now. If this makes code | ||
| 117 | less readable, use #defines: | ||
| 118 | |||
| 119 | #define dev_fd (G.dev_fd) | ||
| 120 | #define sector (G.sector) | ||
| 84 | 121 | ||
| 85 | ptr_to_globals = xzalloc(sizeof(G)); | ||
| 86 | 122 | ||
| 87 | and you can reference "globals" by G.a, G.buf and so on, in any function. | 123 | Word of caution |
| 88 | 124 | ||
| 89 | The drawback is that now you have to initialize it by hand. xzalloc() | 125 | If applet doesn't use much of global data, converting it to using |
| 90 | can be helpful in clearing allocated storage to 0, but anything more | 126 | one of above methods is not worth resulting code obfuscation. |
| 91 | must be done by hand. | 127 | If you have less that ~300 bytes of global data - don't bother. |
