| author | Denys Vlasenko <vda.linux@googlemail.com> | 2010-02-01 22:35:30 +0100 |
|---|---|---|
| committer | Denys Vlasenko <vda.linux@googlemail.com> | 2010-02-01 22:35:30 +0100 |
| commit | 4875e7148b0512ee3c255526a484503da984935a (patch) | |
| tree | bc931a92ab19716b86809d1c06a38f1c7c88ec7c /docs | |
| parent | 698dca5805117f470ef19488428c8a5f795b9e0c (diff) | |
docs/unicode.txt: added more TODOs
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
Diffstat (limited to 'docs')
| -rw-r--r-- | docs/unicode.txt | 31 |
1 files changed, 23 insertions, 8 deletions
diff --git a/docs/unicode.txt b/docs/unicode.txt
index 019d12f65..32df24dc8 100644
--- a/docs/unicode.txt
+++ b/docs/unicode.txt

@@ -26,30 +26,45 @@ But we also need to handle the following problematic moments:

| | a/docs/unicode.txt | | b/docs/unicode.txt |
|---|---|---|---|
| 26 | * Bidirectional handling. If user wants to echo a phrase | 26 | * Bidirectional handling. If user wants to echo a phrase |
| 27 | in Hebrew, he types: echo "srettel werbeH" | 27 | in Hebrew, he types: echo "srettel werbeH" |
| 28 | | 28 | |
| 29 | Editors | 29 | Editors (vi, ed) |
| 30 | | 30 | |
| 31 | This case is a bit similar to "shell input", but unlike shell, | 31 | This case is a bit similar to "shell input", but unlike shell, |
| 32 | editors may encounder many more unexpected unicode sequences | 32 | editors may encounder many more unexpected unicode sequences |
| 33 | (try to load a random binry file...), and they need to preserve | 33 | (try to load a random binary file...), and they need to preserve |
| 34 | them, unlike shell which can afford to drop bogus input. | 34 | them, unlike shell which can afford to drop bogus input. |
| 35 | | 35 | |
| 36 | | | |
| 37 | more, less | 36 | more, less |
| 38 | | 37 | |
| 39 | . | 38 | Need to correctly display any input file. Ideally, with |
| | | 39 | ASCII/unicode/filtered_unicode option or keyboard switch. |
| | | 40 | Note: need to handle tabs and backspaces specially |
| | | 41 | (bksp is for manpage compat). |
| | | 42 | |
| | | 43 | cut, fold, watch |
| | | 44 | |
| | | 45 | May need ability to cut unicode string to specified number of wchars |
| | | 46 | and/or to specified screen width. Need to handle tabs specially. |
| | | 47 | |
| | | 48 | sed, awk, grep |
| | | 49 | |
| | | 50 | Handle unicode-aware regexp match |
| 40 | | 51 | |
| 41 | ls (multi-column display) | 52 | ls (multi-column display) |
| 42 | | 53 | |
| 43 | . | 54 | ls will fail to line up columnar output if it will not account |
| | | 55 | for character widths (and maybe filter out some of them, see |
| | | 56 | above). OTOH, non-columnar views (ls -1, ls -l, ls \| car) |
| | | 57 | should NOT filter out bad unicode (but need to filter out |
| | | 58 | control chars (coreutils does that). Note that unlike more/less, |
| | | 59 | tabs and backspaces need not special handling. |
| 44 | | 60 | |
| 45 | top, ps | 61 | top, ps |
| 46 | | 62 | |
| 47 | . | 63 | Need to perform filtering similar to ls. |
| 48 | | 64 | |
| 49 | Filename display (in error messages and elsewhere) | 65 | Filename display (in error messages and elsewhere) |
| 50 | | 66 | |
| 51 | . | 67 | Need to perform filtering similar to ls. |
| 52 | | | |
| 53 | | 68 | |
| 54 | | 69 | |
| 55 | TODO: write an email to Asmus Freytag (asmus@unicode.org), | 70 | TODO: write an email to Asmus Freytag (asmus@unicode.org), |
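
The "cut, fold, watch" and "ls" entries added in this commit both come down to one operation: finding how many bytes of a UTF-8 string fit into a given number of screen columns. The following is a minimal sketch of that idea, not BusyBox code. All `demo_*` names are hypothetical, and the width table is a deliberately crude assumption (combining diacritics count as 0 columns, a few CJK ranges as 2, everything else as 1); a real implementation would more likely rely on `wcwidth()` or a fuller table. Invalid bytes and control characters are counted as one column each, on the assumption that the caller prints a placeholder for them. Tabs are not handled specially here.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: decode one UTF-8 sequence from s (n >= 1 bytes available).
 * Returns the number of bytes consumed. On a malformed, overlong, or
 * truncated sequence it consumes one byte and reports U+FFFD. */
static size_t demo_utf8_decode(const unsigned char *s, size_t n, uint32_t *cp)
{
    size_t len, i;
    uint32_t c, min;

    if (s[0] < 0x80) { *cp = s[0]; return 1; }
    if ((s[0] & 0xE0) == 0xC0)      { len = 2; c = s[0] & 0x1F; min = 0x80; }
    else if ((s[0] & 0xF0) == 0xE0) { len = 3; c = s[0] & 0x0F; min = 0x800; }
    else if ((s[0] & 0xF8) == 0xF0) { len = 4; c = s[0] & 0x07; min = 0x10000; }
    else { *cp = 0xFFFD; return 1; }          /* stray continuation or bad lead byte */

    if (len > n) { *cp = 0xFFFD; return 1; }  /* truncated sequence */
    for (i = 1; i < len; i++) {
        if ((s[i] & 0xC0) != 0x80) { *cp = 0xFFFD; return 1; }
        c = (c << 6) | (s[i] & 0x3F);
    }
    /* Reject overlong forms, surrogates, and values past U+10FFFF. */
    if (c < min || c > 0x10FFFF || (c >= 0xD800 && c <= 0xDFFF)) {
        *cp = 0xFFFD;
        return 1;
    }
    *cp = c;
    return len;
}

/* Hypothetical, simplified width table (an assumption, not a full
 * East Asian Width implementation). Returns -1 for control characters. */
static int demo_width(uint32_t cp)
{
    if (cp < 0x20 || (cp >= 0x7F && cp < 0xA0))
        return -1;
    if (cp >= 0x0300 && cp <= 0x036F)
        return 0;                             /* combining diacritical marks */
    if ((cp >= 0x1100 && cp <= 0x115F)
     || (cp >= 0x2E80 && cp <= 0xA4CF)
     || (cp >= 0xAC00 && cp <= 0xD7A3)
     || (cp >= 0xF900 && cp <= 0xFAFF)
     || (cp >= 0xFF00 && cp <= 0xFF60))
        return 2;                             /* roughly: CJK and fullwidth forms */
    return 1;
}

/* Returns how many leading bytes of s (n bytes long) fit in max_cols columns.
 * The cut never falls in the middle of a multi-byte character. */
static size_t demo_fit_columns(const char *s, size_t n, int max_cols)
{
    size_t pos = 0;
    int cols = 0;

    while (pos < n) {
        uint32_t cp;
        size_t len = demo_utf8_decode((const unsigned char *)s + pos, n - pos, &cp);
        int w = demo_width(cp);

        if (w < 0)
            w = 1;                            /* control char: one placeholder column */
        if (cols + w > max_cols)
            break;
        cols += w;
        pos += len;
    }
    return pos;
}
```

A caller in the style of `fold` would write the first `demo_fit_columns(line, len, width)` bytes, emit a newline, and continue from that offset. A columnar `ls` could use the same loop to measure a name's width before padding it.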
