author | Denys Vlasenko <vda.linux@googlemail.com> | 2010-02-01 22:35:30 +0100 |
---|---|---|
committer | Denys Vlasenko <vda.linux@googlemail.com> | 2010-02-01 22:35:30 +0100 |
commit | 4875e7148b0512ee3c255526a484503da984935a (patch) | |
tree | bc931a92ab19716b86809d1c06a38f1c7c88ec7c /docs/unicode.txt | |
parent | 698dca5805117f470ef19488428c8a5f795b9e0c (diff) | |
docs/unicode.txt: added more TODOs
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
Diffstat (limited to 'docs/unicode.txt')
-rw-r--r-- | docs/unicode.txt | 31 |
1 files changed, 23 insertions, 8 deletions
diff --git a/docs/unicode.txt b/docs/unicode.txt
index 019d12f65..32df24dc8 100644
--- a/docs/unicode.txt
+++ b/docs/unicode.txt
@@ -26,30 +26,45 @@ But we also need to handle the following problematic moments:
26 | * Bidirectional handling. If user wants to echo a phrase | 26 | * Bidirectional handling. If user wants to echo a phrase |
27 | in Hebrew, he types: echo "srettel werbeH" | 27 | in Hebrew, he types: echo "srettel werbeH" |
28 | 28 | ||
29 | Editors | 29 | Editors (vi, ed) |
30 | 30 | ||
31 | This case is a bit similar to "shell input", but unlike shell, | 31 | This case is a bit similar to "shell input", but unlike shell, |
32 | editors may encounder many more unexpected unicode sequences | 32 | editors may encounder many more unexpected unicode sequences |
33 | (try to load a random binry file...), and they need to preserve | 33 | (try to load a random binary file...), and they need to preserve |
34 | them, unlike shell which can afford to drop bogus input. | 34 | them, unlike shell which can afford to drop bogus input. |
35 | 35 | ||
36 | |||
37 | more, less | 36 | more, less |
38 | 37 | ||
39 | . | 38 | Need to correctly display any input file. Ideally, with |
39 | ASCII/unicode/filtered_unicode option or keyboard switch. | ||
40 | Note: need to handle tabs and backspaces specially | ||
41 | (bksp is for manpage compat). | ||
42 | |||
43 | cut, fold, watch | ||
44 | |||
45 | May need ability to cut unicode string to specified number of wchars | ||
46 | and/or to specified screen width. Need to handle tabs specially. | ||
47 | |||
48 | sed, awk, grep | ||
49 | |||
50 | Handle unicode-aware regexp match | ||
40 | 51 | ||
41 | ls (multi-column display) | 52 | ls (multi-column display) |
42 | 53 | ||
43 | . | 54 | ls will fail to line up columnar output if it will not account |
55 | for character widths (and maybe filter out some of them, see | ||
56 | above). OTOH, non-columnar views (ls -1, ls -l, ls | car) | ||
57 | should NOT filter out bad unicode (but need to filter out | ||
58 | control chars (coreutils does that). Note that unlike more/less, | ||
59 | tabs and backspaces need not special handling. | ||
44 | 60 | ||
45 | top, ps | 61 | top, ps |
46 | 62 | ||
47 | . | 63 | Need to perform filtering similar to ls. |
48 | 64 | ||
49 | Filename display (in error messages and elsewhere) | 65 | Filename display (in error messages and elsewhere) |
50 | 66 | ||
51 | . | 67 | Need to perform filtering similar to ls. |
52 | |||
53 | 68 | ||
54 | 69 | ||
55 | TODO: write an email to Asmus Freytag (asmus@unicode.org), | 70 | TODO: write an email to Asmus Freytag (asmus@unicode.org), |
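
Illustration (not part of the commit): the "cut, fold, watch" item added above asks for cutting a unicode string to a specified screen width, with tabs handled specially. A minimal sketch of that idea follows. It is not BusyBox code, and the names utf8_seq_len and utf8_cut_to_width are hypothetical. It assumes UTF-8 input, one screen column per well-formed character, one column per malformed byte (as if shown as a placeholder), and tab stops every 8 columns. A real implementation would likely consult wcwidth() or an equivalent table for wide and zero-width characters.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: length in bytes (1..4) of the UTF-8 sequence at s,
 * or 0 if the bytes do not form a well-formed sequence within n bytes.
 * Simplified: overlong forms and surrogates are not rejected. */
static size_t utf8_seq_len(const unsigned char *s, size_t n)
{
    size_t len, i;

    if (n == 0)
        return 0;
    if (s[0] < 0x80)
        len = 1;
    else if ((s[0] & 0xE0) == 0xC0)
        len = 2;
    else if ((s[0] & 0xF0) == 0xE0)
        len = 3;
    else if ((s[0] & 0xF8) == 0xF0)
        len = 4;
    else
        return 0;               /* stray continuation or invalid lead byte */
    if (len > n)
        return 0;               /* sequence truncated by end of buffer */
    for (i = 1; i < len; i++)
        if ((s[i] & 0xC0) != 0x80)
            return 0;           /* expected a continuation byte */
    return len;
}

/* Hypothetical helper: number of leading bytes of str[0..n) that fit into
 * max_cols screen columns. Assumes width 1 per character (no wide or
 * combining characters) and tab stops at multiples of 8. Never splits a
 * well-formed multibyte sequence. */
size_t utf8_cut_to_width(const char *str, size_t n, unsigned max_cols)
{
    const unsigned char *s = (const unsigned char *)str;
    size_t pos = 0;
    unsigned col = 0;

    assert(str != NULL || n == 0);
    while (pos < n) {
        size_t len = utf8_seq_len(s + pos, n - pos);
        unsigned next_col;

        if (len == 0)
            len = 1;                        /* malformed byte: one column */
        if (s[pos] == '\t')
            next_col = (col / 8 + 1) * 8;   /* advance to next tab stop */
        else
            next_col = col + 1;
        if (next_col > max_cols)
            break;                          /* this character would not fit */
        col = next_col;
        pos += len;
    }
    return pos;
}
```

Under these assumptions, utf8_cut_to_width("a\tb", 3, 8) returns 2: the tab fills columns 2 through 8, so "b" no longer fits. Returning a byte count rather than copying lets the caller decide whether to truncate in place or write only the prefix.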