aboutsummaryrefslogtreecommitdiff
path: root/docs/unicode.txt
diff options
context:
space:
mode:
authorDenys Vlasenko <vda.linux@googlemail.com>2010-02-01 22:35:30 +0100
committerDenys Vlasenko <vda.linux@googlemail.com>2010-02-01 22:35:30 +0100
commit4875e7148b0512ee3c255526a484503da984935a (patch)
treebc931a92ab19716b86809d1c06a38f1c7c88ec7c /docs/unicode.txt
parent698dca5805117f470ef19488428c8a5f795b9e0c (diff)
downloadbusybox-w32-4875e7148b0512ee3c255526a484503da984935a.tar.gz
busybox-w32-4875e7148b0512ee3c255526a484503da984935a.tar.bz2
busybox-w32-4875e7148b0512ee3c255526a484503da984935a.zip
docs/unicode.txt: added more TODOs
Signed-off-by: Denys Vlasenko <vda.linux@googlemail.com>
Diffstat (limited to 'docs/unicode.txt')
-rw-r--r--docs/unicode.txt31
1 files changed, 23 insertions, 8 deletions
diff --git a/docs/unicode.txt b/docs/unicode.txt
index 019d12f65..32df24dc8 100644
--- a/docs/unicode.txt
+++ b/docs/unicode.txt
@@ -26,30 +26,45 @@ But we also need to handle the following problematic moments:
26* Bidirectional handling. If user wants to echo a phrase 26* Bidirectional handling. If user wants to echo a phrase
27 in Hebrew, he types: echo "srettel werbeH" 27 in Hebrew, he types: echo "srettel werbeH"
28 28
29 Editors 29 Editors (vi, ed)
30 30
31This case is a bit similar to "shell input", but unlike shell, 31This case is a bit similar to "shell input", but unlike shell,
32editors may encounder many more unexpected unicode sequences 32editors may encounder many more unexpected unicode sequences
33(try to load a random binry file...), and they need to preserve 33(try to load a random binary file...), and they need to preserve
34them, unlike shell which can afford to drop bogus input. 34them, unlike shell which can afford to drop bogus input.
35 35
36
37 more, less 36 more, less
38 37
39. 38Need to correctly display any input file. Ideally, with
39ASCII/unicode/filtered_unicode option or keyboard switch.
40Note: need to handle tabs and backspaces specially
41(bksp is for manpage compat).
42
43 cut, fold, watch
44
45May need ability to cut unicode string to specified number of wchars
46and/or to specified screen width. Need to handle tabs specially.
47
48 sed, awk, grep
49
50Handle unicode-aware regexp match
40 51
41 ls (multi-column display) 52 ls (multi-column display)
42 53
43. 54ls will fail to line up columnar output if it will not account
55for character widths (and maybe filter out some of them, see
56above). OTOH, non-columnar views (ls -1, ls -l, ls | car)
57should NOT filter out bad unicode (but need to filter out
58control chars (coreutils does that). Note that unlike more/less,
59tabs and backspaces need not special handling.
44 60
45 top, ps 61 top, ps
46 62
47. 63Need to perform filtering similar to ls.
48 64
49 Filename display (in error messages and elsewhere) 65 Filename display (in error messages and elsewhere)
50 66
51. 67Need to perform filtering similar to ls.
52
53 68
54 69
55TODO: write an email to Asmus Freytag (asmus@unicode.org), 70TODO: write an email to Asmus Freytag (asmus@unicode.org),