Page 11 of 11
Posted: 27 Apr 2008 16:53
by infimum
admin wrote:All I could do is to create always UNICODE files if the system has a DBCS codepage (as japan, korea, china, thailand). Is that good?
Not restricted to DBCS, but Arabic, Greek, Hebrew, Russian, among others would probably do "funny things."

If you are serious about the phrase "Explorer replacement," you have to find a way to handle this. There has to be a way. There are so many mp3 tag editors, for exmaple, that can handle unicode fine. There is no reason you alone can't find a way to handle it
What does Explorer when you drop an "ö" in Japan?
Windows has been in full-unicode since Windows 2000. Of course it can handle "ö". I don't really get why you keep asking about Windows native applications such as Explorer and Notepad. Why doubt? It's been a long time since Windows 2000. It's high time for other applications to catch up with this.
Posted: 27 Apr 2008 20:50
by admin
infimum wrote:What does Explorer when you drop an "ö" in Japan?
Windows has been in full-unicode since Windows 2000. Of course it can handle "ö". I don't really get why you keep asking about Windows native applications such as Explorer and Notepad. Why doubt? It's been a long time since Windows 2000. It's high time for other applications to catch up with this.
Notepad does allow you to save a file containing only the word "Cologne" as ANSI (= 7 bytes), but a file containing the word "Köln" must be saved as UNICODE (= 2 * 4 + 2 BOM = 10 bytes), correct?
Posted: 27 Apr 2008 21:04
by admin
infimum wrote:admin wrote:All I could do is to create always UNICODE files if the system has a DBCS codepage (as japan, korea, china, thailand). Is that good?
Not restricted to DBCS, but Arabic, Greek, Hebrew, Russian, among others would probably do "funny things."
AFAIK: wrong! It's actually only these four (not Thai):
• 932 (Japanese Shift-JIS)
• 936 (Simplified Chinese GBK)
• 949 (Korean)
• 950 (Traditional Chinese Big5)
And the funny effects are due to "
unused leadbytes", see
http://www.microsoft.com/globaldev/reference/wincp.mspx
Google for "unused leadbytes" and you'll find not much...
Posted: 27 Apr 2008 22:30
by infimum
admin wrote:Notepad does allow you to save a file containing only the word "Cologne" as ANSI (= 7 bytes), but a file containing the word "Köln" must be saved as UNICODE (= 2 * 4 + 2 BOM = 10 bytes), correct?
Correct.
admin wrote:Google for "unused leadbytes" and you'll find not much...
Yeah, people moved onto unicode

Posted: 28 Apr 2008 07:50
by admin
infimum wrote:
admin wrote:Google for "unused leadbytes" and you'll find not much...
Yeah, people moved onto unicode

