Page 11 of 11

Posted: 27 Apr 2008 16:53
by infimum
admin wrote:All I could do is to create always UNICODE files if the system has a DBCS codepage (as japan, korea, china, thailand). Is that good?
Not restricted to DBCS, but Arabic, Greek, Hebrew, Russian, among others would probably do "funny things.":wink: If you are serious about the phrase "Explorer replacement," you have to find a way to handle this. There has to be a way. There are so many mp3 tag editors, for exmaple, that can handle unicode fine. There is no reason you alone can't find a way to handle it :wink:
What does Explorer when you drop an "ö" in Japan?
Windows has been in full-unicode since Windows 2000. Of course it can handle "ö". I don't really get why you keep asking about Windows native applications such as Explorer and Notepad. Why doubt? It's been a long time since Windows 2000. It's high time for other applications to catch up with this.

Posted: 27 Apr 2008 20:50
by admin
infimum wrote:
What does Explorer when you drop an "ö" in Japan?
Windows has been in full-unicode since Windows 2000. Of course it can handle "ö". I don't really get why you keep asking about Windows native applications such as Explorer and Notepad. Why doubt? It's been a long time since Windows 2000. It's high time for other applications to catch up with this.
Notepad does allow you to save a file containing only the word "Cologne" as ANSI (= 7 bytes), but a file containing the word "Köln" must be saved as UNICODE (= 2 * 4 + 2 BOM = 10 bytes), correct?

Posted: 27 Apr 2008 21:04
by admin
infimum wrote:
admin wrote:All I could do is to create always UNICODE files if the system has a DBCS codepage (as japan, korea, china, thailand). Is that good?
Not restricted to DBCS, but Arabic, Greek, Hebrew, Russian, among others would probably do "funny things."
AFAIK: wrong! It's actually only these four (not Thai):

• 932 (Japanese Shift-JIS)
• 936 (Simplified Chinese GBK)
• 949 (Korean)
• 950 (Traditional Chinese Big5)

And the funny effects are due to "unused leadbytes", see http://www.microsoft.com/globaldev/reference/wincp.mspx

Google for "unused leadbytes" and you'll find not much...

Posted: 27 Apr 2008 22:30
by infimum
admin wrote:Notepad does allow you to save a file containing only the word "Cologne" as ANSI (= 7 bytes), but a file containing the word "Köln" must be saved as UNICODE (= 2 * 4 + 2 BOM = 10 bytes), correct?
Correct.
admin wrote:Google for "unused leadbytes" and you'll find not much...
Yeah, people moved onto unicode :wink:

Posted: 28 Apr 2008 07:50
by admin
infimum wrote:
admin wrote:Google for "unused leadbytes" and you'll find not much...
Yeah, people moved onto unicode :wink:
:?