I spoke yesterday about Unicode, and the difference between the Unicode character set, and specific encodings of this character set. This post is a follow-up which describes in detail one particular and popular character set - UTF-8.
It’s imortant to understand that encoding is simply a means of representing a Unicode character in terms of [...]
How UTF-8 Encoding works
May 3rd, 2008 · No Comments · Programming
BOM - Is it part of the data?
May 3rd, 2008 · No Comments · Programming
This is a post in response to a comment at Ben Nadel’s blog by PaulH which I think is an interesting and important discussion, but sufficiently off-topic to the blog entry at hand that I didn’t want to completely derail the on-topic discussion.
Whereas initially BOM (Byte Order Marker U+FEFF) was intended to indicate the order [...]
Tags: bom·ColdFusion·Unicode·utf-8
Unicode: The absolute minimum every developer should know.
May 2nd, 2008 · 6 Comments · Programming
Over the past few years, I’ve been engaged in a few projects which have required international support. Via trial and fire, I’ve learned a fair amount about Unicode and character encoding. I now consider this to be essential knowledge for all programmers - web programmers especially. This becomes particularly important when you [...]
Tags: Unicode