On Wed, Nov 9, 2011 at 12:42 PM, Joshua Bell
<jsbell@chromium.org> wrote:
I've updated the doc - http://wiki.whatwg.org/wiki/StringEncoding - to reflect the discussion on this thread, most notably:
* Removed detectEncoding
* Added stringLength, removed special-case null termination
* Reordered arguments so input is first
* Updated JS "shim" implementation
* Sprinkled a few more "ISSUES" in where the doc still needs updating and/or a decision made
Further feedback is appreciated.
- do we want to say anything about canonical forms? For example, are over-long UTF-8 sequences allowed? How are combining marks/etc represented?
- decode should specify the behavior if byteLength stops inside a multi-byte sequence for a character
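
To make the two points above concrete, here is a small sketch using Python's built-in UTF-8 codec. It is only an illustration of how one existing decoder answers these questions, not the proposed API; the helper name is_rejected and the sample bytes are my own choices.

```python
import codecs
import unicodedata


def is_rejected(data: bytes) -> bool:
    """Return True if Python's strict UTF-8 decoder refuses the input."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return True
    return False


# 1. Over-long form: 0xC0 0xAF is a two-byte encoding of "/" (U+002F),
#    which already has the one-byte encoding 0x2F.
overlong_rejected = is_rejected(b"\xc0\xaf")

# 2. Combining marks: the same visible text can be two different
#    code point sequences, and therefore two different byte sequences.
precomposed = "\u00e9"   # e with acute, single code point
decomposed = "e\u0301"   # "e" followed by a combining acute accent
same_bytes = precomposed.encode("utf-8") == decomposed.encode("utf-8")
same_after_nfc = unicodedata.normalize("NFC", decomposed) == precomposed

# 3. Length stops inside a multi-byte sequence: the euro sign is three
#    bytes in UTF-8; keep only the first two.
euro = "\u20ac".encode("utf-8")
truncated = euro[:2]
strict_rejected = is_rejected(truncated)
# Option A: substitute U+FFFD for the incomplete sequence.
replaced = truncated.decode("utf-8", errors="replace")
# Option B: a streaming decoder buffers the partial sequence and
# emits the character once the remaining byte arrives.
decoder = codecs.getincrementaldecoder("utf-8")()
first = decoder.decode(truncated)
second = decoder.decode(euro[2:])
```

Here the strict decoder throws on both the over-long and the truncated input, the precomposed and decomposed forms encode to different bytes unless the caller normalizes first, and the streaming decoder returns nothing for the partial input and the full character on the next call. Whichever of these behaviors we want (throw, replace, or buffer), the doc should say so explicitly.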