On Fri, Jan 16, 2015 at 12:11:41PM +0000, Aapo Talvensaari wrote:
> Is it by design that utf.len count the BOM to length?
>
> Say utf8.len("\xEF\xBB\xBFa") will return 2 instead of 1?
Given UTF8 has only one valid "byte order", it makes no sense to ever
include a byte order marker in a UTF8 document.
B.