Re: Changes in the validation of UTF-8
- Subject: Re: Changes in the validation of UTF-8
- From: Andrew Gierth <andrew@...>
- Date: 18 March 2019 19:35:30 +0000
>>>>> "Roberto" == Roberto Ierusalimschy <roberto@inf.puc-rio.br> writes:
>> I noticed the new commit that adds support for longer (deprecated in
>> 2003) utf8 sequences:
>> https://github.com/lua/lua/commit/1e0c73d5b643707335b06abd2546a83d9439d14c
>>
>> I'm curious why this changed? It seems like a backwards step to me.
Roberto> Why is rejecting surrogates a backwards step?
Rejecting surrogates is a forward step; that's not the problem.
Accepting values over U+10FFFF is the backward step.
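
To make the distinction concrete, here is a minimal sketch in Python (not Lua's actual implementation) of the two separate checks being discussed, assuming the RFC 3629 rules from 2003: code points must not exceed U+10FFFF and must not fall in the surrogate range U+D800..U+DFFF. The helper names is_valid_scalar and strictly_decodes are hypothetical and chosen only for illustration.

```python
MAX_UNICODE = 0x10FFFF                          # upper limit set by RFC 3629
SURROGATE_LO, SURROGATE_HI = 0xD800, 0xDFFF     # reserved for UTF-16 pairs


def is_valid_scalar(cp: int) -> bool:
    """Return True if cp is a Unicode scalar value (hypothetical helper)."""
    if cp < 0 or cp > MAX_UNICODE:
        return False    # out of range: accepting these is the "backward step"
    if SURROGATE_LO <= cp <= SURROGATE_HI:
        return False    # surrogate: rejecting these is the "forward step"
    return True


def strictly_decodes(data: bytes) -> bool:
    """Return True if Python's strict UTF-8 decoder accepts data."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(is_valid_scalar(0x10FFFF))                # last valid code point
print(is_valid_scalar(0x110000))                # one past the limit
print(strictly_decodes(b"\xed\xa0\x80"))        # encodes surrogate U+D800
print(strictly_decodes(b"\xf4\x90\x80\x80"))    # encodes 0x110000
```

Python's built-in strict decoder applies both restrictions, so the last two byte sequences are rejected; a validator that enforces only the surrogate check would still let the final one through.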
--
Andrew.