Return to Question

replaced http://codereview.stackexchange.com/ with https://codereview.stackexchange.com/

edited Apr 13, 2017 at 12:40

This is the complete rewrite of a function to decode an UTF-8 codepoints and return the encoding length (or 0 if the codepoint is 0円). In response to the review of my previous version review of my previous version I decided that I needed a different approach and I should recognize ALL the invalid encodings (as per reviewer suggestion). If an invalid encoding is found, the first byte is returned with length 1.

This is the complete rewrite of a function to decode an UTF-8 codepoints and return the encoding length (or 0 if the codepoint is 0円). In response to the review of my previous version I decided that I needed a different approach and I should recognize ALL the invalid encodings (as per reviewer suggestion). If an invalid encoding is found, the first byte is returned with length 1.

edited tags

Link

edited Sep 25, 2016 at 15:48

200_success

edited Sep 25, 2016 at 15:48

200_success

145.5k
22
190
479

performance c state-machine utf-8

added 4 characters in body

Source Link

edited Sep 24, 2016 at 19:48

Remo.D

edited Sep 24, 2016 at 19:48

Remo.D

I also compared my code with one directly based on the original version by Bjoern Hoehrmann (using a table to represent the FSM) and I found it having the same speedthem being equally fast but mine being much clearer in terms of understanding how it works.

I also compared my code with one directly based on the original version by Bjoern Hoehrmann (using a table to represent the FSM) and I found it having the same speed but being much clearer in terms of understanding how it works.

I also compared my code with one directly based on the original version by Bjoern Hoehrmann (using a table to represent the FSM) and I found them being equally fast but mine being much clearer in terms of understanding how it works.

added 83 characters in body

Source Link

edited Sep 24, 2016 at 19:08

Remo.D

edited Sep 24, 2016 at 19:08

Remo.D

Source Link

asked Sep 24, 2016 at 7:29

Remo.D

asked Sep 24, 2016 at 7:29

Remo.D

lang-c