How to convert python Unicode string to bytes

Asked 8 years, 5 months ago

Viewed 7k times

I have a string x as below

x = "\xe9\x94\x99\xe8\xaf\xaf"

This string should be Unicode string, but cannot be displayed (print) correctly.

And the string y is Unicode string/ bytes started with b, And y can be displayed correctly by y.decode('utf-8')

y = b"\xe9\x94\x99\xe8\xaf\xaf"

My question is how to convert x to y ?

Improve this question

asked Jul 13, 2017 at 7:55

ybdesire's user avatar

ybdesire

1,7411 gold badge22 silver badges38 bronze badges

1

How are those supposed to be displayed? My Windows sees "é[x][x]è" for x, and "[x][x]" for y.

Right leg
– Right leg

2017年07月13日 08:08:39 +00:00
Commented Jul 13, 2017 at 8:08

Add a comment |

1 Answer 1

Sorted by: Reset to default

Assuming we're talking about Python3, the Unicode string x is 6 code points long. It happens to be that each of those code points is in range 0x00 to 0xff (ASCII subset). We can get the exact byte string with the raw_unicode_escape codec, like this:

>>> x = "\xe9\x94\x99\xe8\xaf\xaf"
>>> y = x.encode('raw_unicode_escape')
>>> y
b'\xe9\x94\x99\xe8\xaf\xaf'
>>> y.decode('utf8')
'错误'

Note that this will only work if the string x contains only ASCII subrange of Unicode; otherwise you'll just get escaped Unicode code points (as the codec's name suggests):

>>> "šž".encode('raw_unicode_escape')
b'\\u0161\\u017e'

Improve this answer

edited Jul 13, 2017 at 8:19

answered Jul 13, 2017 at 8:13

randomir's user avatar

randomir

18.8k1 gold badge46 silver badges60 bronze badges

Comments

Your Answer

Draft saved

Draft discarded

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking "Post Your Answer", you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

lang-py

CollectivesTM on Stack Overflow

How to convert python Unicode string to bytes

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

CollectivesTM on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related