str to bytes in Python 3.3

Asked 11 years, 1 month ago

Viewed 1k times

How can I get b'\xe3\x81\x82' from '\xe3\x81\x82'?

Ultimately, I want u'\u3042', which is the Japanese letter 'あ'.

b'\xe3\x81\x82'.decode('utf-8') returns u'\u3042', but

'\xe3\x81\x82'.decode('utf-8') raises the following error:

AttributeError: 'str' object has no attribute 'decode'

This happens because b'\xe3\x81\x82' is bytes and '\xe3\x81\x82' is str.

I have a database with data like '\xe3\x81\x82'.


asked Dec 1, 2014 at 12:32


papico


1 Answer

If you have bytes disguised as Unicode code points, encode to Latin-1 to recover the bytes, then decode as UTF-8:

'\xe3\x81\x82'.encode('latin1').decode('utf-8')

Latin-1 (ISO-8859-1) maps the first 256 Unicode code points (U+0000 to U+00FF) one-to-one to bytes, so the byte values come back unchanged:

>>> '\xe3\x81\x82'.encode('latin1').decode('utf-8')
'あ'
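The round trip above can be broken into its two steps to make the intermediate bytes visible. This is only a minimal sketch: the helper name `fix_mojibake` is made up for illustration, and it assumes every character in the input is in the range U+0000 to U+00FF. Anything outside that range cannot be encoded as Latin-1 and raises UnicodeEncodeError.

```python
def fix_mojibake(text):
    """Reinterpret a str whose code points are really UTF-8 byte values.

    Hypothetical helper for illustration; assumes all characters
    are in the range U+0000..U+00FF.
    """
    # Step 1: Latin-1 turns each code point 0x00..0xFF into the same byte value.
    raw = text.encode('latin-1')
    # Step 2: the recovered bytes are decoded as the UTF-8 they really are.
    return raw.decode('utf-8')


broken = '\xe3\x81\x82'           # a str of 3 characters, not bytes
raw = broken.encode('latin-1')
print(raw)                        # b'\xe3\x81\x82'
fixed = fix_mojibake(broken)
print(fixed == '\u3042')          # True
```

If the stored value already contains real non-Latin-1 text (for example 'あ' itself), the encode step fails rather than silently corrupting it, so it may be worth catching UnicodeEncodeError when applying this to a whole table.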


answered Dec 1, 2014 at 12:36


Martijn Pieters
