Python unicode woes

What is the correct way to convert '\xbb' into a unicode string? I have tried the following and only get UnicodeDecodeError:

unicode('\xbb', 'utf-8')
'\xbb'.decode('utf-8')

asked Mar 21, 2011 at 21:42

Jason Christa

It is part of a file that someone pasted from Word (so it's a str). If you type print u'\xbb' you get the double arrow (») character.

– Jason Christa

Commented Mar 21, 2011 at 21:50

3 Answers

Since it comes from Word it's probably CP1252.

>>> print '\xbb'.decode('cp1252')
»
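
For readers on Python 3, where the Python 2 str in the question corresponds to a bytes object, the same decode can be sketched as below. It also shows why the UTF-8 attempts in the question fail: a lone 0xBB byte is not a complete UTF-8 sequence. The variable names are illustrative only, and CP1252 is still an assumption about the file's origin.

```python
# Python 3 sketch: the Python 2 str '\xbb' corresponds to the bytes object b'\xbb'.
raw = b"\xbb"

# Decoding as CP1252 (Windows-1252) maps byte 0xBB to U+00BB.
text = raw.decode("cp1252")
print(text)  # »

# The UTF-8 attempt fails because a lone 0xBB is not a valid UTF-8 sequence.
try:
    raw.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False

# In UTF-8 the same character is encoded as two bytes.
utf8_bytes = text.encode("utf-8")
```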

answered Mar 21, 2011 at 21:57

Ignacio Vazquez-Abrams

It looks to be Latin-1 encoded. You should use:

unicode('\xbb', 'Latin-1')
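
For this particular byte, Latin-1 and CP1252 give the same result; the two codecs only differ in the 0x80-0x9F range, which is where Word's curly quotes usually end up. A minimal Python 3 sketch of that difference, using a made-up sample string:

```python
# Python 3 sketch; the byte strings below are illustrative samples.
raw = b"\xbb"

# 0xBB decodes to the same character in both codecs.
same_result = raw.decode("latin-1") == raw.decode("cp1252")

# Bytes 0x93 and 0x94 are curly quotes in CP1252,
# but invisible C1 control characters in Latin-1.
curly = b"\x93quoted\x94"
as_cp1252 = curly.decode("cp1252")
as_latin1 = curly.decode("latin-1")
```

If the file contains any such bytes, decoding as Latin-1 will succeed silently but yield control characters instead of the intended punctuation.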

answered Mar 21, 2011 at 21:56

Ioan Alexandru Cucu

I'm not sure what you are trying to do, but in Python 3 all strings are Unicode by default. In Python 2.x you have to write u'my unicode string \xbb' (or the double- or triple-quoted equivalent) to get a unicode string. When you want to print a unicode string, you have to encode it in a character set that the output device (e.g. the terminal) supports: u'my unicode string \xbb'.encode('iso-8859-1'), for instance.
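
A minimal Python 3 sketch of that encoding step follows. The target encodings are assumptions about what the output device accepts, and the variable names are illustrative.

```python
# Python 3 sketch: str is already Unicode, so no u'' prefix is needed.
text = "my unicode string \xbb"

# Encode for a device that understands ISO-8859-1 (Latin-1).
latin1_bytes = text.encode("iso-8859-1")

# ASCII cannot represent U+00BB; errors="replace" substitutes "?"
# instead of raising UnicodeEncodeError.
ascii_bytes = text.encode("ascii", errors="replace")
```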

edited Mar 8, 2012 at 14:15

Bill the Lizard

answered Mar 21, 2011 at 22:00

Bernhard
