Decoding utf8 literal python

Asked 5 years, 9 months ago

Viewed 294 times

I am trying to decode strings in a list of strings, for example 'caf\\xc3\\xab' what I want if this to be 'café'.

I tried some things but ran into problems.

when i do:

for i in range(len(words):
 words[i] = words[i].decode("utf8")

I still need to convert to byte type but how do I do this,

also when I do it like this I need to remove the double backslashes for this to work

b'caf\\xc3\\xab'.decode("utf8")

Improve this question

edited Mar 26, 2020 at 16:10

asked Mar 26, 2020 at 11:56

Loïc Noest's user avatar

Loïc Noest

1251 silver badge11 bronze badges

python2's str is bytes, you can just use unicode or ues python3 (in python3 str is unicode)

panda912
– panda912

2020年03月26日 13:33:44 +00:00
Commented Mar 26, 2020 at 13:33
I use python3 but read the strings from a file in that specific format

Loïc Noest
– Loïc Noest

2020年03月26日 13:56:10 +00:00
Commented Mar 26, 2020 at 13:56
words.decode() is not an in-place operation, you need to capture the return value: word = word.decode("utf8"). (Further note: this will only change the value of the loop variable word, but not the elements in words.)

lenz
– lenz

2020年03月26日 15:32:10 +00:00
Commented Mar 26, 2020 at 15:32

Add a comment |

1 Answer 1

Sorted by: Reset to default

Suppose you have string as follow:

bef = 'caf\\xc3\\xab'

To convert to 'café' you can do the following:

aft = bef.encode().decode('unicode-escape').encode('latin1').decode('utf-8')

Then print(aft) should show 'café'

Improve this answer

edited Mar 26, 2020 at 18:10

Nikos Hidalgo's user avatar

Nikos Hidalgo

3,7669 gold badges27 silver badges41 bronze badges

answered Mar 26, 2020 at 17:04

Yosua's user avatar

Yosua

4213 silver badges7 bronze badges

Comments

Your Answer

Draft saved

Draft discarded

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking "Post Your Answer", you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

lang-py

CollectivesTM on Stack Overflow

Decoding utf8 literal python

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

CollectivesTM on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related