Post Timeline

deleted 10 characters in body

edited Dec 9, 2013 at 10:03

svk

You need to know the encoding of the input string. There is no reliable universal solution.

The encoding should be available from the source of the input string. For instance, if you're taking text from a web page, the encoding should be indicated as part of the Content-Type, either in an HTTP response header from the server or in a <meta> tag in the page source.
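As a rough illustration of reading the encoding from a Content-Type value, here is a minimal Python 3 sketch. The helper name `charset_from_content_type` and the sample header strings are made up for this example; it leans on the standard library's `email.message.Message` to parse the header parameters, and it does not cover the `<meta>` tag case.

```python
from email.message import Message


def charset_from_content_type(value, default=None):
    """Return the charset parameter of a Content-Type value, lowercased.

    Hypothetical helper for illustration; returns `default` when the
    header carries no charset parameter.
    """
    msg = Message()
    msg["Content-Type"] = value
    return msg.get_content_charset(default)


# Sample header values (assumptions, not taken from a real response).
with_charset = charset_from_content_type("text/html; charset=Shift_JIS")
without_charset = charset_from_content_type("text/html")
```

If the header has no charset parameter, the helper returns the default rather than guessing, which matches the point that the encoding has to come from the source.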

Once you know the encoding, use the decode method.

This string appears to be Shift-JIS:

>>> x = '\x83h\x83L\x83\x85\x83\x81\x83\x93\x83g\x82\xf0\x96|\x96\xf3\x82\xb5\x82\xdc\x82\xb7'
>>> print x.decode( "shift-jis" )
ドキュメントを翻訳します
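The snippet above is Python 2. A rough Python 3 equivalent, assuming the input arrives as a bytes object, might look like the sketch below. It also shows why guessing is unreliable: the same bytes decode without error as Latin-1 but give nonsense, while UTF-8 rejects them outright. The variable names are only illustrative.

```python
# The same byte sequence as in the answer, written as a Python 3 bytes literal.
raw = b'\x83h\x83L\x83\x85\x83\x81\x83\x93\x83g\x82\xf0\x96|\x96\xf3\x82\xb5\x82\xdc\x82\xb7'

# Decoding with the correct, known encoding gives the intended text.
text = raw.decode("shift_jis")
print(text)

# Latin-1 maps every byte to some character, so this never raises,
# but the result is mojibake rather than the original text.
wrong = raw.decode("latin-1")

# UTF-8 is stricter: these bytes are not valid UTF-8, so decoding fails.
try:
    raw.decode("utf-8")
    utf8_ok = True
except UnicodeDecodeError:
    utf8_ok = False
```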

answered Dec 9, 2013 at 9:56

svk

You need to know the encoding of the input string. There is no reliable universal solution.

Once you know the encoding, use the decode method.

This string appears to be Shift-JIS:

>>> x = '\x83h\x83L\x83\x85\x83\x81\x83\x93\x83g\x82\xf0\x96|\x96\xf3\x82\xb5\x82\xdc\x82\xb7'
>>> print x.decode( "shift-jis" )
ドキュメントを翻訳します
