What's the deal with Python 3.4, Unicode, different languages and Windows? [duplicate]

Happy examples:

#!/usr/bin/env python
# -*- coding: utf-8 -*-
czech = u'Leoš Janáček'.encode("utf-8")
print(czech)
pl = u'Zdzisław Beksiński'.encode("utf-8")
print(pl)
jp = u'リング 山村 貞子'.encode("utf-8")
print(jp)
chinese = u'五行'.encode("utf-8")
print(chinese)
MIR = u'Машина для Инженерных Расчётов'.encode("utf-8")
print(MIR)
pt = u'Minha Língua Portuguesa: çáà'.encode("utf-8")
print(pt)

Unhappy output:

b'Leo\xc5\xa1 Jan\xc3\xa1\xc4\x8dek'
b'Zdzis\xc5\x82aw Beksi\xc5\x84ski'
b'\xe3\x83\xaa\xe3\x83\xb3\xe3\x82\xb0 \xe5\xb1\xb1\xe6\x9d\x91 \xe8\xb2\x9e\xe5\xad\x90'
b'\xe4\xba\x94\xe8\xa1\x8c'
b'\xd0\x9c\xd0\xb0\xd1\x88\xd0\xb8\xd0\xbd\xd0\xb0 \xd0\xb4\xd0\xbb\xd1\x8f \xd0\x98\xd0\xbd\xd0\xb6\xd0\xb5\xd0\xbd\xd0\xb5\xd1\x80\xd0\xbd\xd1\x8b\xd1\x85 \xd0\xa0\xd0\xb0\xd1\x81\xd1\x87\xd1\x91\xd1\x82\xd0\xbe\xd0\xb2'
b'Minha L\xc3\xadngua Portuguesa: \xc3\xa7\xc3\xa1\xc3\xa0'

And if I print them like this:

jp = u'リング 山村 貞子'
print(jp)

I get:

Traceback (most recent call last):
  File "x.py", line 5, in <module>
    print(jp)
  File "C:\Python34\lib\encodings\cp850.py", line 19, in encode
    return codecs.charmap_encode(input,self.errors,encoding_map)[0]
UnicodeEncodeError: 'charmap' codec can't encode characters in position 0-2: character maps to <undefined>

I've also tried the following from this question (and other alternatives that involve sys.stdout.encoding):

#!/usr/bin/env python
# -*- coding: utf-8 -*-
from __future__ import print_function
import sys

def safeprint(s):
    try:
        print(s)
    except UnicodeEncodeError:
        if sys.version_info >= (3,):
            print(s.encode('utf8').decode(sys.stdout.encoding))
        else:
            print(s.encode('utf8'))

jp = u'リング 山村 貞子'
safeprint(jp)

And things get even more cryptic:

πâ¬πâ│πé░ σ▒▒μ\æ Φ▓₧σ¡É

And the docs were not very helpful.

So, what's the deal with Python 3.4, Unicode, different languages and Windows? Almost all the examples I could find deal with Python 2.x.

Is there a general and cross-platform way of printing ANY Unicode character from any language in a decent and non-nasty way in Python 3.4?

EDIT:

I've tried typing at the terminal:

chcp 65001

to change the code page, as proposed here and in the comments, and it did not work (including the attempt with sys.stdout.encoding).

Answer

**Update:** [Since Python 3.6, the code example that prints Unicode strings directly should just work now (even without `py -mrun`)](https://stackoverflow.com/a/32176732/4279).
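
To decide whether the workarounds below are still needed, a script can check the platform and interpreter version. This is a minimal sketch that assumes the Python 3.6 threshold from the update note; the helper name `console_needs_workaround` is hypothetical and not part of any library:

```python
import sys

def console_needs_workaround(platform=sys.platform, version=sys.version_info):
    """Return True on Windows interpreters older than Python 3.6.

    The 3.6 cut-off is taken from the update note above; the function
    name and signature are illustrative assumptions.
    """
    return platform == 'win32' and tuple(version[:2]) < (3, 6)

if console_needs_workaround():
    print('Consider py -mrun, IDLE, or PYTHONIOENCODING as described below.')
```

Passing `platform` and `version` explicitly makes it possible to check the logic on any machine, not only on an old Windows install.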

---
Python can print text in multiple languages in the Windows console regardless of the code page that `chcp` reports:

 T:\> py -mpip install win-unicode-console
 T:\> py -mrun your_script.py

where `your_script.py` prints Unicode directly e.g.:

 #!/usr/bin/env python3
 print('š áč') # cz
 print('ł ń') # pl
 print('リング') # jp
 print('五行') # cn
 print('ш я жх ё') # ru
 print('í çáà') # pt

All you need to do is configure a font in your Windows console that can display the desired characters.
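
For contrast, the question's first example calls `.encode("utf-8")` before printing, so `print()` receives `bytes` and shows their `repr` instead of the text. The sketch below illustrates that difference, and also how decoding UTF-8 bytes with a legacy code page yields mojibake. `cp437` is an assumption chosen for illustration; it is not necessarily the code page that was active in the question:

```python
text = 'リング'

# str.encode() returns bytes; print() shows bytes as an escaped b'...' literal.
encoded = text.encode('utf-8')
print(type(encoded).__name__, repr(encoded))

# Decoding UTF-8 bytes with an unrelated single-byte code page does not
# recover the text: each byte is mapped to some other character (mojibake).
garbled = encoded.decode('cp437')  # cp437 is an illustrative assumption
print(len(text), len(garbled))

# Decoding with the encoding that produced the bytes round-trips cleanly.
restored = encoded.decode('utf-8')
print(restored == text)
```

Three characters become nine because each of them occupies three bytes in UTF-8 and a single-byte code page turns every byte into its own character.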

You could also run your Python script via IDLE without installing non-stdlib modules:

 T:\> py -midlelib -r your_script.py

To write to a file/pipe, use `PYTHONIOENCODING=utf-8` as [@Mark Tolonen suggested](https://stackoverflow.com/a/30540470/4279):

 T:\> set PYTHONIOENCODING=utf-8
 T:\> py your_script.py >output-utf8.txt 
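
The effect of `PYTHONIOENCODING` on redirected output can be checked from Python itself. The following is a sketch rather than part of the original answer: it assumes Python 3.5+ (for `subprocess.run`), and the helper name `run_with_utf8_io` is hypothetical:

```python
import os
import subprocess
import sys

def run_with_utf8_io(code):
    """Run `code` in a child interpreter with PYTHONIOENCODING=utf-8
    and return the raw bytes the child wrote to its piped stdout."""
    env = dict(os.environ, PYTHONIOENCODING='utf-8')
    result = subprocess.run(
        [sys.executable, '-c', code],
        env=env,
        stdout=subprocess.PIPE,
        check=True,
    )
    return result.stdout

# The child source uses ASCII-only escapes for the two characters of the
# 'cn' example, so the command line does not depend on the parent's encoding.
raw = run_with_utf8_io(r"print('\u4e94\u884c')")
print(raw.rstrip() == '\u4e94\u884c'.encode('utf-8'))
```

Setting the variable only in the child's environment keeps the change local to that one process, which is the same scope that `set` gives within a single console session.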

Only the last solution supports non-BMP characters such as [😒 (U+1F612 UNAMUSED FACE)](https://codepoints.net/U+1F612). `py -mrun` can write them, but the Windows console displays them as boxes even if the font supports the corresponding Unicode characters (though you can copy-paste the boxes into another program to get the characters).
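
"Non-BMP" means a code point above U+FFFF, outside the Basic Multilingual Plane. Such a character needs two 16-bit code units in UTF-16 and four bytes in UTF-8. A small sketch to check whether a character falls into that category (the helper name `is_non_bmp` is hypothetical):

```python
def is_non_bmp(ch):
    """Return True if the single character `ch` lies outside the BMP."""
    return ord(ch) > 0xFFFF

face = '\U0001F612'  # UNAMUSED FACE, the example character from the text

# Count UTF-16 code units (2 bytes each) and look at the UTF-8 bytes.
utf16_units = len(face.encode('utf-16-le')) // 2
utf8_bytes = face.encode('utf-8')

print(is_non_bmp(face), utf16_units, len(utf8_bytes))
```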

How would you do the interactive versions? I guess Python is python -i -m run, but I cannot figure out ipython, even though it's stated on win-unicode-console's page that it's integrated. – hyperknot, Aug 7, 2015 at 21:48

@zsero: the docs show several approaches, e.g., py -i -m run c:\path\to\ipython. You could also use the qtconsole interface or a web-browser-based notebook. If it doesn't work for you, ask a separate question about what you want to do with ipython and what exactly fails. – jfs, Aug 7, 2015 at 22:14

@eryksun: no. Notice that py -mrun is used. – jfs, Aug 24, 2015 at 6:15

@sebastian I guess I solved my issue with your help. Your answer is a bit confusing: as a Python 3.6 user I did not understand whether I should ignore or take into account what you write below it. If that is the case, a note such as "for previous versions:" would make it clearer. Thanks for your patience! – JinSnow, Jan 13, 2017 at 20:36

Lucida Console doesn't support Chinese or Japanese either. – Mark Tolonen, Jan 13, 2017 at 23:38