1. Home
2. Questions
3. AI Assist
4. Tags
5. Challenges
6. Chat
7. Articles
8. Users
9. Companies
11. Communities for your favorite technologies. Explore all Collectives
Stack Internal

Stack Overflow for Teams is now called Stack Internal. Bring the best of human thought and AI automation together at your work.
Try for free Learn more
Bring the best of human thought and AI automation together at your work. Learn more

How to convert special characters into html entities?

Asked 13 years, 10 months ago

Viewed 5k times

I want to convert, in python, special characters like "%$!&@á é ©" and not only '<&">' as all the documentation and references I've found so far shows. cgi.escape doesn't solve the problem.

For example, the string "á ê ĩ &" should be converted to "á ê &itilde; &".

Does anyboy know how to solve it? I'm using python 2.6.

Improve this question

edited Mar 8, 2012 at 14:08

joshua's user avatar

joshua

2,3692 gold badges29 silver badges59 bronze badges

asked Mar 8, 2012 at 11:27

Jayme Tosi Neto's user avatar

Jayme Tosi Neto

1,2392 gold badges22 silver badges43 bronze badges

2

Be aware of two things: (1) names entites may cause problems, you should probably use numeric entities instead. (2) Why use entities at all? In most case, a better solution is to UTF-8-encode the document so that it can contain the letters, and not use entities.

Konrad Rudolph
– Konrad Rudolph

2012年03月08日 11:30:50 +00:00
Commented Mar 8, 2012 at 11:30
1

wiki.python.org/moin/EscapingHtml

Quentin
– Quentin

2012年03月08日 11:32:05 +00:00
Commented Mar 8, 2012 at 11:32
I agree with you @KonradRudolph. I don't like using entities, but the system in which I'm working uses, so I have no choice. =/

Jayme Tosi Neto
– Jayme Tosi Neto

2012年03月08日 11:35:12 +00:00
Commented Mar 8, 2012 at 11:35
1

@Jayme No problem, sometimes you have no choice. Just wanted to make sure you were aware of this.

Konrad Rudolph
– Konrad Rudolph

2012年03月08日 11:38:06 +00:00
Commented Mar 8, 2012 at 11:38

Add a comment |

2 Answers 2

Sorted by: Reset to default

You could build your own loop using the dictionaries you can find in http://docs.python.org/library/htmllib.html#module-htmlentitydefs

The one you're looking for is htmlentitydefs.codepoint2name

Improve this answer

answered Mar 8, 2012 at 11:30

Ruben Vermeersch's user avatar

Ruben Vermeersch

1,9431 gold badge19 silver badges27 bronze badges

1 Comment

oxidworks

oxidworks Over a year ago

The link is no longer working. Use HTMLParser instead in Python 2, and the equivalent, html.parser, in Python 3.

2017年02月21日T22:39:13.773Z+00:00

I found a built in solution searching for the htmlentitydefs.codepoint2name that @Ruben Vermeersch said in his answer. The solution was found here: http://bytes.com/topic/python/answers/594350-convert-unicode-chars-html-entities

Here's the function:

def htmlescape(text):
 text = (text).decode('utf-8')
 from htmlentitydefs import codepoint2name
 d = dict((unichr(code), u'&%s;' % name) for code,name in codepoint2name.iteritems() if code!=38) # exclude "&" 
 if u"&" in text:
 text = text.replace(u"&", u"&amp;")
 for key, value in d.iteritems():
 if key in text:
 text = text.replace(key, value)
 return text

Thank you all for helping! ;)

Improve this answer

answered Mar 8, 2012 at 11:46

Jayme Tosi Neto's user avatar

Jayme Tosi Neto

1,2392 gold badges22 silver badges43 bronze badges

Comments

Your Answer

Draft saved

Draft discarded

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking "Post Your Answer", you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

default

CollectivesTM on Stack Overflow

How to convert special characters into html entities?

2 Answers 2

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

CollectivesTM on Stack Overflow

2 Answers 2

1 Comment

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related