Newest 'python-module-unicodedata' Questions

1. Home
2. Questions
3. AI Assist
4. Tags
5. Challenges
6. Chat
7. Articles
8. Users
9. Companies
11. Communities for your favorite technologies. Explore all Collectives
Stack Internal

Stack Overflow for Teams is now called Stack Internal. Bring the best of human thought and AI automation together at your work.
Try for free Learn more
Bring the best of human thought and AI automation together at your work. Learn more

16 questions

1 vote

2 answers

116 views

Convert Full width numbers into Normal numbers in python

I have a data in an excel file(only 1 column) where there are several japanese characters followed by fullwidth numbers. I want to convert these numbers into normal numbers. いつもありがとう890ございます ...

monnomm's user avatar

monnomm

asked Apr 17, 2024 at 16:35

0 votes

1 answer

89 views

More efficient way to replace special chars with their unicode name in pandas df

I have a large pandas dataframe and would like to perform a thorough text cleaning on it. For this, I have crafted the below code that evaluates if a character is either an emoji, number, Roman number,...

lazarea's user avatar

lazarea

1,369

asked Jan 31, 2022 at 17:07

0 votes

2 answers

868 views

Capture output including control characters of subprocess

I have the following simple program to run a subprocess and tee its output to both stdout and some buffer import subprocess import sys import time import unicodedata p = subprocess.Popen( "...

Mugen's user avatar

Mugen

9,285

asked Nov 29, 2021 at 9:04

0 votes

1 answer

745 views

Convert check mark in Python

I have a dataframe which has, in a certain column, a check mark (unicode: '\u2714'). I have been trying to replace it with the following coomand: import unicodedata df['Column'].str.replace(...

bellotto's user avatar

bellotto

asked Apr 22, 2021 at 12:54

1 vote

0 answers

44 views

UnicodeEncodeError printing Hangul characters in the terminal [duplicate]

This application runs on a mac only and I'm stuck with Python 2. I have an input string '한글' which when decoded through an online unicode converter shows as \u1112\u1161\u11ab\u1100\u1173\u11af ...

Lewis's user avatar

Lewis

asked Jul 4, 2020 at 13:31

0 votes

1 answer

1k views

Understanding unistr of unicodedata.normalize()

Wikipedia basically says the following for the four values of unistr. - NFC (Normalization Form Canonical Composition) - Characters are decomposed - then recomposed by canonical equivalence. -...

user1424739's user avatar

user1424739

14.2k

asked Jan 30, 2020 at 4:21

3 votes

1 answer

429 views

Determine if a unicode character exists in a unicode subset

I'd like to find a way to determine if a Unicode character exists in a standardized subset of Unicode characters, specifically Latin basic and Latin-1. I am using Python 2 and the unicodedata module ...

rustinpeace91's user avatar

rustinpeace91

asked Aug 10, 2019 at 16:25

2 votes

1 answer

1k views

What are the differences between the modules unicode and unicodedata?

I have a large dataset with over 2 million rows of textual data. Now I want to remove the accents from the strings. In the link below, two different modules are described to remove the accents: ...

Emil's user avatar

Emil

1,752

asked May 8, 2019 at 13:50

-1 votes

1 answer

497 views

C++ implementation of python unicodedata library

New user here, please be gentle. we are looking to implement a piece of python code in c++, but it involves some intricate unicode library called unicodedata, in particular this function ...

John Jiang's user avatar

John Jiang

asked Mar 15, 2019 at 17:11

-1 votes

1 answer

74 views

how to return values from map function on dataframe

I am trying to return values from map function but instead it gives me the memory address. I tried using list, but then it gives me an error stating str object doesn't have an attribute decode. Is ...

via2's user avatar

via2

asked Oct 30, 2018 at 2:26

3 votes

1 answer

2k views

Python convert this utf8 string to latin1

I have this UTF-8 string: s = "Naděždaüäö" Which I'd like to convert to a UTF-8 string which can be encoded in "latin-1" without throwing an exception. I'd like to do so by replacing every character ...

Dominik Neise's user avatar

Dominik Neise

1,249

asked Jul 5, 2018 at 7:57

0 votes

3 answers

963 views

How to remove every possible accents from a column in python

I am new in python. I have a data frame with a column, named 'Name'. The column contains different type of accents. I am trying to remove those accents. For example, rubén => ruben, zuñiga=zuniga, ...

user3642360's user avatar

user3642360

asked Jun 12, 2018 at 17:24

1 vote

2 answers

1k views

Remove special characters from string such as smileys but keep german special charactes

I know how to remove unwanted charactes in a string, like smileys etc. However, some languages like german have special charactes, too. This is my current code: import unicodedata string = "süß 😆😋...

Kev1n91's user avatar

Kev1n91

3,723

asked Jan 15, 2018 at 20:48

2 votes

1 answer

2k views

Get a list of all Greek unicode characters

I would like to know how to obtain a list of all Greek characters (upper and lowercase letters). I know how to find specific characters (unicodedata.lookup(name)), but I want all upper and lowercase ...

Microlith57's user avatar

Microlith57

asked Dec 23, 2017 at 22:15

3 votes

1 answer

1k views

What is the difference between unicodedata.digit and unicodedata.numeric?

From unicodedata doc: unicodedata.digit(chr[, default]) Returns the digit value assigned to the character chr as integer. If no such value is defined, default is returned, or, if not given, ...

user avatar

user1785721

asked Aug 28, 2017 at 16:37

15 30 50 per page

2 Next

CollectivesTM on Stack Overflow

Convert Full width numbers into Normal numbers in python

More efficient way to replace special chars with their unicode name in pandas df

Capture output including control characters of subprocess

Convert check mark in Python

UnicodeEncodeError printing Hangul characters in the terminal [duplicate]

Understanding unistr of unicodedata.normalize()

Determine if a unicode character exists in a unicode subset

What are the differences between the modules unicode and unicodedata?

C++ implementation of python unicodedata library

how to return values from map function on dataframe

Python convert this utf8 string to latin1

How to remove every possible accents from a column in python

Remove special characters from string such as smileys but keep german special charactes

Get a list of all Greek unicode characters

What is the difference between unicodedata.digit and unicodedata.numeric?

Hot Network Questions