I have a large array of more than 40000 elements:
a = ['15', '12', '', 18909, ...., '8989', '', '90789', '8']
I'm looking for a simple way to replace the empty '' values with '0' so that I can manipulate the data in the array using NumPy.
I would then convert the elements in my array into integers using
a = map(int, a)
so that I could find the mean of the array in numpy
a_mean = np.mean(a)
My issue is that I cannot convert the elements to integers to get a mean while the array still contains these missing values.
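The conversion itself is what fails on the empty strings, for example:
>>> int('')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ValueError: invalid literal for int() with base 10: ''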
4 Answers
You could make a small function that converts a single value exactly how you want it, e.g.:
def to_int(x):
    try:
        return int(x)
    except ValueError:
        return 0
which can be used with map:
In [22]: a = ['15', '12', '', 18909, '8989', '90789', '8']
In [23]: list(map(to_int, a))
Out[23]: [15, 12, 0, 18909, 8989, 90789, 8]
in a list comprehension:
In [25]: np.array([to_int(x) for x in a])
Out[25]: array([ 15, 12, 0, 18909, 8989, 90789, 8])
or in a generator expression to directly create a numpy array:
In [27]: np.fromiter((to_int(x) for x in a), dtype=int)
Out[27]: array([ 15, 12, 0, 18909, 8989, 90789, 8])
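Either way you end up with a numeric array, so the mean from the question follows directly. A small sketch, reusing a and to_int from above and assuming the empty entries should count as zeros in the average:
import numpy as np

a_int = np.fromiter((to_int(x) for x in a), dtype=int)
a_mean = np.mean(a_int)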
If I understood you right, it should look like this:
for i in range(len(a)):
    if a[i] == '':
        a[i] = '0'
You can also use:
a = list(map(lambda x: '0' if x == '' else x, a))
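After that replacement, the conversion and mean from the question work as intended. A short sketch (the list() around map is only needed on Python 3):
import numpy as np

a = list(map(int, a))   # every element now converts cleanly
a_mean = np.mean(a)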
From previous learning on SO, I see you can apply the solution below to convert the NaN values to zeros:
import numpy as np

a = np.array([[0, 1, 2], [3, 4, np.nan]])
where_are_NaNs = np.isnan(a)
a[where_are_NaNs] = 0
Secondly, nan_to_num(), as I said earlier in my comment:
>>> import numpy as np
>>> a = np.array([[0, 1, 2], [3, 4, np.nan]])
>>> a
array([[  0.,   1.,   2.],
       [  3.,   4.,  nan]])
>>> a = np.nan_to_num(a)
>>> a
array([[ 0.,  1.,  2.],
       [ 3.,  4.,  0.]])
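To connect this to the string data in the question, one possible bridge (just a sketch, assuming the empty strings should be treated as NaN first; the sample list here is made up):
import numpy as np

raw = ['15', '12', '', '8989', '', '90789', '8']
a = np.array([float(x) if x else np.nan for x in raw])   # '' -> nan
a = np.nan_to_num(a)                                     # nan -> 0.0
a_mean = a.mean()
If the empty entries should be ignored rather than counted as zeros, np.nanmean(a) before the nan_to_num step is another option.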
A more verbose answer is:
acc = 0
for v in a:
    acc += int(v or 0)   # '' is falsy, so empty strings count as 0
a_mean = acc / len(a)
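One caveat: on Python 2 the division above is integer division, so the mean gets truncated. A minor tweak (not part of the original answer), if a fractional result is wanted:
a_mean = acc / float(len(a))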
new_a = [int(v or 0) for v in a] and then use new_a? numpy.nan_to_num?