
Indexing array inside an array

Asked 5 years, 5 months ago

Viewed 76 times

I have a 2d array that takes this kind of form:

[5643, 22, 0.67, [1.00, 0.05, -0.044....]]
[6733, 12, -0.44, [0.00, 1.00, -0.08...]]

So it has dimensions of roughly 13k x 4, but the 4th column of every row is itself an array.

What I'd like to do is subset this array so that I keep only the rows for which the yth element of the 4th column is greater than 0.

My current approach is this:

mask = [x[y] > 0 for x in array[:,3]]
new_array = array[mask]

Is there a faster way to do this?

edited Jul 26, 2020 at 0:23 by Red

asked Jul 25, 2020 at 18:28 by Harpreet Paul


You could attempt to utilize the filter method.

– Arvin Kushwaha
Commented Jul 25, 2020 at 18:34

What is the expected output?

– Red
Commented Jul 26, 2020 at 0:24

3 Answers

Try this:

y = 1
# array is the data from the question; filter keeps rows where the test is true
list(filter(lambda x: x[3][y] > 0, array))

edited Jul 26, 2020 at 0:30 by Red

answered Jul 25, 2020 at 23:08 by Akshay Sehgal

Use the if clause of a list comprehension:

new_array = [r for r in array if r[3][y] > 0]
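
As a minimal sketch, the condition can be checked on a tiny made-up dataset. The `rows` values and `y = 1` below are illustrative assumptions, not the asker's data. Note that both the list comprehension and the built-in `filter` return plain Python lists of rows rather than a NumPy array.

```python
# Hypothetical sample rows in the shape described in the question:
# three scalars followed by a nested list of floats.
rows = [
    [5643, 22, 0.67, [1.00, 0.05, -0.044]],
    [6733, 12, -0.44, [0.00, 1.00, -0.08]],
    [7001, 30, 0.10, [0.50, -0.20, 0.30]],
]
y = 1  # index inside the nested 4th column (assumed for illustration)

# Keep only rows whose nested element at position y is positive.
kept = [r for r in rows if r[3][y] > 0]

# The built-in filter selects the same rows.
kept_via_filter = list(filter(lambda r: r[3][y] > 0, rows))

print([r[0] for r in kept])  # [5643, 6733]
```

If the result needs to be an array again, it would have to be rebuilt from the list afterwards.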

answered Jul 26, 2020 at 2:21 by rioV8

The fastest way to do this is to not pack arrays inside other arrays. Doing so causes many issues, including not being able to use the shape attribute of NumPy arrays effectively.
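
To illustrate the shape problem, here is a small sketch with made-up values (the `sample` data and the names `nested` and `inner` are assumptions for illustration). The outer object array reports only its own dimensions, so the length of the embedded arrays is invisible until they are stacked into a regular array.

```python
import numpy as np

# Hypothetical data: two rows, each ending in a nested list.
sample = [
    [5643, 22, 0.67, [1.00, 0.05, -0.044]],
    [6733, 12, -0.44, [0.00, 1.00, -0.08]],
]

# Fill an object array element by element so each nested list
# is stored as a single cell.
nested = np.empty((len(sample), 4), dtype=object)
for i, row in enumerate(sample):
    for j, value in enumerate(row):
        nested[i, j] = value

print(nested.shape)  # (2, 4)

# Stacking the 4th column yields a regular 2-D array whose shape
# exposes the inner dimension (requires equal-length inner lists).
inner = np.stack(nested[:, 3])
print(inner.shape)  # (2, 3)
```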

So, first split your data into two arrays: one with 13k rows and 3 columns, and another that also has 13k rows and whose number of columns depends on the dimensionality of the embedded array. Call these X and Y.

You can then do the following:

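import numpy as np

# Split the scalar columns from the embedded arrays
X, Y = array[:, :3], array[:, 3]
# Stack the embedded arrays into one 2-D array; np.asarray would
# leave Y as a 1-D object array, so Y[:, y] would fail
Y = np.stack(Y)
mask = Y[:, y] > 0
X, Y = X[mask], Y[mask]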
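
The following end-to-end sketch applies the split to a small made-up array and checks that the vectorized mask matches the loop from the question. The sample values, `y = 1`, and the conversion of `X` to float are assumptions for illustration; it also assumes every embedded array has the same length.

```python
import numpy as np

# Hypothetical rows: three scalars plus a nested list each.
rows = [
    [5643, 22, 0.67, [1.00, 0.05, -0.044]],
    [6733, 12, -0.44, [0.00, 1.00, -0.08]],
    [7001, 30, 0.10, [0.50, -0.20, 0.30]],
]
y = 1  # assumed index into the embedded arrays

# Build the mixed object array as described in the question.
array = np.empty((len(rows), 4), dtype=object)
for i, row in enumerate(rows):
    for j, value in enumerate(row):
        array[i, j] = value

# Split once into two regular numeric arrays.
X = array[:, :3].astype(float)  # scalar columns
Y = np.stack(array[:, 3])       # embedded arrays as a 2-D array

# Vectorized mask over the y-th embedded column.
mask = Y[:, y] > 0

# The mask from the question's list comprehension, for comparison.
loop_mask = [x[y] > 0 for x in array[:, 3]]

print(mask.tolist() == loop_mask)  # True
print(X[mask].shape)               # (2, 3)
```

The speed benefit comes from doing the split and stack once and then filtering on the regular array; if the data can be stored as separate `X` and `Y` arrays from the start, the conversion step disappears entirely.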

answered Jul 26, 2020 at 5:57 by amdex
