1. Home
2. Questions
3. AI Assist Labs
4. Tags
5. Challenges
6. Chat
7. Articles
8. Users
9. Companies
11. Communities for your favorite technologies. Explore all Collectives
Teams

Ask questions, find answers and collaborate at work with Stack Overflow for Teams.
Try Teams for free Explore Teams
Ask questions, find answers and collaborate at work with Stack Overflow for Teams. Explore Teams

Array of index values for unique elements in list

Asked 6 years, 2 months ago

Viewed 74 times

I have the following list:

x = np.array([1, 1, 2, 2, 2])

with np.unique values of [1, 2]

How do I generate the following list:

[1, 2, 1, 2, 3]

i.e. a running index from 1 for each of the unique elements in the list x.

Improve this question

edited Jul 16, 2019 at 10:47

ajrlewisajrlewis

asked Jul 16, 2019 at 10:39

ajrlewis's user avatar

ajrlewis ajrlewis

3,0584 gold badges38 silver badges77 bronze badges

Add a comment |

3 Answers 3

Sorted by: Reset to default

you can use pandas.cumcount() after grouping by the value itself, it does exactly that:

Number each item in each group from 0 to the length of that group - 1.

try this:

import numpy as np
import pandas as pd
x = np.array([1, 1, 2, 2, 2])
places = list(pd.Series(x).groupby(by=x).cumcount().values + 1)
print(places)

Output:

[1, 2, 1, 2, 3]

Improve this answer

answered Jul 16, 2019 at 10:49

Adam.Er8's user avatar

Adam.Er8 Adam.Er8

13.5k3 gold badges31 silver badges43 bronze badges

Thanks. Is there a way to do this with out pandas?

ajrlewis
– ajrlewis

2019年07月16日 10:52:16 +00:00
Commented Jul 16, 2019 at 10:52
1

@alex_lewis I tried with native numpy function but couldn't get any nice solution. the other way I think of is using list.index in a loop, which will be 100x times slower :( maybe someon else will come up with something numpy only.

Adam.Er8
– Adam.Er8

2019年07月16日 11:07:56 +00:00
Commented Jul 16, 2019 at 11:07

Add a comment |

Just use return_counts=True of np.unique with listcomp and np.hstack. It is still faster pandas solution

c = np.unique(x, return_counts=True)[1]
np.hstack([np.arange(item)+1 for item in c])
Out[869]: array([1, 2, 1, 2, 3], dtype=int64)

Improve this answer

answered Jul 16, 2019 at 11:14

Andy L.'s user avatar

Andy L. Andy L.

25.3k4 gold badges20 silver badges30 bronze badges

this will only work if the values are consecutive. for x = np.array([1, 1, 2, 2, 2, 1]) you'll get [1 2 3 1 2 3] while you should be getting [1 2 1 2 3 3]

Adam.Er8
– Adam.Er8

2019年07月16日 12:01:26 +00:00
Commented Jul 16, 2019 at 12:01

Add a comment |

I'm not sure, if this is any faster or slower solution, but if you need just a list result with no pandas, you could try this

arr = np.array([1, 1, 2, 2, 2])
from collections import Counter
ranges = [range(1,v+1) for k,v in Counter(arr).items()]
result = []
for l in ranges:
 result.extend(list(l))
print(result)

[1, 2, 1, 2, 3]

(or make your own counter with dict instead of Counter())

Improve this answer

answered Jul 16, 2019 at 11:51

Alexey Bogomolov's user avatar

Alexey Bogomolov Alexey Bogomolov

1341 silver badge8 bronze badges

Add a comment |

Your Answer

Draft saved

Draft discarded

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking "Post Your Answer", you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

lang-py

CollectivesTM on Stack Overflow

Array of index values for unique elements in list

3 Answers 3

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Hot Network Questions

CollectivesTM on Stack Overflow

3 Answers 3

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Related