Generate random number but exclude multiple ranges without looping

Question 1

I'm looking for a mathematical approach for generating a random number between [a, b) with holes at [c, d), [e, f), [g, h) and so on where a < b and the ranges are within the bounds.

I've found numerous examples here on how to do make the algorithm work if there is one missing range but can't seem to find a space/time efficient approach that generalizes to multiple ranges. What I mean here is that both:

a. A list of all possible ranges and choosing from that list: doesn't work well for large ranges

b. Generating a random number and checking if it is one of the ranges, otherwise trying again: unbounded terms of runtime

Some salient test cases might be:

generate_random(start=0, end=100, exclude: [(2,50),(51, 100)])
generate_random(start=0, end=1e16, exclude: [(1e6,1e7),(1e3, 1e4)])

Here are some of the examples I have found:

Question 2

So you want to pick any one of a..c-1, d..e-1, ..., x..b-1 ?

So N = (c-a) + (e-d) + ... + (b - x). Select random r in 0..N-1. If r < c, you are done. Set r = r + d, if r < e, you are done...

Question 3

If there are M ranges (where M is big) then we have a linear search (O(M)) to determine the offset for r. Special binary search by ranges would be faster (O(log M)).

Question 4

Ah this makes a bit more sense, okay, now what about if I want to expand this to two dimensions. (ie, my original goal which is to find unoccupied bounding boxes in an image of a preset width and height) The issue that will arise here is that I have more restrictions in that if I cannot make box of a specific height in a particular range that should be excluded as well.

Question 5

@Adithya: I'm guessing you mean that for each range a..c-1, d..e-1, etc. for x, you have a distinct set of ranges for y ? In the process of selecting x you need to note which range x is in -- say rx = 0 for a..c-1, and rx = 1 for d..e-1, etc. Then select a y based on the rx set of ranges.

Question 6

@ChrisHall that makes sense, is there a clean way to generalize that in a similar fashion to the calculation of N above or do I need to maintain a lookup table of some sort to keep track of what my constraints are with y with respect to x. I'll also post my own solution below of my some code in python that implements what you have above.

Question 7

@ChrisHall I realize that my prior question is likely out of scope for the above question, so I summarized my newer attempts and put together a new question: stackoverflow.com/questions/60533510/…

Question 8

Below is a Python implementation of the above algorithm from @Chris Hall's answer

def random_exclude(low: int, high: int, exclude: List[Tuple[int]]) -> int:
 N = -low+sum([(l-h) for l,h in exclude])+high
 r = np.random.randint(low, N)
 for l, h in exclude:
 if r < l:
 return r
 else:
 r+=h
 return r

Chris Hall Chris Hall 1,7921 gold badge6 silver badges15 bronze badges · Accepted Answer · 2020-03-04 13:39:35Z

1

So you want to pick any one of a..c-1, d..e-1, ..., x..b-1 ?

So N = (c-a) + (e-d) + ... + (b - x). Select random r in 0..N-1. If r < c, you are done. Set r = r + d, if r < e, you are done...

Share

Improve this answer

answered Mar 4, 2020 at 13:39

Chris Hall's user avatar

Chris Hall Chris Hall

1,7921 gold badge6 silver badges15 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

Dialecticus

Dialecticus Over a year ago

If there are M ranges (where M is big) then we have a linear search (O(M)) to determine the offset for r. Special binary search by ranges would be faster (O(log M)).

2020年03月04日T13:42:21.437Z+00:00

Adithya

Adithya Over a year ago

Ah this makes a bit more sense, okay, now what about if I want to expand this to two dimensions. (ie, my original goal which is to find unoccupied bounding boxes in an image of a preset width and height) The issue that will arise here is that I have more restrictions in that if I cannot make box of a specific height in a particular range that should be excluded as well.

2020年03月04日T18:51:01.883Z+00:00

Chris Hall

Chris Hall Over a year ago

@Adithya: I'm guessing you mean that for each range a..c-1, d..e-1, etc. for x, you have a distinct set of ranges for y ? In the process of selecting x you need to note which range x is in -- say rx = 0 for a..c-1, and rx = 1 for d..e-1, etc. Then select a y based on the rx set of ranges.

2020年03月04日T19:04:07.21Z+00:00

Adithya

Adithya Over a year ago

@ChrisHall that makes sense, is there a clean way to generalize that in a similar fashion to the calculation of N above or do I need to maintain a lookup table of some sort to keep track of what my constraints are with y with respect to x. I'll also post my own solution below of my some code in python that implements what you have above.

2020年03月04日T19:13:09.673Z+00:00

Adithya

Adithya Over a year ago

@ChrisHall I realize that my prior question is likely out of scope for the above question, so I summarized my newer attempts and put together a new question: stackoverflow.com/questions/60533510/…

2020年03月04日T19:37:25.16Z+00:00

CollectivesTM on Stack Overflow

Generate random number but exclude multiple ranges without looping

2 Answers 2

5 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Linked

Hot Network Questions

CollectivesTM on Stack Overflow

2 Answers 2

5 Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Post as a guest

Linked

Related