Expanding a randomized Two Envelope Problem

Question 1

There is a previous problem called the Envelope Paradox with a detailed explanation and solution given here. In short, the problem involves two envelopes with random (on some probability distribution $P$) amounts inside. You draw one and have to guess whether it contains more or less than the other. General intuition dictates you have at most a 50% chance of being right. However, by generating a random number on the same distribution, you may increase your chances to 66%.

My question is: What is the upper bound on the maximum accuracy when generating an infinite amount of random numbers when there are 3 envelopes?

Furthemore, a much harder one may be: what would the upper bound be for $k$ envlopes?

My investigation just for k=2 gave me the following

I've noticed that, by generating more numbers, $n,ドル you increase your chances of being right. Specifically, for $k = 2$ as $n\rightarrow\infty$ the probability you are correct approaches a maximum of 75ドル\%$. We can see this graphically: MATLAB Simulation with 100,000 Monte Carlo Trials for each $n$

Or, we may just compute it. The number of arrangements of random numbers are equal to ${n+k\choose n}$. In the case of $k=2,ドル there simply are $\frac{(n+2)(n+1)}{2}$ arrangements. Making a guess based on these arrangements are dependent on where the majority of the numbers lie; let each number be $y_i \;\forall i = 1,2,\ldots,n$. For simplicity, lets work with just odd $n$'s. It is fairly intutive that the accuracy for $n+1$ is greater than $n,ドル while computationally, it can be seen that $\left(\Pr(n+1)-\Pr(n)\right)\approx0$ for large $n$'s. So we may assume that, for a large $n$'s, analyzing just odd $n$'s is enough.

When less than half the random numbers ($floor(n/2)$) are below those in the two envelopes $(a,b),ドル there are an easily countable number of events where our prediction will have 100ドル\%$ accuracy. We may then see that there exist $$\sum_{j=0}^{floor(n/2)}{n-ceil(n/2)+1\choose n-ceil(n/2)} = \left(floor(n/2)+1)\right)(n-ceil(n/2)+1) = x$$ of these situations. As shown in the previous link, all other events will only lead you to a correct guess 50ドル\%$ of the time; you can work this out yourself just for odd $n$'s. Thus, the probability of you being correct having generated some odd $n$ random numbers is:

$$\left((100\%)x + (50\%)({n+2\choose n}-x)\right)/{n+2\choose n}$$

$$ = 1/2 + \frac{x}{(n+1)(n+2)}$$

When looking at just odd $n$'s,

$$ 1/2 + \frac{\left(\frac{n-1}{2}\right)\left(n-\frac{n+1}{2}+1\right)}{(n+1)(n+2)}$$

$$\lim_{n\to\infty} \left[ 1/2 + \frac{\left(\frac{n-1}{2}\right)\left(n-\frac{n+1}{2}+1\right)}{(n+1)(n+2)} \right] = 1/2 + 1/4 = 75\%$$

Running 1,000,000 Monte Carlo trials where $n=10000$ gave me 0ドル.7446,ドル which is pretty close. The same for $k=3$ gave me 0ドル.5143,ドル but how do we get there mathematically?

Question 2

First, we can considerably simplify your analysis of the $k=2$ case. By comparing an arbitrary number of samples from the distribution with the given value $v$, you're effectively finding the cumulative distribution function $F(v)$ at that value. The strategy is to switch if this is less than $\frac12$, and the success probability is 1ドル$ if the two values in the envelopes are on opposite sides of $F(v)=\frac12$ and $\frac12$ otherwise, for a total success probability of $\frac12\cdot1+\frac12\cdot\frac12=\frac34$. Since only $F(v)$ enters into the calculations, we may as well assume uniform distribution on $[0,1]$ and use $F(v)=v$.

Unfortunately you didn't specify how you want to generalise the problem to $k$ envelopes. There are a number of possible generalisations:

Maximise the probability of correctly guessing whether the first value is the highest
Maximise the probability of correctly guessing the rank of the first value
Maximise the probability of stopping at the highest value
Maximise the expected value

Variant 4ドル$ depends on the distribution and is thus a different kind of problem.

For variant 1ドル$, we should guess that the first value $v$ is the highest if it is above some threshold $t$. Then the probability that we guess right is

$$ \int_0^t\left(1-v^{k-1}\right)\mathrm dv+\int_t^1v^{k-1}\mathrm dv=t+\frac1k\left(1-2t^k\right)\;. $$

Setting the derivative with respect to $t$ to 0ドル$ yields 1ドル-2t^{k-1}=0$ and thus $t=2^{-1/(k-1)}$, with resulting success probability

$$ 2^{-1/(k+1)}+\frac1k\left(1-2\cdot2^{-k/(k-1)}\right)=\frac1k+2^{-1/(k-1)}\left(1-\frac1k\right)\;, $$

which reduces to $t=\frac12$ and success probability $\frac34$ for $k=2$. For $k=3$, the success probability is

$$\frac13+2^{-1/2}\cdot\frac23\approx80\%\;,$$

so this is apparently not what you had in mind.

For variant 2ドル$, we need $k-1$ thresholds $t_1,\ldots,t_{k-1}$, and each of them should be placed where the likelihood for two adjacent ranks is equal, so

$$ \binom{k-1}{i-1}t_i^{i-1}(1-t_i)^{k-i}=\binom{k-1}it_i^i(1-t_i)^{k-i-1}\;, $$

with solution $t_i=\frac ik$, so the thresholds are evenly spaced as one might expect. The success probability is

$$ \sum_{i=1}^k\int_{(i-1)/k}^{i/k}\binom{k-1}{i-1}v^{i-1}(1-v)^{k-i}\mathrm dv\;. $$

I don't know how to simplify that for general $k$. For $k=3$, the success probability is

\begin{align} &\int_0^\frac13\binom20(1-v)^2\mathrm dv+\int_\frac13^\frac23\binom21v(1-v)\mathrm dv\int_\frac23^1\binom22v^2\mathrm dv\\ ={}&\frac{19}{81}+\frac{13}{81}+\frac{19}{81}\\ ={}&\frac{51}{81}\\ \approx{}&63\%\;, \end{align}

so this is apparently also not what you had in mind.

Variant 3ドル$ is known as the full-information best-choice problem; for references see e.g. Winning rate in the full-information best-choice problem by A. Gnedin and D. Miretskiy. Here the success probability tends to approximately 58ドル\%$ for $k\to\infty$. For $k=3$, we can consider two thresholds $t_1$ and $t_2$ and optimise the resulting success probability. If we stop at the first value $v_1$ if it is above $t_1$, we will succeed on the first value with probability

$$ \int_{t_1}^1v_1^2\mathrm dv_1=\frac13\left(1-t_1^3\right)\;. $$

The second threshold $t_2$ should be lower than $t_1$ (since we have fewer chances afterwards). Then the probability to win on the second value is

\begin{align} &\int_0^{t_2}\mathrm dv_1\int_{t_2}^1v_2\mathrm dv_2+\int_{t_2}^{t_1}\mathrm dv_1\int_{v_1}^1v_2\mathrm dv_2 \\ ={}& \int_0^{t_2}\mathrm dv_1\frac12\left(1-t_2^2\right)+\int_{t_2}^{t_1}\mathrm dv_1\frac12\left(1-v_1^2\right)\\ ={}&\frac12t_1-\frac16t_1^3-\frac13t_2^3\;, \end{align}

and the probability to win on the third value is

\begin{align} &\int_0^{t_1}\mathrm dv_1\int_0^{v_1}(1-v_1)\mathrm dv_2+\int_0^{t_2}\mathrm dv_1\int_{v_1}^{t_2}(1-v_2)\mathrm dv_2 \\ ={}& \int_0^{t_1}\mathrm dv_1v_1(1-v_1)+\int_0^{t_2}\mathrm dv_1\left((t_2-v_1)-\frac12\left(t_2^2-v_1^2\right)\right) \\ ={}& \frac12t_1^2-\frac13t_1^3+t_2^2-\frac12t_2^2-\frac12t_2^3+\frac16t_2^3 \\ ={}& \frac12t_1^2-\frac13t_1^3+\frac12t_2^2-\frac13t_2^3\;. \end{align}

Thus the total success probability is

$$ \frac13-\frac13t_1^3+\frac12t_1-\frac16t_1^3-\frac13t_2^3+\frac12t_1^2-\frac13t_1^3+\frac12t_2^2-\frac13t_2^3 \\ =\frac13+\frac12t_1+\frac12t_1^2-\frac56t_1^3+\frac12t_2^2-\frac23t_2^3\;. $$

Optimising yields $t_2=\frac12$ (unsurprisingly) and $\frac12+t_1-\frac52t_1^2=0$, with positive solution $t_1=(1+\sqrt6)/5\approx0.6899$, and the optimal success probability is

$$ \frac13+\frac12\frac{1+\sqrt6}5+\frac12\left(\frac{1+\sqrt6}5\right)^2-\frac56\left(\frac{1+\sqrt6}5\right)^3+\frac12\left(\frac12\right)^2-\frac23\left(\frac12\right)^3\\=\frac{293+48\sqrt6}{600}\approx68\%\;, $$

so this is apparently also not what you had in mind.

Which raises the question: What did you have in mind?

joriki 243k15 gold badges312 silver badges550 bronze badges · Accepted Answer · 2016-06-22 21:00:37Z

First, we can considerably simplify your analysis of the $k=2$ case. By comparing an arbitrary number of samples from the distribution with the given value $v$, you're effectively finding the cumulative distribution function $F(v)$ at that value. The strategy is to switch if this is less than $\frac12$, and the success probability is 1ドル$ if the two values in the envelopes are on opposite sides of $F(v)=\frac12$ and $\frac12$ otherwise, for a total success probability of $\frac12\cdot1+\frac12\cdot\frac12=\frac34$. Since only $F(v)$ enters into the calculations, we may as well assume uniform distribution on $[0,1]$ and use $F(v)=v$.

Unfortunately you didn't specify how you want to generalise the problem to $k$ envelopes. There are a number of possible generalisations:

Maximise the probability of correctly guessing whether the first value is the highest
Maximise the probability of correctly guessing the rank of the first value
Maximise the probability of stopping at the highest value
Maximise the expected value

Variant 4ドル$ depends on the distribution and is thus a different kind of problem.

For variant 1ドル$, we should guess that the first value $v$ is the highest if it is above some threshold $t$. Then the probability that we guess right is

$$ \int_0^t\left(1-v^{k-1}\right)\mathrm dv+\int_t^1v^{k-1}\mathrm dv=t+\frac1k\left(1-2t^k\right)\;. $$

Setting the derivative with respect to $t$ to 0ドル$ yields 1ドル-2t^{k-1}=0$ and thus $t=2^{-1/(k-1)}$, with resulting success probability

$$ 2^{-1/(k+1)}+\frac1k\left(1-2\cdot2^{-k/(k-1)}\right)=\frac1k+2^{-1/(k-1)}\left(1-\frac1k\right)\;, $$

which reduces to $t=\frac12$ and success probability $\frac34$ for $k=2$. For $k=3$, the success probability is

$$\frac13+2^{-1/2}\cdot\frac23\approx80\%\;,$$

so this is apparently not what you had in mind.

For variant 2ドル$, we need $k-1$ thresholds $t_1,\ldots,t_{k-1}$, and each of them should be placed where the likelihood for two adjacent ranks is equal, so

$$ \binom{k-1}{i-1}t_i^{i-1}(1-t_i)^{k-i}=\binom{k-1}it_i^i(1-t_i)^{k-i-1}\;, $$

with solution $t_i=\frac ik$, so the thresholds are evenly spaced as one might expect. The success probability is

$$ \sum_{i=1}^k\int_{(i-1)/k}^{i/k}\binom{k-1}{i-1}v^{i-1}(1-v)^{k-i}\mathrm dv\;. $$

I don't know how to simplify that for general $k$. For $k=3$, the success probability is

\begin{align} &\int_0^\frac13\binom20(1-v)^2\mathrm dv+\int_\frac13^\frac23\binom21v(1-v)\mathrm dv\int_\frac23^1\binom22v^2\mathrm dv\\ ={}&\frac{19}{81}+\frac{13}{81}+\frac{19}{81}\\ ={}&\frac{51}{81}\\ \approx{}&63\%\;, \end{align}

so this is apparently also not what you had in mind.

Variant 3ドル$ is known as the full-information best-choice problem; for references see e.g. Winning rate in the full-information best-choice problem by A. Gnedin and D. Miretskiy. Here the success probability tends to approximately 58ドル\%$ for $k\to\infty$. For $k=3$, we can consider two thresholds $t_1$ and $t_2$ and optimise the resulting success probability. If we stop at the first value $v_1$ if it is above $t_1$, we will succeed on the first value with probability

$$ \int_{t_1}^1v_1^2\mathrm dv_1=\frac13\left(1-t_1^3\right)\;. $$

The second threshold $t_2$ should be lower than $t_1$ (since we have fewer chances afterwards). Then the probability to win on the second value is

\begin{align} &\int_0^{t_2}\mathrm dv_1\int_{t_2}^1v_2\mathrm dv_2+\int_{t_2}^{t_1}\mathrm dv_1\int_{v_1}^1v_2\mathrm dv_2 \\ ={}& \int_0^{t_2}\mathrm dv_1\frac12\left(1-t_2^2\right)+\int_{t_2}^{t_1}\mathrm dv_1\frac12\left(1-v_1^2\right)\\ ={}&\frac12t_1-\frac16t_1^3-\frac13t_2^3\;, \end{align}

and the probability to win on the third value is

\begin{align} &\int_0^{t_1}\mathrm dv_1\int_0^{v_1}(1-v_1)\mathrm dv_2+\int_0^{t_2}\mathrm dv_1\int_{v_1}^{t_2}(1-v_2)\mathrm dv_2 \\ ={}& \int_0^{t_1}\mathrm dv_1v_1(1-v_1)+\int_0^{t_2}\mathrm dv_1\left((t_2-v_1)-\frac12\left(t_2^2-v_1^2\right)\right) \\ ={}& \frac12t_1^2-\frac13t_1^3+t_2^2-\frac12t_2^2-\frac12t_2^3+\frac16t_2^3 \\ ={}& \frac12t_1^2-\frac13t_1^3+\frac12t_2^2-\frac13t_2^3\;. \end{align}

Thus the total success probability is

$$ \frac13-\frac13t_1^3+\frac12t_1-\frac16t_1^3-\frac13t_2^3+\frac12t_1^2-\frac13t_1^3+\frac12t_2^2-\frac13t_2^3 \\ =\frac13+\frac12t_1+\frac12t_1^2-\frac56t_1^3+\frac12t_2^2-\frac23t_2^3\;. $$

Optimising yields $t_2=\frac12$ (unsurprisingly) and $\frac12+t_1-\frac52t_1^2=0$, with positive solution $t_1=(1+\sqrt6)/5\approx0.6899$, and the optimal success probability is

$$ \frac13+\frac12\frac{1+\sqrt6}5+\frac12\left(\frac{1+\sqrt6}5\right)^2-\frac56\left(\frac{1+\sqrt6}5\right)^3+\frac12\left(\frac12\right)^2-\frac23\left(\frac12\right)^3\\=\frac{293+48\sqrt6}{600}\approx68\%\;, $$

so this is apparently also not what you had in mind.

Which raises the question: What did you have in mind?

Stack Exchange Network

Expanding a randomized Two Envelope Problem

My investigation just for k=2 gave me the following

1 Answer 1

You must log in to answer this question.

Linked

Hot Network Questions

Expanding a randomized Two Envelope Problem

My investigation just for k=2 gave me the following

1 Answer 1

You must log in to answer this question.

Linked

Related

Hot Network Questions