Expected value of the sample median given the sample mean

16

Let $Y$ denote the median and let $\bar{X}$ denote the mean of a random sample of size $n=2k+1$ from a distribution that is $N(\mu,\sigma^2)$. How can I compute $E(Y|\bar{X}=\bar{x})$?

Intuitively, because of the normality assumption, it makes sense to claim that $E(Y|\bar{X}=\bar{x})=\bar{x}$, and indeed that is the correct answer. Can it be shown rigorously, though?

My initial thought was to approach this problem using the conditional normal distribution, which is a well-known result. The problem is that since I do not know the expected value, and consequently the variance, of the median, I would have to compute those using the $(k+1)$-th order statistic. That is very complicated, and I would rather not go there unless I absolutely have to.

self-study normal-distribution mathematical-statistics expected-value conditional-expectation JohnK

2

I believe this is an immediate consequence of the generalization I have just posted at stats.stackexchange.com/a/83887. The distribution of the residuals $x_i-\bar{x}$ is clearly symmetric about $0$, whence their median has a symmetric distribution, and therefore its mean is zero. Thus the expectation of the median itself (not just of the residuals) equals $0 + E(\bar{X}\ |\ \bar{X}=\bar{x}) = \bar{x}$, QED.

whuber

@whuber Sorry, residuals?

JohnK

I defined them in my comment: they are the differences between each $x_i$ and their mean.

whuber

@whuber No, I got that, but I am still working on understanding how your other answer relates to my question and how exactly the expectation you take works.

JohnK

2

@whuber Okay, so please correct me if I am wrong: $E(Y|\bar{X})=E(\bar{X}|\bar{X})+E(Y-\bar{X}|\bar{X})$. Now the second term is zero because the median is symmetric about $\bar{x}$. Therefore the expectation reduces to $\bar{x}$.

JohnK

7

Let $X$ denote the original sample and $Z$ the random vector with entries $Z_k=X_k-\bar X$. Then $Z$ is normal and centered (but its entries are not independent, as can be seen from the fact that their sum is zero with full probability). As a linear functional of $X$, the vector $(Z,\bar X)$ is normal, hence computing its covariance matrix suffices to show that $Z$ is independent of $\bar X$.

Turning to $Y$, one sees that $Y=\bar X+T$, where $T$ is the median of $Z$. In particular, $T$ depends on $Z$ only, hence $T$ is independent of $\bar X$, and the distribution of $Z$ is symmetric, hence $T$ is centered.
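For reference, the covariance computation is short. This is a sketch assuming, as in the question, that the $X_k$ are i.i.d. $N(\mu,\sigma^2)$. For each $k$,

$\operatorname{Cov}(Z_k,\bar X)=\operatorname{Cov}(X_k,\bar X)-\operatorname{Var}(\bar X)=\frac{\sigma^2}{n}-\frac{\sigma^2}{n}=0.$

Because $(Z,\bar X)$ is jointly normal, zero covariance is equivalent to independence.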

Finally,

$E(Y\mid\bar X)=\bar X+E(T\mid\bar X)=\bar X+E(T)=\bar X.$
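As an informal sanity check (not part of the original answer), the identity can be probed by simulation: if $E(Y\mid\bar X)=\bar X$, a least-squares regression of the sample median on the sample mean should have slope close to 1 and intercept close to 0. The function name `simulate` and the parameter values below are arbitrary choices made for this sketch.

```python
import random
import statistics


def simulate(mu=1.5, sigma=2.0, k=3, reps=50_000, seed=0):
    """Regress the sample median on the sample mean over many normal samples.

    Returns the least-squares (slope, intercept). All defaults are
    illustrative assumptions, not values taken from the thread.
    """
    rng = random.Random(seed)
    n = 2 * k + 1  # odd sample size, as in the question
    means, medians = [], []
    for _ in range(reps):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        means.append(sum(sample) / n)
        medians.append(statistics.median(sample))

    # Ordinary least squares of median on mean, computed by hand.
    m_bar = sum(means) / reps
    y_bar = sum(medians) / reps
    sxx = sum((m - m_bar) ** 2 for m in means)
    sxy = sum((m - m_bar) * (y - y_bar) for m, y in zip(means, medians))
    slope = sxy / sxx
    intercept = y_bar - slope * m_bar
    return slope, intercept


slope, intercept = simulate()
print(f"slope = {slope:.4f}, intercept = {intercept:.4f}")
```

A slope near 1 with an intercept near 0 is consistent with the result, though a simulation of course proves nothing by itself.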

Did

Thank you, this was asked almost a year ago and I am very glad that someone has finally cleared it up.

JohnK

7

The sample median is an order statistic and has a non-normal distribution, so the joint finite-sample distribution of the sample median and the sample mean (which has a normal distribution) would not be bivariate normal. Resorting to approximations, asymptotically the following holds (see my answer here):

$\sqrt n\Big [\left (\begin{matrix} \bar X_n \\ Y_n \end{matrix}\right) - \left (\begin{matrix} \mu \\ \mathbb v \end{matrix}\right)\Big ] \rightarrow_{\mathbf L}\; N\Big [\left (\begin{matrix} 0 \\ 0 \end{matrix}\right) , \Sigma \Big]$

with

$\Sigma = \left (\begin{matrix} \sigma^2 & E\left( |X-\mathbb v|\right)\left[2f(\mathbb v)\right]^{-1} \\ E\left(|X-\mathbb v|\right)\left[2f(\mathbb v)\right]^{-1} & \left[2f(\mathbb v)\right]^{-2} \end{matrix}\right)$

where $\bar X_n$ is the sample mean and $\mu$ the population mean, $Y_n$ is the sample median and $\mathbb v$ the population median, $f()$ is the probability density of the random variables involved, and $\sigma^2$ is the variance.

So, approximately for large samples, their joint distribution is bivariate normal, and we have that

$E(Y_n \mid \bar X_n=\bar x) = \mathbb v + \rho\frac {\sigma_{\mathbb v}}{\sigma_{\bar X}}(\bar x -\mu)$

where $\rho$ is the correlation coefficient.

Manipulating the asymptotic distribution to obtain the approximate large-sample joint distribution of the sample mean and the sample median (rather than of the standardized quantities), we have

$\rho = \frac {\frac 1nE\left(|X-\mathbb v|\right)\left[2f(\mathbb v)\right]^{-1}}{\frac 1n \sigma \left[2f(\mathbb v)\right]^{-1}} = \frac {E\left(|X-\mathbb v|\right)}{\sigma }$

So

$E(Y_n \mid \bar X_n=\bar x) = \mathbb v + \frac {E\left(|X-\mathbb v|\right)}{\sigma }\frac {\left[2f(\mathbb v)\right]^{-1}}{\sigma}(\bar x -\mu)$

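We have that $2f(\mathbb v) = 2/(\sigma\sqrt{2\pi})$, since by the symmetry of the normal density the median $\mathbb v$ coincides with the mean $\mu$, where the density equals $1/(\sigma\sqrt{2\pi})$, so we arrive at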

$E(Y_n \mid \bar X_n=\bar x) = \mathbb v + \sqrt{\frac {\pi}{2}}E\left(\left|\frac {X-\mu}{\sigma}\right|\right)(\bar x -\mu)$

where we have used $\mathbb v = \mu$. Now the standardized variable is a standard normal, so its absolute value follows a half-normal distribution with expected value equal to $\sqrt{2/\pi}$ (since the underlying variance is unity). So

$E(Y_n \mid \bar X_n=\bar x) = \mathbb v + \sqrt{\frac {\pi}{2}}\sqrt{\frac {2}{\pi}}(\bar x -\mu) = \mathbb v + \bar x -\mu = \bar x$
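The coefficient multiplying $(\bar x-\mu)$ can also be checked numerically. The sketch below is an illustration added here, not the answerer's code: it evaluates $\frac{E|X-\mathbb v|}{\sigma}\cdot\frac{[2f(\mathbb v)]^{-1}}{\sigma}$ for a normal density, approximating $E|X-\mathbb v|$ with a midpoint rule instead of using the closed form. The helper names and the values $\mu=3$, $\sigma=2$ are arbitrary assumptions.

```python
import math


def normal_pdf(x, mu, sigma):
    """Density of N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def mean_abs_deviation(mu, sigma, half_width=12.0, steps=100_000):
    """Midpoint-rule approximation of E|X - mu| for X ~ N(mu, sigma^2).

    The integral is truncated to mu +/- half_width * sigma; the omitted
    tails are negligible for a normal density.
    """
    lo = mu - half_width * sigma
    h = 2.0 * half_width * sigma / steps
    total = 0.0
    for i in range(steps):
        x = lo + (i + 0.5) * h
        total += abs(x - mu) * normal_pdf(x, mu, sigma)
    return total * h


def asymptotic_slope(mu, sigma):
    """Coefficient of (xbar - mu) in the large-sample conditional mean."""
    v = mu  # the median of a normal distribution equals its mean
    inv_2f = 1.0 / (2.0 * normal_pdf(v, mu, sigma))
    return (mean_abs_deviation(mu, sigma) / sigma) * (inv_2f / sigma)


slope = asymptotic_slope(mu=3.0, sigma=2.0)
print(f"asymptotic slope = {slope:.6f}")
```

The printed value should be very close to 1, in line with the cancellation $\sqrt{\pi/2}\,\sqrt{2/\pi}=1$ in the last display.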

Alecos Papadopoulos

2

As always, nice answer +1. However, since we have no information about the sample size, the asymptotic distribution might not hold. If there is no way to obtain the exact distribution though, I suppose I'll have to make do. Thank you very much.

JohnK

6

The answer is $\bar{x}$.

Let $x = (x_1, x_2, \ldots, x_n)$ have a multivariate distribution $F$ for which all the marginals are symmetric about a common value $\mu$. (It does not matter whether they are independent or even are identically distributed.) Define $\bar{x}$ to be the arithmetic mean of the $x_i,$ $\bar{x} = (x_1+x_2+\cdots+x_n)/n$ and write $x-\bar{x} = (x_1-\bar{x}, x_2-\bar{x}, \ldots, x_n-\bar{x})$ for the vector of residuals. The symmetry assumption on $F$ implies the distribution of $x - \bar{x}$ is symmetric about $0$; that is, when $E\subset\mathbb{R}^n$ is any event,

${\Pr}_F(x - \bar{x}\in E) = {\Pr}_F(x - \bar{x}\in -E).$

Applying the generalized result at stats.stackexchange.com/a/83887 shows that the median of $x-\bar{x}$ has a symmetric distribution about $0$. Assuming its expectation exists (which is certainly the case when the marginal distributions of the $x_i$ are Normal), that expectation has to be $0$ (because the symmetry implies it equals its own negative).

Now since subtracting the same value $\bar{x}$ from each of a set of values does not change their order, $Y$ (the median of the $x_i$) equals $\bar{x}$ plus the median of $x-\bar{x}$. Consequently its expectation conditional on $\bar{x}$ equals the expectation of the median of $x-\bar{x}$ conditional on $\bar{x}$, plus $E(\bar{x}\ |\ \bar{x})$. The latter obviously is $\bar{x}$ whereas the former is $0$ because the unconditional expectation is $0$. Their sum is $\bar{x},$ QED.

whuber

Thank you for posting it as a full answer. I now understand the essence of your argument but I might ping you if something is still unclear.

JohnK

5

JohnK, I need to alert you to be cautious. A counterexample to this argument has been brought to my attention. I have encouraged its originator to post it here for further discussion, but briefly it concerns a discrete bivariate distribution with symmetric marginals but asymmetric conditional marginals. Its existence points to a flawed deduction early in my argument. I currently hope that the argument might be rescued by imposing stronger conditions on the $x_i$, but my attention is presently focused elsewhere and I might not get to think about this for a while.

whuber

4

In the meantime I would encourage you to unaccept this answer. I would ordinarily delete any answer of mine known to be incorrect, but (as you might be able to tell) I like solutions based on first principles rather than detailed calculations, so I hope this argument can be rescued. I therefore intend to leave it open for criticism and improvement (and therefore made it CW); let the votes fall as they may.

whuber

Of course, thanks for letting me know. We will discuss it further when you have time. In the meantime I will settle for the asymptotic argument proposed by @Alecos Papadopoulos.

JohnK

6

This is simpler than the above answers make it. The sample mean is a complete and sufficient statistic (when the variance is known, but our results do not depend on the variance, hence will be valid also in the situation when the variance is unknown). Then the Rao-Blackwell theorem together with the Lehmann-Scheffé theorem (see wikipedia ...) implies that the conditional expectation of the median, given the arithmetic mean, is the unique minimum variance unbiased estimator of the expectation $\mu$. But we know that this estimator is the arithmetic mean, hence the result follows.

We did also use that the median is an unbiased estimator, which follows from symmetry.
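The completeness step can be spelled out as follows. This is a sketch of the standard argument; the symbol $g$ is introduced here only for illustration. Write $g(\bar X)=E[Y\mid\bar X]$, which does not depend on $\mu$ because $\bar X$ is sufficient. By the tower property and the unbiasedness of both estimators,

$E_\mu[g(\bar X)] = E_\mu[Y] = \mu = E_\mu[\bar X] \quad \text{for all } \mu,$

so $E_\mu[g(\bar X)-\bar X]=0$ for every $\mu$, and completeness of $\bar X$ gives $g(\bar X)=\bar X$ almost surely.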

kjetil b halvorsen

1

By symmetry $E[Y]=\mu$, indeed. Then from these two theorems we know that $E[Y|\bar{X}]$ is the Unique Minimum Variance Unbiased Estimator for $\mu$, which we already know to be equal to $\bar{X}$. This is a brilliant answer, thank you very much. I would have marked it as the correct one, had I not done that already for another answer.

JohnK
