Poprawa SNR za pomocą technik DSP

Ponieważ wskazałeś, że spektrum mocy szumu tła jest płaskie, założę się, że jest biały . Główną wadą obecnego podejścia jest to, że odrzucasz dużą ilość mocy sygnału; nawet z efektem ograniczenia pasma z przodu pokazanego na schemacie przez odpowiedź skokowego wzrostu wykładniczego, pojedyncza próbka ADC pod koniec zaokrąglonego impulsu zapewnia migawkę wejścia odbiornika, który jest raczej zlokalizowany w czasie. Możesz skorzystać z większej mocy sygnału, próbkując z wyższą częstotliwością i stosując dopasowany filtr przy wyższej częstotliwości próbkowania.

Teoria:

Można na to spojrzeć jako stosunkowo prosty problem w teorii wykrywania . W każdym przedziale symboli odbiornik musi wybierać między dwiema hipotezami:

\begin{array}{rcl} H_{0} & : & s i g n a l i s n o t p r e s e n t \\ H_{1} & : & s i g n a l i s p r e s e n t \end{array}

$\begin{eqnarray*} H_0 &:& signal \ is \ not \ present \\ H_1 &:& signal \ is \ present \end{eqnarray*}$

Tego rodzaju problem jest często rozwiązywany za pomocą bayesowskich reguł decyzyjnych , które próbują podjąć optymalną decyzję zgodnie z określoną miarą ryzyka. Zapewnia to ramy, w których można optymalnie podejmować decyzje dotyczące wykrywania w oparciu o elastyczny zestaw kryteriów. Na przykład, jeśli istnieje duża kara do systemu za brak wykryć sygnał, jeśli jest w rzeczywistości obecnej (tj wybrać gdy jest prawdą), wówczas można zbudować że w swojej reguły decyzyjnej w razie potrzeby. $H_0$ $H_1$

W przypadku problemu z wykrywaniem, takiego jak twój, gdy próbujesz zdecydować między zerami a zerami na wyjściu odbiornika, zwykle przyjmuje się, że kara jest równa (wyprowadzenie zera, gdy jeden został przesłany, i odwrotnie, „boli równo” ). Podejście bayesowskie w tym przypadku sprowadza się do estymatora największego prawdopodobieństwa (również tutaj opisanego ): wybierasz hipotezę, która jest najbardziej prawdopodobna, biorąc pod uwagę obserwację dokonaną przez odbiornik. Oznacza to, że jeśli ilość, którą obserwuje odbiornik, wynosi , wygenerowałoby to decyzję na podstawie hipotezy, która ma największą wartość funkcji prawdopodobieństwa . W przypadku decyzji binarnej zamiast tego można zastosować współczynnik prawdopodobieństwa: $x$

Λ (x) = \frac{P (x | H_{0} i s t r u e)}{P (x | H_{1} i s t r u e)} = \frac{P (x | s i g n a l i s n o t p r e s e n t)}{P (x | s i g n a l i s p r e s e n t)}

$\Lambda(x) = \frac{P(x\ |\ H_0 \ is \ true)}{P(x\ |\ H_1 \ is \ true)} = \frac{P(x\ |\ signal \ is \ not \ present)}{P(x\ |\ signal \ is \ present)}$

Korzystając z powyższego modelu, dla każdej obserwacji kanału optymalny odbiornik zdecydowałby, że sygnał nie był obecny (a zatem wyprowadzał zero), jeśli współczynnik prawdopodobieństwa jest większy niż jeden (a zatem sygnał był najprawdopodobniej nieobecność na podstawie obserwacji) i odwrotnie. $x$ $\Lambda(x)$

Pozostał model interesującego sygnału i wszelkich innych elementów statystyki wykrywania odbiornika które mogłyby wpłynąć na jego decyzje. W przypadku takiej komunikacji cyfrowej można ją modelować w następujący sposób: $x$

\begin{array}{rcl} H_{0} & : & x = N \\ H_{1} & : & x = s + N \end{array}

$\begin{eqnarray*} H_0 &:& x = N \\ H_1 &:& x = s + N \end{eqnarray*}$

gdzie jest zmienną losową wziętą z pewnego rozkładu (często przyjmowaną za zero-średnią gaussowską), a jest deterministycznym składnikiem obserwacji wynikającym z poszukiwanego sygnału. Dlatego rozkład obserwowalnego odbiornika zmienia się w zależności od tego, czy hipoteza czy jest prawdziwa. Aby ocenić współczynnik wiarygodności, potrzebujesz modelu tego, czym są te rozkłady. W przypadku Gaussa, o którym mowa powyżej, matematyka wygląda następująco: $n$ $s$ $x$ $H_0$ $H_1$

Λ (x) = \frac{P (x | H_{0} i s t r u e)}{P (x | H_{1} i s t r u e)} = \frac{P (x | x = N)}{P (x | x = s + N)}

$\Lambda(x) = \frac{P(x\ |\ H_0 \ is \ true)}{P(x\ |\ H_1 \ is \ true)} = \frac{P(x\ |\ x = N)}{P(x\ |\ x = s + N)}$

Λ (x) = \frac{P (x | H_{0} i s t r u e)}{P (x | H_{1} i s t r u e)} = \frac{e^{\frac{- x^{2}}{2 σ^{2}}}}{e^{\frac{- (x - s)^{2}}{2 σ^{2}}}}

$\Lambda(x) = \frac{P(x\ |\ H_0 \ is \ true)}{P(x\ |\ H_1 \ is \ true)} = \frac{e^{\frac{-x^2}{2 \sigma^2}}}{e^{\frac{-(x - s)^2}{2 \sigma^2}}}$

gdzie jest wariancją terminu szumu Gaussa. Należy zauważyć, że addytywna składowa sygnału ma jedynie funkcję przesunięcia średniej wynikowego rozkładu Gaussa . Współczynnik logarytmu wiarygodności można wykorzystać do pozbycia się wykładniczych: $\sigma^2$ $x$

\ln (Λ (x)) = \ln (\frac{e^{\frac{- x^{2}}{2 σ^{2}}}}{e^{\frac{- (x - s)^{2}}{2 σ^{2}}}}) = (\frac{- x^{2}}{2 σ^{2}}) - (\frac{- (x - s)^{2}}{2 σ^{2}})

$\ln(\Lambda(x)) = \ln\left(\frac{e^{\frac{-x^2}{2 \sigma^2}}}{e^{\frac{-(x - s)^2}{2 \sigma^2}}}\right) = \left(\frac{-x^2}{2 \sigma^2}\right) - \left(\frac{-(x - s)^2}{2 \sigma^2}\right)$

$H_0$ $H_0$

\begin{array}{rcl} x < \frac{s}{2} \to c h o o s e H_{0} \\ x > \frac{s}{2} \to c h o o s e H_{1} \end{array}

$\begin{eqnarray*} x < \frac{s}{2} \rightarrow choose\ H_0 \\ x > \frac{s}{2} \rightarrow choose\ H_1 \end{eqnarray*}$

$x = \frac{s}{2}$ $s$ $T = \frac{s}{2}$ $x$ $T$

Ćwiczyć:

$s$

Jak wspomniałem wcześniej, hałas często przyjmuje się za gaussowski, ponieważ rozkład normalny jest tak łatwy do pracy: suma grupy niezależnych Gaussianów jest nadal gaussowska, a ich średnia i wariancje po prostu również dodają. Również statystyki pierwszego i drugiego rzędu rozkładu są wystarczające, aby je całkowicie scharakteryzować (biorąc pod uwagę średnią i wariancję rozkładu Gaussa, możesz napisać jego pdf ). Mamy nadzieję, że jest to przyzwoite przybliżenie przynajmniej dla twojej aplikacji.

$s$ $N$ $s$ zamiast tego . Aby zobaczyć jego znaczenie, wróćmy do teorii na chwilę. Jakie jest prawdopodobieństwo małego błędu przy naszej regule decyzyjnej?

\begin{array}{rcl} P_{e} & = & P (c h o o s e H_{0} | H_{1} t r u e) P (H_{1} t r u e) + P (c h o o s e H_{1} | H_{0} t r u e) P (H_{0} t r u e) \\ = & \frac{1}{2} P (x < \frac{s}{2} | x = s + N) + \frac{1}{2} P (x > \frac{s}{2} | x = N) \\ = & \frac{1}{2} F_{x | x = s + N} (\frac{s}{2}) + \frac{1}{2} (1 - F_{x | x = N} (\frac{s}{2})) \end{array}

$\begin{eqnarray*} P_e &=& P(choose \ H_0 \ |\ H_1\ true) P(H_1\ true) + P(choose \ H_1 \ |\ H_0\ true) P(H_0\ true) \\ &=& \frac{1}{2} P(x < \frac{s}{2} \ |\ x = s + N) + \frac{1}{2} P(x > \frac{s}{2} \ |\ x = N) \\ &=& \frac{1}{2} F_{x\ |\ x = s + N}\left(\frac{s}{2}\right) + \frac{1}{2} \left(1 - F_{x\ |\ x = N}\left(\frac{s}{2}\right)\right) \end{eqnarray*}$

where $F_{x\ |\ x = s + N}(z)$ is the cumulative distribution function of the distribution of the observation $x$ , given that $x = s + N$ (and likewise for the other function). Substituting in the cdf for the Gaussian distribution, we get:

\begin{array}{rcl} P_{e} & = & \frac{1}{2} (1 - Q (\frac{\frac{s}{2} - s}{σ})) + \frac{1}{2} Q (\frac{\frac{s}{2}}{σ}) \\ = & \frac{1}{2} + \frac{1}{2} (- Q (\frac{\frac{s}{2} - s}{σ}) + Q (\frac{\frac{s}{2}}{σ})) \\ = & \frac{1}{2} + \frac{1}{2} (- Q (\frac{- s}{2 σ}) + Q (\frac{s}{2 σ})) \\ = & \frac{1}{2} + \frac{1}{2} (- Q (\frac{- S N R}{2}) + Q (\frac{S N R}{2})) \\ = & Q (\frac{S N R}{2}) \end{array}

$\begin{eqnarray*} P_e &=& \frac{1}{2} \left(1 - Q\left(\frac{\frac{s}{2} - s}{\sigma}\right)\right) + \frac{1}{2} Q\left(\frac{\frac{s}{2}}{\sigma}\right) \\ &=& \frac{1}{2} + \frac{1}{2} \left(-Q\left(\frac{\frac{s}{2} - s}{\sigma}\right) + Q\left(\frac{\frac{s}{2}}{\sigma}\right)\right) \\ &=& \frac{1}{2} + \frac{1}{2} \left(-Q\left(\frac{-s}{2\sigma}\right) + Q\left(\frac{s}{2\sigma}\right)\right) \\ &=& \frac{1}{2} + \frac{1}{2} \left(-Q\left(\frac{-SNR}{2}\right) + Q\left(\frac{SNR}{2}\right)\right) \\ &=& Q\left(\frac{SNR}{2}\right) \end{eqnarray*}$

where $Q(x)$ is the Q function:

Q (x) = \frac{1}{\sqrt{2 π}} \int_{x}^{\infty} e^{\frac{- z^{2}}{2}} d z

$Q(x) = \frac{1}{\sqrt{2 \pi}} \int_{x}^{\infty} e^{\frac{-z^2}{2}} dz$

(i.e. the tail integral of the standard normal distribution's pdf, or $1$ minus the distribution's cdf) and $SNR$ is the signal-to-noise ratio $\frac{s}{\sigma}$ . The above function is a strictly decreasing function of $SNR$ ; as you increase the ratio of the signal amplitude $s$ to the noise standard deviation $\sigma$ , the probability of making a bit decision error decreases. So, it behooves you to do whatever you can to increase this ratio.

Remember our assumption that the noise was white and Gaussian? That can help us now. If the noise is white and Gaussian, then the noise components contained in each observation are jointly independent of one another. An important property of independent random variables is that when you sum them together, their means and variances sum. So, let's consider another simple case, where instead of taking one sample per symbol interval, you take two, then sum them together. I'll assume for simplicity that the pulse shape is rectangular (not an exponential rise), so the signal component $s$ in each observation $x_1$ and $x_2$ is the same. What is the difference in signal to noise ratio between just a single observation $x_1$ and the sum of two independent ones?

S N R_{1} = \frac{s}{σ}

$SNR_1 = \frac{s}{\sigma}$

S N R_{2} = \frac{2 s}{\sqrt{2 σ}} = \sqrt{2} S N R_{1}

$SNR_2 = \frac{2s}{\sqrt{2 \sigma}} = \sqrt{2}SNR_1$

So, the signal to noise ratio in the combined observation is larger than using only a single sample (under the assumption of equal signal component and equal-variance white Gaussian noise in both samples that we took). This is a basic observation that points out the potential benefits of taking more than one sample per symbol interval and integrating them together (which, for a rectangular pulse, is a matched filter). In general, you want to cover the entire symbol interval with samples so that your receiver "ingests" as much of the transmitted energy for each symbol, thus maximizing the SNR in the combined output. The ratio of symbol energy to the background noise variance $\frac{E_s}{N_0}$ is often used as a figure of merit when evaluating digital communications system performance.

More rigorously, it can be shown that a matched filter has an impulse response that is identical in shape (that is, "matched", with the only subtle exception being that the impulse response is reversed in time) to the pulse shape that the receiver sees (so it weights more strongly samples that have larger signal components). That shape is a function of the transmitted pulse shape as well as any effects induced by the channel or receiver front end, such as bandlimiting or multipath.

To implement this sort of arrangement in practice, you would convolve the stream of samples taken by your ADC with the time-reversed expected pulse shape. This has the effect of calculating the cross-correlation between the pulse shape and the received signal for all possible time offsets. Your implementation is aided by the precise time synchronization that you have available, so you'll know exactly which matched filter output samples correspond to correct sampling instants. The filter outputs at those times are used as the detection statistic $x$ in the theoretical model above.

I referred to threshold selection before, which can be a complicated topic, and there are many different ways that you can choose one, depending upon your system's structure. Selecting a threshold for an on-off-keyed system is complicated by the likely-unknown signal amplitude $s$ ; other signal constellations, like antipodal signaling (e.g. binary phase shift keying, or BPSK) have a more obvious threshold choice (for BPSK, the best threshold is zero for equally-likely data).

One simple implementation of a threshold selector for OOK might calculate the mean of many observations. Assuming that zeros and ones are equally likely, the expected value of the resulting random variable is half of the signal amplitude, which is the threshold that you seek. Performing this operation over a sliding window can allow you to be somewhat adaptive to varying background conditions.

Note that this is only intended to be a high-level introduction to the issues inherent in digital communications with respect to detection theory. It can be a very complicated topic, with a lot of statistics involved; I tried to make it somewhat easy to understand while keeping true to the underlying theory. For a better explanation, go get a good textbook, like Sklar's.

Jason R
źródło

thanks for the detailed answer, I learned a lot from it. I like to ask a few clarifications. I get the point of more than 1 sample at the duration. In this case how a matched filter look like? Say, I have three samples x1,x2,x3 (x3 at the tail end and x1 at the beginning). Based on what I read, I must convolve this with a same but symmetrical shape signal. Can you perhaps explain this part? [I think I know the answer but just to make sure] Second part, I know what is the dynamic range of incoming signal would be as I have taken measurements. Can I use that range for threshold setting?

Frank

A matched filter is a way of implementing a sliding cross-correlation between the signal seen by your receiver and the expected pulse shape. The diagram shown in your question illustrates the pulse seen by the ADC as an exponential rise; if that is indeed your model for what the receiver sees, then the appropriate matched filter would have the same shape, only reversed in time (the time reversal turns the convolution operation into correlation). If the receiver front end doesn't appreciably distort the pulse, you could use an "ideal" rectangular matched filter, which is simpler to implement.

Jason R

As to your second question: yes, if you know a priori the expected amplitude of the signal component, then you can use that to select a threshold. Using the statistical model for the system (based on the type of noise that is present), you can calculate the bit error rate as a function of the signal to noise ratio (which is proportional to the signal amplitude). If the thermal noise of your receiver is the dominant source, then white Gaussian noise is usually a good assumption.

Jason R

My receiver has a BPF that cuts the high frequency signals. The BPF rounds off the initial spike of the pulse and it becomes a more exponential in nature. I can disable the BPF but this will introduce HF noise currently not in the chain. It sounds like I have a tradeoff, how can I quantify which way is better. (i.e remove BPF and use matched filter for a pulse, don't remove BPF and use a matched filter for a exponential rise)

Frank

I awarded the bounty to you, thanks very much for a great answer.

Frank

Poprawa SNR za pomocą technik DSP

Odpowiedzi:

Teoria:

Ćwiczyć: