\DeclareMathOperator{\var}{var} % variance "operator"

\chapter{Probability and inference}
\section{Means and variances of conditional distributions}
The following result,
\[
\var(u) = E(\var(u|v)) + \var(E(u|v)),
\]
can be derived by expanding the terms on the right side:
\begin{equation} \label{eqn:means_variances}
\begin{aligned}
E(\var(u|v)) + \var(E(u|v)) & = E(E(u^2|v)-(E(u|v))^2)+E((E(u|v))^2)-(E(E(u|v)))^2 \\
                & = E(u^2)-E((E(u|v))^2)+E((E(u|v))^2)-(E(u))^2 \\
                & = E(u^2)-(E(u))^2=\var(u).
\end{aligned}
\end{equation}
The above identity, %$\ref{eqn:means_variances}$
along with
\[
E(u) = E(E(u|v)),
\]
also holds if $u$ is a vector, in which case $E(u)$ is a vector and $\var(u)$
a matrix.
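
As an illustrative check, consider a simple two-level normal model. The model and the symbols $\mu$, $\sigma^2$, and $\tau^2$ are assumptions introduced here for the example only. Suppose $u|v \sim \mathrm{N}(v, \sigma^2)$ and $v \sim \mathrm{N}(\mu, \tau^2)$, where $\sigma^2$ and $\tau^2$ are variances. Then $E(u|v) = v$ and $\var(u|v) = \sigma^2$, so the two identities give
\begin{align*}
E(u) &= E(E(u|v)) = E(v) = \mu, \\
\var(u) &= E(\var(u|v)) + \var(E(u|v)) = E(\sigma^2) + \var(v) = \sigma^2 + \tau^2.
\end{align*}
This agrees with the marginal distribution of $u$ in this model, which is $\mathrm{N}(\mu, \sigma^2 + \tau^2)$.
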
\subsection{Transformation of variables}
In one dimension, we commonly use the logarithm to transform the parameter space from $(0,\infty)$ to $(-\infty,\infty)$. When working with parameters defined on the open unit interval, $(0, 1)$, we often use the logistic transformation:
\[
\operatorname{logit}(u) = \log\left(\frac{u}{1-u}\right),
\]
whose inverse transformation is
\[
\operatorname{logit}^{-1}(v) = \frac{e^v}{1+e^v}.
\]
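
To see why the second formula inverts the first, set $v$ equal to the logit of $u$ and solve for $u$. The following is a short sketch of the algebra; the numerical value that follows is chosen only for illustration.
\begin{align*}
v = \log\left(\frac{u}{1-u}\right)
  &\iff e^v = \frac{u}{1-u} \\
  &\iff e^v (1-u) = u \\
  &\iff u = \frac{e^v}{1+e^v}.
\end{align*}
For example, $u = 0.75$ gives $v = \log 3 \approx 1.10$, and substituting back yields $e^{\log 3}/(1+e^{\log 3}) = 3/4$, which recovers $u$.
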
Another common choice for transforming from $(0, 1)$ to $(-\infty,\infty)$ is the probit transformation, $\Phi^{-1}(u)$, where $\Phi$ is the standard normal cumulative distribution function.


