DSC 140A

Problem #038

Consider a data set of \(n\) points in \(\mathbb R^d\), \(\nvec{x}{1}, \ldots, \nvec{x}{n}\). Suppose the data are standardized, creating a set of new points \(\nvec{z}{1}, \ldots, \nvec{z}{n}\). That is, if the new points are stacked into an \(n \times d\) matrix, \(Z\), the mean and variance of each column of \(Z\) would be zero and one, respectively.

True or False: the covariance matrix of the standardized data must be the \(d\times d\) identity matrix; that is, the \(d \times d\) matrix with ones along the diagonal and zeros off the diagonal.

True False

Solution

False.

Problem #041

Tags: covariance

Suppose a data set consists of the following three measurements for each Saturday last year: \(X_1\): The day's high temperature \(X_2\): The number of people at Pacific Beach on that day \(X_3\): The number of people wearing coats on that day

Suppose the covariance between these features is calculated and placed into a \(3 \times 3\) sample covariance matrix, \(C\). Which of the below options most likely shows the sign of each entry of the sample covariance matrix?

Solution

The second option.

Problem #042

Tags: covariance

Suppose we have two data sets, \(\mathcal{D}_1\) and \(\mathcal{D}_2\), each containing \(n/2\) points in \(\mathbb R^d\). Let \(\nvec{\mu}{1}\) and \(C^{(1)}\) be the mean and sample covariance matrix of \(\mathcal{D}_1\), and let \(\nvec{\mu}{2}\) and \(C^{(2)}\) be the mean and sample covariance matrix of \(\mathcal{D}_2\).

Suppose the two data sets are combined into a single data set \(\mathcal D\) containing \(n\) points.

Part 1)

True or False: the mean of the combined data \(\mathcal{D}\) is equal to \(\displaystyle\frac{\nvec{\mu}{1} + \nvec{\mu}{2}}{2}\).

True False

Solution

True.

Part 2)

True or False: the sample covariance matrix of the combined data \(\mathcal{D}\) is equal to \(\displaystyle\frac{C^{(1)} + C^{(2)}}{2}\).

True False

Solution

False

Problem #043

Tags: covariance

Suppose a random vector \(\vec X = (X_1, X_2)\) has a multivariate Gaussian distribution. Suppose it is known that known that \(X_1\) and \(X_2\) are independent.

Let \(C\) be the Gaussian distribution's covariance matrix.

Part 1)

True or False: \(C\) must be diagonal.

True False

Solution

True.

Part 2)

True or False: each entry of \(C\) must the same.

True False

Solution

False.

Problem #052

Tags: covariance

Let \(\mathcal D\) be a set of data points in \(\mathbb R^d\), and let \(C\) be the sample covariance matrix of \(\mathcal D\). Suppose each point in the data set is shifted in the same direction and by the same amount. That is, suppose there is a vector \(\vec\delta\) such that if \(\nvec{x}{i}\in\mathcal D\), then \(\nvec{x}{i} + \vec\delta\) is in the new data set.

True or False: the sample covariance matrix of the new data set is equal to \(C\)(the sample covariance matrix of the original data set).

True False

Solution

True.

Problem #055

Tags: covariance

Consider the data set \(\mathcal D\) shown below.

What will be the sign of the \((1,2)\) entry of the data's sample covariance matrix?

Solution

The sign will be negative.

Problem #103

Tags: covariance, object type

Problem #106

Tags: covariance, maximum likelihood

Consider the following set of 6 data points:

\[\begin{align*} \nvec{x}{1} &= (1, 2)^T \\ \nvec{x}{2} &= (-1, 1)^T \\ \nvec{x}{3} &= (1, 3)^T \\ \nvec{x}{4} &= (1, 4)^T \\ \nvec{x}{5} &= (-1, 1)^T \\ \nvec{x}{6} &= (-1, 1)^T \\ \end{align*}\]

In the below parts, your answers should be given as numbers. You may leave your answer as an unsimplified fraction or a decimal, if you prefer.

Part 1)

What is the (1,2) entry of the sample covariance matrix?

1

Part 2)

What is the (2,2) entry of the sample covariance matrix?

8/6

Solution

Video explanation: https://youtu.be/BvFKfpGVR9k

Problem #107

Tags: covariance, Gaussians

The picture below shows the contours of a multivariate Gaussian density function:

Which one of the following could possibly be the covariance matrix of this Gaussian?

Solution

C. Video explanation: https://youtu.be/5b1nzF0yYeE

Problem #113

Tags: covariance, maximum likelihood

Consider the following set of 6 data points:

\[\begin{align*} \nvec{x}{1} &= (2, 4, 4)^T \\ \nvec{x}{2} &= (-1, 2, 1)^T \\ \nvec{x}{3} &= (3, -3, 2)^T \\ \nvec{x}{4} &= (0, -3, -3)^T \\ \end{align*}\]

In the below parts, your answers should be given as numbers. You may leave your answer as an unsimplified fraction or a decimal, if you prefer.

Part 1)

What is the (1,3) entry of the sample covariance matrix?

9/4

Part 2)

What is the (1,2) entry of the sample covariance matrix?

-3/4

Problem #114

Tags: conditional independence, Gaussians, covariance

Let \(X_1\) and \(X_2\) be two independent random variables. Suppose the distribution of \(X_1\) has the Gaussian density:

\[ p_1(x_1) = \frac{1}{\sqrt{8 \pi}}\, e^{-\frac{(x_1 - 2)^2}{8}}, \]

while the distribution of \(X_2\) has the Gaussian density:

\[ p_2(x_2) = \frac{1}{\sqrt{2\pi}}\, e^{-\frac{(x_2 + 1)^2}{2}}. \]

Which one of the following pictures shows the contours of the joint density \(p(x_1, x_2)\)(the density for the joint distribution of \(X_1\) and \(X_2\))?

Solution

Picture (d).

Problem #116

Tags: covariance, Gaussians, bayes error

Suppose that, in a binary classification setting, the true underlying class-conditional densities \(p(\vec x \given Y=0)\) and \(p(\vec x \given Y=1)\) are known to each be multivariate Gaussians with full covariance matrices. Suppose, also, that \(\pr(Y = 1) = \pr(Y = 0) = \frac{1}{2}\).

True or False: it is possible that the Bayes error in this case is exactly zero.

True False

Solution

False.

Problem #120

Tags: covariance, Gaussians

The picture below shows the contours of a multivariate Gaussian density function:

Which one of the following could possibly be the covariance matrix of this Gaussian?

Solution

C.

Problems tagged with "covariance"

Problem #038

Problem #041

Problem #042

Part 1)

Part 2)

Problem #043

Part 1)

Part 2)

Problem #052

Problem #055

Problem #103

Part 1)

Part 2)

Part 3)

Part 4)

Problem #106

Part 1)

Part 2)

Problem #107

Problem #113

Part 1)

Part 2)

Problem #114

Problem #116

Problem #120