Probability and Statistics #math pns


  • of The Discrete Type
    • joint pmf
    • marginal pmf
      • independent random variables
    • mathematical expectation
  • The Correlation Coefficient
    • Correlation and regression
  • Conditional Distributions
  • of The Continuous Type
  • Bivariate Normal Distribution

Discrete Bivariate Distributions

So far our random variables were only taking one input variable, but what if we want to correlate 2 input fields? Or predict a a third quantity based on multiple inputs. e.g. Based on 12th board marks and JEE Rank, can we predict CGPA at the end of first semester?

Joint Probability Mass Function:

Let and be two random variables defined on a discrete space. Let denote the corresponding two-dimensional space of and . The probability that and is given by the joint pmf where

Marginal Probability Mass Function:

The pmf of alone is called the marginal pmf of and is given by , where the summation is taken over all possible values for every

Independence: The random variables and are independent if and only if for and every : or equivalently Otherwise and are said to be dependent

Note: Whenever the support of and is not rectangular, they are dependent

Mathematical Expectation:

is the mathematical expectation of

  1. if then is called the mean of
  2. if then is the variance of

Correlation Coefficient

and are the mean and variance

We now introduce covariance and the correlation coefficient

  1. if then is the covariance
  2. if that standard deviations are positive (not imaginary) then is the correlation coefficient

Correlation and Regression:

good resource: MIT Notes

let and be our two random variables, is a measure of how correlated and are. i.e. When we form our least squares linear regression line for the graph of then how linear of a relation do we get?

The slope of the line is , the intercept

And the minimum value of the square of the error is

This gives us the property

we can say that measures the amount of linearity of a bivariate probability distribution