Mixture Distributions
Sometimes, we can model a random variable as a mixture of two or more distributions in the actuarial context. For example, the distribution of the total claim amount in a portfolio of insurance policies can be modeled as a mixture of two distributions: one for the claims that are below the deductible and another for the claims that exceed the deductible.
A mixture distribution is a probability distribution that is a mixture of two or more distributions. The probability density function (pdf) of a mixture distribution is given by the weighted sum of the pdfs of the component distributions. The weights are non-negative and sum to 1.
Let and be the pdfs of two distributions, and let the random variable have a mixture distribution with the pdf , which is a mixture of and with weights and , respectively. Then, the pdf of the mixture distribution is given by:
Equivalently, the cumulative distribution function (cdf) of the mixture distribution is given by:
where and are the cdfs of the component distributions.
Example Here is an example of a mixture distribution. Suppose
two uniform random variables and are independent and identically distributed with pdf for . Let be a mixture of and with weights 0.5:0.5. Then, the E(X) is given by:
Let . Then, the E(Y) is given by:
Thus, the expected value of and are the same. However, the two random variables are not the same. is a mixture of and , while is the average of and . The range of is , while the range of is . :::
More generally, a mixture distribution can be a mixture of more than two distributions. The pdf of a mixture distribution with component distributions is given by:
where are the weights of the component distributions, and are the pdfs of the component distributions.
Suppose that for every in the set of is a distribution . If is a on corresponding .
Then we can define the mixture distribution by
In the previous section, we discussed how the Pareto distribution could be derived from the mixture of the Exponential distribution and the Gamma distribution.
::: example Here we provide another of the negative binomial distribution
Suppose the variability in the claim rate follows a distribution and let be the density function of the random variable, and the claim cases follows a Poisson distribution with parameter , the density function of random variable will be given by
By using the following R code, we can derive the negative binomial distribution
x <- c(rep(0, 10000)) # generate random samples
for (i in 1:10000) {
x[i] <- rpois(
n = 1,
lambda = rgamma(n = 1, shape = 5, rate = 25)
)
}
table(x)
print(mean(x))
print(var(x))x
0 1 2 3 5
8204 1619 150 26 1[1] 0.2002
[1] 0.2077407