Introduction to Machine Learning(Part-4)

July 14, 2020

Training Data is the subset of population, where population can be of n variables x₁,x₂,........,x_n. Let us discuss it by example , assume we are taking to factors to differentiate individuals that is Eye color and Hair color. Eye color can be Blue,Green and Brown. Hair color can be Black, red, blond and grey. So the total combinations we will be working be 3*4 = 12

Here, we can make prediction of function by Joint Probability Distribution.We assume that eye color and hair color does not tell us much about the individuals.Let us add an attribute Milk allergy which has binary values Yes(y) or No(N), now the combinations on which we can work will be 3*4*2 = 24.

If we add an another attribute income which is continuous then the combinations to learn is 3*4*2*∞ = ∞. So , there are two ways by which we can learn this problem:

Discretize / Bucket : In this we divide the attribute income in interval like: k maps to [1,100k] ,[100k,500k,.......]
Probability Density Function : Here we apply mixture of gaussians for the P(income). This is a parametric model where the overall parameters will be 2k+k-1.