What is Mean?
Mean is known as average, which represents middle point in data. We can say that mean is most common value in the data.
Example:
Age of 3 students are, 12, 15, 18. Average age of those 3 students is 15
Calculation of Mean:
Mean = (Σxᵢ) / n
That is sum of all the data over the number of data, here sum of all ages are 12+15+18=45, now lets divide sum over number of data points ie, 45/3=15. So 15 is average
Note: Mean is sensitive to outliers (extreme data values)
What is Mode?
Mode is the most frequent value/most occured value in a dataset, used in imputation process of categorical values
Example: data is having value 1,2,2,3,3,3,4,4,4,4. The Mode is 4
Note: If 2 values repeats most, then we call it bi-modal. If 3 or more most repeated values, then there is no mode in the data
What is Median?
Median represents “middle” value in a dataset when the data is ordered from smallest to largest (ascending order). Usually used to find out skewness of data
Example: data is having values 4, 7, 2, 8, 5. The Median is 5
Calculation of Mode:
Data = 4, 7, 2, 8, 5.
Data is sorted in ascending order is 2, 4, 5, 7, 8, in which middle value is 5
Why we need Mean Median and Mode(central tendency)?
We use this when we need central tendency to summarize the data and this help users to understand how data is distributed