2022-03-23

Median and Mode

Introduction

Two essential measures of central tendency in statistics are the median and mode. These measures help us understand the central point of a data set, which can be crucial for making informed decisions, analyzing trends, and understanding patterns.

What is Median

The median is a measure of central tendency that represents the middle value of a dataset when it is arranged in order of magnitude. It is the value that separates the higher half of the data from the lower half. The median is particularly useful when there are extreme values or outliers in the dataset that could skew the mean.

Calculation

To calculate the median, the data must be arranged in order of magnitude from smallest to largest. If there is an odd number of values in the dataset, the median is the middle value. If there is an even number of values, the median is the average of the two middle values.

For example, if the dataset is {2, 4, 7, 9, 10}, the median would be 7. If the dataset is {2, 4, 7, 9, 10, 12}, the median would be (7+9)/2 = 8.

When to Use Median

The median is often used when the dataset contains extreme values or outliers that could skew the mean. For example, if a dataset contains salaries of employees in a company, the median salary might be a better measure of central tendency than the mean salary, as a few high salaries could skew the mean.

What is Mode

The mode is a measure of central tendency that represents the most frequently occurring value in a dataset. It is the value that occurs with the highest frequency. The mode is useful for datasets where the frequency of values is important, and it can be used for both numerical and categorical data.

Calculation

To calculate the mode, the data must be arranged in order, and the value that occurs most frequently is identified. In some cases, there may be more than one mode (bimodal or multimodal), or there may be no mode (when all values occur with equal frequency).

For example, if the dataset is {2, 4, 7, 7, 9, 10}, the mode would be 7.

When to Use Mode

The mode is useful when the frequency of values is important, such as in surveys or polls where respondents are asked to choose from a list of options. It can also be used for numerical data, such as in a dataset of test scores, where the mode represents the most common score.

Ryusei Kakujo

researchgatelinkedingithub

Focusing on data science for mobility

Bench Press 100kg!