The sum of squares formula is used to calculate the sum of two or more squares in a given expression. In statistics, it helps describe how well a model represents the data being modeled, and it measures how far the data deviate from their mean value. It is calculated by subtracting the mean from each data value, squaring each difference, and adding the squares together.
Sum of Squares Formula
Concept of the sum of squares
The sum of squares is a very useful tool for statisticians and scientists. It is used to evaluate the overall variation of a data set around its mean value. A large sum of squares indicates a large variance, meaning that individual readings fluctuate widely around the mean.
This information is useful in many situations. For example, a large variance in blood pressure readings over some period of time may point to instability in the cardiovascular system that requires medical attention.
For financial advisors, a large variance in daily stock values indicates market instability and higher risk for investors. Dividing the sum of squares by n - 1 gives the sample variance, and taking the square root of the variance gives the standard deviation, an even more useful number.
The formula of Sum of Squares:
The calculation of sample variance is typically stated as a fraction. The numerator of this fraction is the sum of squared deviations from the mean. In statistics, the formula for this total sum of squares is:
\(\Sigma ( x_i - \bar x ) ^2 \)
Where,
\(\bar x\) | The sample mean |
\(x_i\) | The ith data value from the sample |
\(\Sigma\) | Sum |
\(( x_i - \bar x )\) | The difference between a data value and the mean |
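The definition above translates almost directly into code. The following is a minimal Python sketch, not a reference implementation; the function name `sum_of_squares` and the small data set are made up for illustration.

```python
def sum_of_squares(data):
    """Sum of squared deviations from the sample mean (definition formula)."""
    n = len(data)
    if n == 0:
        raise ValueError("data must contain at least one value")
    mean = sum(data) / n  # sample mean, x-bar
    # Square each deviation (x_i - x-bar) and add the squares together
    return sum((x - mean) ** 2 for x in data)


# Illustrative data: the mean is 5, so the deviations are -3, -1, 1, 3
print(sum_of_squares([2, 4, 6, 8]))  # 20.0
```

Here the squared deviations are 9, 1, 1 and 9, which add up to 20.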
This formula works for any calculation, but there is an equivalent shortcut formula that does not require calculating the sample mean first. The shortcut formula for the sum of squares is:
\(\Sigma ( x_i^2 ) - \frac {( \Sigma x_i ) ^2 } { n } \)
Where,
n | Sample data size |
\(x_i\) | ith data from the sample. |
\(\Sigma\) | Sum |
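The two formulas agree because expanding the square gives \(\Sigma ( x_i - \bar x ) ^2 = \Sigma x_i^2 - 2 \bar x \Sigma x_i + n \bar x ^2 \), and substituting \(\bar x = \frac { \Sigma x_i } { n } \) leaves \(\Sigma x_i^2 - \frac {( \Sigma x_i ) ^2 } { n } \). A minimal Python sketch of the shortcut follows; the function name `sum_of_squares_shortcut` and the data are illustrative assumptions.

```python
def sum_of_squares_shortcut(data):
    """Sum of squares via the shortcut formula: sum(x^2) - (sum(x))^2 / n."""
    n = len(data)
    if n == 0:
        raise ValueError("data must contain at least one value")
    total = sum(data)                            # sum of the data values
    total_of_squares = sum(x * x for x in data)  # sum of the squared values
    return total_of_squares - total ** 2 / n


# Illustrative data: sum(x^2) = 120 and (sum(x))^2 / n = 400 / 4 = 100
print(sum_of_squares_shortcut([2, 4, 6, 8]))  # 20.0
```

With floating-point numbers the shortcut subtracts two values that can be large and nearly equal, so it may lose precision when the data are large relative to their spread. In that case the mean-based form is usually the safer choice.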
Solved Examples
Q.1: Compute the sum of squares of the following data using the mean calculation.
X: 74.01, 74.77, 73.94, 73.61, 73.40
Solution:
The given sample is ungrouped data. First, we have to calculate the mean of the data.
The formula for the mean value is:
\(\bar X = \frac { \text{Sum of all data values} } { \text{Sample data size} } = \frac { \Sigma x_i } { n } \)
Where \(\Sigma x_i\) is the sum of the data and n is the number of sample data.
\(\bar X = \frac {74.01 + 74.77 + 73.94 + 73.61 + 73.40 } {5}\)
= \(\frac { 369.73 } { 5 } \)
= 73.946 ≈ 73.95 (rounded to two decimal places)
Now we apply the formula for computing the sum of squares using the mean value:
\(\Sigma ( x_i - \bar x ) ^2 \)
= (74.01 – 73.95)² + (74.77 – 73.95)² + (73.94 – 73.95)² + (73.61 – 73.95)² + (73.40 – 73.95)²
= (0.06)² + (0.82)² + (-0.01)² + (-0.34)² + (-0.55)²
= 1.0942
Therefore, the sum of squares is 1.0942 when the mean is rounded to 73.95 (about 1.0941 with the unrounded mean 73.946).
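The arithmetic in this example can be checked with a short script. The following Python sketch is only illustrative: the variable names are made up, and dividing by n - 1 to get the sample variance is a common convention assumed here, not something the example itself states.

```python
import math

data = [74.01, 74.77, 73.94, 73.61, 73.40]
n = len(data)

mean = sum(data) / n  # unrounded sample mean
# Sum of squares with the unrounded mean
ss = sum((x - mean) ** 2 for x in data)
# Sum of squares with the mean rounded to two decimals, as in the hand calculation
ss_rounded_mean = sum((x - round(mean, 2)) ** 2 for x in data)

# Assumption: sample variance uses n - 1 in the denominator
variance = ss / (n - 1)
std_dev = math.sqrt(variance)

print(round(mean, 3))             # 73.946
print(round(ss, 4))               # 1.0941
print(round(ss_rounded_mean, 4))  # 1.0942
print(round(std_dev, 3))          # 0.523
```

The small gap between 1.0941 and 1.0942 comes entirely from rounding the mean before squaring the deviations.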