Counting Sort

Distribute Education like Computer Geek

Sorting in linear time

In all the sorting algorithms we have read so far, the worst case is always equal to or greater than nlogn. If we have to reduce the time complexity, then for this we have to increase the space complexity. Because there is no shortage of space, we can give space.

We have three algorithms that solve the time complexity in linear time.

Counting Sort
Radix Sort
Bucket Sort

Counting Sort

Its working is simple but you will forget its working until you practice it 3 or 4 times. This sorting method does not sort by comparing elements. It sorts elements by counting those with unique key values.

The space complexity is O(n) because we use additional array in the sorting.
The time complexity is also O(n+k) in the best, average and worst case.
It uses only integer value
This algorithm can’t sort negative integers because we are counting the elements from 0 to positive integers, so it is not possible.
The sort can’t be used as a general-purpose sort because there are many restrictions or constraints in it.
The largest number in the input array will be used as key ‘k’. Because we have to create an additional array that goes from 0 to k. This sort was named counting sort because we count every key (0, k).
Example – let’s say input is 5, 1, 9, 25, 100, 3. The highest number is 100, so we have to make the additional array from 0 to 100.

Terms are denoted by

Input array is A[1…n], So Length[A] = n

Output array is B[1…n]

Additional array is C[0…k]

In this diagram, we have taken an input array A, an additional array C and an output array B.

In input array A, the largest number is 4. So, in array C, we only used integers from 0 to 4.

In diagram (b), Array C elements are initialized to 0.

Now counting sort starts.

In array A, the index 1 has a value 3, so in array C, index 3 is incremented by 1. Index 3 = 1.

In array A, the index 2 has a value 4, so in array C, index 4 is incremented by 1. Index 4 = 1.

In array A, the index 3 has a value 1, so in array C, index 1 is incremented by 1. Index 1 = 1.

In array A, the index 4 has a value 2, so in array C, index 2 is incremented by 1. Index 2 = 1.

In array A, the index 5 has a value 3, so in array C, index 3, whose value is 1, is now incremented by 1. Index 3 = 2

In array A, the index 6 has a value 4, so in array C, index 4, whose value is 1, is now incremented by 1. Index 4 = 2

This process is repeated till we cover all indexes of input array A.

Index 0 = 0

Index 1 = 3 because in input, 1 occurs 3 times.

Index 2 = 2 because in input, 2 occurs 2 times.

Index 3 = 3 because in input, 1 occurs 3 times.

Index 4 = 2 because in input, 4 occurs 2 times.

In diagram (C), we took the cumulative sum of array C.

Index 0 value is 0, so the cumulative sum is 0.

Index 1 value is 3, so the cumulative sum is “index 0 value + index 1 value” is 3.

Index 2 value is 2, so the cumulative sum is “index 1 value + index 2 value” is 5.

Index 3 value is 3, so the cumulative sum is “index 2 value + index 3 value” is 8.

Index 4 value is 2, so the cumulative sum is “index 3 value + index 4 value” is 10.

This is called the “offset” stored in array C.

Now we will fill the output sorted list through array A input list and array C offset list.

In index 1 of input list, the value is 3.

Now, we will see the value of index 3 of offset list in array C. The value is 8. As a result, set index 8 of the output sorted list to 3. Also, you have to decrement 8 to 7 in the offset list.

In index 2 of input list, the value is 4.

Now, we will see the value of index 4 of offset list in array C. The value is 10. As a result, set index 10 of the output sorted list to 4. Also, you have to decrement 10 to 9 in the offset list.

In index 3 of input list, the value is 1.

Now, we will see the value of index 1 of offset list in array C. The value is 3. As a result, set index 3 of the output sorted list to 1. Also, you have to decrement 3 to 2 in the offset list.

Same procedure goes that will fill the output sorted list.

The result is

Algorithm of Counting Sort

Enter the input array A[1…n].
Take out the maximum number in input array.
In array C, declare from 0 to maximum of A[] and initialize with 0.

Enter an input array A from 1 to n index. Take the maximum number in the array. Let’s say that index 1 is maximum. After that, you will compare all the indices with the maximum you choose, and if any index has a greater value, then point to that index.

From 0 to the maximum of array A, declare an array and name it C (Count).

for i <- 1 to n { C[A[i]]++; }

Traverse over all of the elements in Array A. There is some value in all the indexes. Take the value and move over to array C. The input array value is turned into an index in the count ‘C’ array. So, all you have to do is increase the value of that index in the count array by 1.

for j <- 1 to max { C[j] = C[j] + C[j-1]; }

Find the cumulative sum of the count array.

Index 1 -> index 0 + index 1

Index 2 -> index 1 + index 2, and so on…

for i <- 1 to n

{ B[C[A[i]]] = A[i]; }

You need to perform B[C[A[i]] = A[i] from index 1 to index n.

This B[C[A[i]] appears to be difficult, but it is not. We chose a value i say 1.

A[i] denotes the value at index 1 of the input array.

C[A[i]] denotes that the value has an index in array C that has a value.

B[C[A[i]] denotes that the value in array C has an index in sorted output array B, and the value of that index is A[i].

Program of Counting Sort in C

C++

Java

Python

C++

Java

Python

Time & Space Complexity of counting sort

In best, average & worst case, time complexity is O(n+k),

Where n is no of input and k is the maximum number in the input.

Space complexity is O(n+k), because B array contain n values and C array contain k values.

It contains a main array and an auxiliary (additional) array to perform the sort.

Advantages of counting sort

Time complexity is very less O(n + k).
Easy to code.
It is a stable sort.

Disadvantages of counting sort

This sort is not used for non-integer numbers.
If the value of k is very high, then choosing counting sort is not sensible.
Working is simple but there are several procedures which make it difficult.

Counting Sort

Sorting in linear time

Counting Sort

Program of Counting Sort in C

Time & Space Complexity of counting sort

In best, average & worst case, time complexity is O(n+k),

Space complexity is O(n+k), because B array contain n values and C array contain k values.

Advantages of counting sort

Disadvantages of counting sort

Test Yourself

Q1- Explain the working principle of counting sort.

Q2- What is the time complexity of counting sort in the best, average, and worst cases?

Q3- Explain the limitations of counting sort.

Q4- How does counting sort handle duplicate elements during the sorting process?

Q5- What is the space complexity of counting sort?

Q6- Explain why counting sort is considered a stable sorting algorithm.

Q7- Can counting sort be used to sort an array containing negative integers? Why or why not?

Q8- What is the main advantage of counting sort compared to other sorting algorithms?

Q9- Describe the steps involved in implementing counting sort.

Q10- Counting sort is based on

Comparison of elements

Counting occurrences of unique key values

Searching for the minimum and maximum elements

Rearranging elements based on their indices

Q11- What is the time complexity of counting sort in the worst case?

O(1)

O(n)

O(nlogn)

O(n + k)

Q12- What type of data can counting sort handle?

Integer values only

Non-integer values only

Custom data types

Both integer and non-integer values

Q13- Why is counting sort not suitable for sorting arrays with a large range of elements?

It has a high time complexity

It is not a stable sorting algorithm

It requires additional space for sorting

It may result in significant memory usage and inefficiency

Q14- Counting sort can be used as a general-purpose sorting algorithm for:

Integer values

Non-integer values

Small datasets

Large datasets

Q15- In counting sort, the count array stores:

Sorted elements

Occurrences of each element

Cumulative sum of elements

Indices of elements

BOOKS