Scalability describes a system's elasticity: its ability to handle an increase in workload without sacrificing performance. A scalable system can grow to manage an increasing volume of requests (traffic) from users over time.
The servers should be powerful enough to handle increased traffic loads so that there is no decline in service quality.
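To make the link between traffic and service quality concrete, here is a minimal sketch that treats a single server as an M/M/1 queue, a standard textbook model in which mean response time is 1 / (service rate - arrival rate). The function name `mean_response_time` and the capacity of 100 requests per second are assumptions chosen for illustration, not measurements of any real system.

```python
def mean_response_time(arrival_rate: float, service_rate: float) -> float:
    """Mean time (seconds) a request spends in an M/M/1 queue, waiting plus service."""
    if arrival_rate >= service_rate:
        # The server is saturated: the queue grows without bound.
        return float("inf")
    return 1.0 / (service_rate - arrival_rate)


# Hypothetical capacity: the server can process 100 requests per second.
SERVICE_RATE = 100.0

for load in (10, 50, 90, 99):
    latency_ms = mean_response_time(load, SERVICE_RATE) * 1000
    print(f"{load:>3} req/s -> {latency_ms:.1f} ms mean response time")
```

Under this simplified model, response time stays low while the server has headroom and climbs sharply as traffic approaches capacity, which is the decline in service quality that a scalable system is meant to avoid.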
A poorly designed system either hits a bottleneck that caps the volume of traffic it can handle, or incurs costs that grow disproportionately with even a small increase in traffic. There are two ways of scaling an application: