Resource provisioning is the process of creating and allocating cloud resources such as:
Resources are provided on demand within minutes using a cloud portal, CLI, or automation tools.
Scaling is the ability to increase or decrease resources based on workload demand.
There are two types of scaling:
Vertical Scaling (Scale Up / Down)
Increase or decrease CPU, RAM, or storage of a server.
Horizontal Scaling (Scale Out / In)
Add or remove multiple servers to handle more or less traffic.
Choose a cloud platform:
Select required resources:
Define:
Launch resources using:
Track:
Set rules like:
Configure auto scaling group or scale set.
Distribute traffic across multiple servers.
Cloud automatically adds or removes servers based on demand.
An e-commerce website experiences high traffic during sales.
Create EC2 launch template with:
Startup script:
#!/bin/bash
sudo apt update
sudo apt install apache2 -y
echo "Welcome to Auto Scaling Web App" > /var/www/html/index.html
Minimum instances: 1
Maximum instances: 5
Desired instances: 2
Create Application Load Balancer
Attach Auto Scaling Group
Scale out when CPU > 70%
Scale in when CPU < 30%
Normal traffic: 2 servers running
High traffic: auto scales to 5 servers
Low traffic: scales back to 1 server
Cloud resource provisioning and scaling allow applications to:
Run efficiently under any workload
Automatically adapt to demand
Reduce operational cost
Improve performance and reliability
It is a core feature of modern cloud computing.