Use Cases¶
Suitable Cases¶
- High concurrency business with short connections
 - Businesses that require elastic scaling
 - A large number of applications need to scale down to 0 to improve resource utilization
 - AI Serving services that scale based on specific metrics
 
Unsuitable Cases¶
- Long-lived connection business
 - Latency-sensitive business
 - Traffic splitting based on cookies
 - Traffic splitting based on headers