Serverless
Managing Instances
Learn how to start, stop, and manage your serverless instances
Instance States and Controls
AtlasCloud Serverless instances can be managed to optimize costs and resource usage. You have full control to start and stop your instances as needed.
Instance States
Your serverless instance can be in one of the following states:
State | Workers Display | Status Indicator | Available Actions | Description |
---|---|---|---|---|
Running | 1/1 Running | ✅ Ready (green) | Stop button | Instance is active and accepting requests |
Stopped | 0/1 Running | Idle | Start button (black) | Instance is paused, no charges incurred |
Starting a Stopped Instance
When your serverless instance is stopped, you can quickly resume it:
When to Start
- Your instance shows
0/1 Running
status - You need to make the service available for requests
- You want to resume a previously paused service
How to Start
- Locate your stopped instance in the dashboard - it will show
0/1 Running
with a black Start button - Click the Start button (▶️)
- Confirm the action when prompted with "Are you sure to start this endpoint?"
- Wait for activation - the instance will transition to running state
What Happens
- The instance state changes from Stopped to Active
- Worker pods are created (takes 30-60 seconds)
- The status indicator changes to ✅ Ready
- Billing resumes for the running instance
- Your API endpoint becomes available for requests
Stopping a Running Instance
You can stop running instances to save costs when they're not needed:
When to Stop
- Your instance shows
1/1 Running
with ✅ Ready status - You want to pause the service to save costs
- The service is not currently needed but you plan to use it later
How to Stop
- Locate your running instance - it will show
1/1 Running
with ✅ Ready status - Click the Stop button (⏹️)
- Wait for the instance to stop - the transition happens quickly
What Happens
- The instance state changes from Active to Stopped
- Worker pods are terminated (takes 5-15 seconds)
- The display updates to show
0/1 Running
- Billing stops immediately - no charges while stopped
- Your API endpoint URL remains valid but won't respond to requests
Cost Management
Active Instance
- Billing: Charged per second based on GPU type and worker count
- Cost Formula: Duration × Unit Price × GPUs per Worker
- Auto-scaling: Can scale between Active Workers and Max Workers
Stopped Instance
- Billing: No charges - 100% cost savings
- Persistent: Your configuration and endpoint URL are preserved
- Quick Resume: Can be restarted anytime with the Start button
Cost Optimization Tips
- Stop during idle periods: Stop instances when not actively in use
- Monitor usage patterns: Identify optimal times to stop/start
- Overnight savings: Consider stopping development instances outside business hours
- Weekend optimization: Stop non-production instances during weekends
- Quick resume: Instances restart in under a minute when needed
Important Considerations
Service Availability
- Stopped instances: The URL remains valid but returns no response
- Starting latency: Expect 30-60 seconds for the instance to become fully available
- No data loss: Stopping an instance doesn't affect your configuration or mounted storage
Auto-scaling Behavior
- Stopped instances: No auto-scaling (effectively scaled to zero)
- Active instances: Full auto-scaling between Active Workers and Max Workers
- Scale-to-zero: Active instances with Active Workers set to 0 can auto-scale to zero during idle periods
Logs and Monitoring
- Metrics: Available for both stopped and active instances
- Historical logs: Preserved across start/stop cycles
- Events: Start/stop operations are logged in your instance's event history
Troubleshooting
Instance Won't Start
- Ensure you have sufficient account balance
- Check that the instance is in Stopped state
- Verify you have permission to manage the instance
- Wait a moment and refresh if the UI hasn't updated
Instance Won't Stop
- Ensure the instance is in Active/Running state
- Allow current requests to complete (graceful shutdown)
- Check that you have permission to manage the instance
Status Inconsistency
- Refresh the page to see the latest status
- Allow 30-60 seconds for state transitions to complete
- Check the instance logs for any error messages
Best Practices
- Plan your usage: Stop instances during known idle periods
- Monitor before stopping: Check if there are active requests
- Graceful shutdown: Allow ongoing requests to complete
- Development workflow: Stop development instances when not coding
- Testing optimization: Start for testing, stop immediately after
- Production planning: Use monitoring data to optimize start/stop schedules