blob: efeb1db3669dba3499884ce741bb18ec25d22d9c [file] [log] [blame]
.. Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information#
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.
Configuring AutoScale without using NetScaler
=============================================
.. note::
AutoScaling is currently not supported in the new UI.
To manage AutoScaling, please directly invoke the
respective APIs or use `cloudmonkey <https://github.com/apache/cloudstack-cloudmonkey>`_,
the CLI tool for cloudstack
What is AutoScaling?
--------------------
AutoScaling allows you to scale your back-end services or application Instances up
or down seamlessly and automatically according to the conditions you define.
With AutoScaling enabled, you can ensure that the number of Instances you are using
seamlessly scale up when demand increases, and automatically decreases when
demand subsides. Thus it helps you save compute costs by terminating underused
Instances automatically and launching new Instances when you need them, without the need
for manual intervention.
Hypervisor support
------------------
At that time, AutoScaling without NetScaler only supports for Xenserver. We
are working to support KVM also.
Prerequisites
-------------
Before you configure an AutoScale rule, consider the following:
- Ensure that the necessary Template is prepared before configuring AutoScale.
Firstly you must install the PV-driver, which helps Xenserver collect
performance parameters (CPU and memory) into Instances. Besides, When an Instance is
deployed by using a Template and when it comes up, the application should be
up and running.
Configuration
-------------
Specify the following:
.. image:: /_static/images/autoscale-config.png
- Template: A Template consists of a base OS image and application. A
Template is used to provision the new Instance of an application on a
scaleup action. When an Instance is deployed from a Template, it can start
taking the traffic from the load balancer without any admin intervention.
For example, if the Instance is deployed for a Web service, it should have the
Web server running, the database connected, and so on.
- Compute offering: A predefined set of virtual hardware attributes,
including CPU speed, number of CPUs, and RAM size, that the user can select
when creating a new Instance. Choose one of the compute offerings to be used
while provisioning an Instance as part of scaleup action.
- Min Instance: The minimum number of active Instances that is assigned to
a load balancing rule. The active Instances are the application
Instances that are up and serving the traffic, and are being load balanced.
This parameter ensures that a load balancing rule has at least the
configured number of active Instances are available to serve the traffic.
- Max Instance: Maximum number of active Instances that should be assigned
to a load balancing rule. This parameter defines the upper limit of active
Instances that can be assigned to a load balancing rule.
Specifying a large value for the maximum Instance parameter might result in
provisioning large number of Instances, which in turn leads to a single
load balancing rule exhausting the Instances limit specified at the
Account or domain level.
Specify the following scale-up and scale-down policies:
- Duration: The duration, in seconds, for which the conditions you specify
must be true to trigger a scaleup action. The conditions defined should
hold true for the entire duration you specify for an AutoScale action to be
invoked.
- Counter: The performance counters expose the state of the monitored
Instances. We added two new counter to work with that feature:
- Linux User CPU [native] - percentage
- Linux User RAM [native] - percentage
Remember to choose one of them. If you choose anything else, the
autoscaling will not work.
- Operator: The following five relational operators are supported in
AutoScale feature: Greater than, Less than, Less than or equal to, Greater
than or equal to, and Equal to.
- Threshold: Threshold value to be used for the counter. Once the counter
defined above breaches the threshold value, the AutoScale feature initiates
a scaleup or scaledown action.
- Add: Click Add to add the condition.
Additionally, if you want to configure the advanced settings, click Show
advanced settings, and specify the following:
- Polling interval: Frequency in which the conditions, combination of counter,
operator and threshold, are to be evaluated before taking a scale up or
down action. The default polling interval is 30 seconds.
- Quiet Time: This is the cool down period after an AutoScale action is
initiated. The time includes the time taken to complete provisioning an
Instance from its Template and the time taken by an application to be ready
to serve traffic. This quiet time allows the fleet to come up to a stable
state before any action can take place. The default is 300 seconds.
- Destroy Instance Grace Period: The duration in seconds, after a scaledown action
is initiated, to wait before the Instance is destroyed as part of scaledown
action. This is to ensure graceful close of any pending sessions or
transactions being served by the Instance marked for destroy. The default is 120
seconds.
- Apply: Click Apply to create the AutoScale configuration.
Disabling and Enabling an AutoScale Configuration
-------------------------------------------------
If you want to perform any maintenance operation on the AutoScale Instances,
disable the AutoScale configuration. When the AutoScale configuration is
disabled, no scaleup or scaledown action is performed. You can use this
downtime for the maintenance activities. To disable the AutoScale
configuration, click the Disable AutoScale button.
The button toggles between enable and disable, depending on whether AutoScale
is currently enabled or not. After the maintenance operations are done, you
can enable the AutoScale configuration back. To enable, open the AutoScale
configuration page again, then click the Enable AutoScale button.
Updating an AutoScale Configuration
-----------------------------------
You can update the various parameters and add or delete the conditions in a
scaleup or scaledown rule. Before you update an AutoScale configuration,
ensure that you disable the AutoScale load balancer rule by clicking the
Disable AutoScale button.
After you modify the required AutoScale parameters, click Apply. To apply the
new AutoScale policies, open the AutoScale configuration page again, then
click the Enable AutoScale button.
Runtime Considerations
----------------------
An administrator should not assign an Instance to a load balancing rule which is
configured for AutoScale.
Making API calls outside the context of AutoScale, such as destroyVM, on an
autoscaled Instance leaves the load balancing configuration in an inconsistent state.
Even though the Instance is destroyed from the load balancer rule, it continues to be shown
as a service assigned to a rule inside the context of AutoScale.