
 .. Licensed to the Apache Software Foundation (ASF) under one
    or more contributor license agreements.  See the NOTICE file
    distributed with this work for additional information
    regarding copyright ownership.  The ASF licenses this file
    to you under the Apache License, Version 2.0 (the
    "License"); you may not use this file except in compliance
    with the License.  You may obtain a copy of the License at
    http://www.apache.org/licenses/LICENSE-2.0
    Unless required by applicable law or agreed to in writing,
    software distributed under the License is distributed on an
    "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
    KIND, either express or implied.  See the License for the
    specific language governing permissions and limitations
    under the License.


 Configuring AutoScale without using NetScaler
 =============================================

 .. note::
    AutoScaling is currently not supported in the new UI.
    To manage AutoScaling, please directly invoke the
    respective APIs or use `cloudmonkey <https://github.com/apache/cloudstack-cloudmonkey>`_,
    the CLI tool for CloudStack.

 What is AutoScaling?
 --------------------

 AutoScaling allows you to scale your back-end services or application Instances up
 or down seamlessly and automatically according to the conditions you define.
 With AutoScaling enabled, the number of Instances you are using
 increases automatically when demand rises and decreases automatically when
 demand subsides. It therefore helps you save compute costs by terminating underused
 Instances and launching new Instances when you need them, without
 manual intervention.


 Hypervisor support
 ------------------

 At this time, AutoScaling without NetScaler supports only XenServer. We
 are working to support KVM as well.


 Prerequisites
 -------------

 Before you configure an AutoScale rule, consider the following:

 -  Ensure that the necessary Template is prepared before configuring AutoScale.
    First, install the PV driver in the Template; it helps XenServer collect
    performance parameters (CPU and memory) from Instances. In addition, when an
    Instance is deployed from the Template and comes up, the application should be
    up and running.


 Configuration
 -------------

 Specify the following:

 .. image:: /_static/images/autoscale-config.png

 -  Template: A Template consists of a base OS image and application. A
    Template is used to provision the new Instance of an application on a
    scaleup action. When an Instance is deployed from a Template, it should be able to start
    taking traffic from the load balancer without any admin intervention.
    For example, if the Instance is deployed for a Web service, it should have the
    Web server running, the database connected, and so on.

 -  Compute offering: A predefined set of virtual hardware attributes,
    including CPU speed, number of CPUs, and RAM size, that the user can select
    when creating a new Instance. Choose the compute offering to be used
    when provisioning an Instance as part of a scaleup action.

 -  Min Instance: The minimum number of active Instances assigned to
    a load balancing rule. Active Instances are the application
    Instances that are up, serving traffic, and being load balanced.
    This parameter ensures that a load balancing rule always has at least the
    configured number of active Instances available to serve traffic.

 -  Max Instance: The maximum number of active Instances that can be assigned
    to a load balancing rule. This parameter defines the upper limit of active
    Instances for the rule.

    Specifying a large value for the Max Instance parameter might result in
    provisioning a large number of Instances, which in turn can lead to a single
    load balancing rule exhausting the Instance limit specified at the
    Account or domain level.
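
 The relationship between Min Instance, Max Instance, and the Account limit can
 be sketched in Python. This is a minimal illustration under stated assumptions,
 not CloudStack's implementation: the function names, and the idea of comparing
 Max Instance against the remaining Account headroom, are hypothetical.

```python
def clamp_instance_count(desired: int, min_instances: int, max_instances: int) -> int:
    """Keep a desired Instance count within the Min Instance / Max Instance bounds."""
    if min_instances > max_instances:
        raise ValueError("min_instances must not exceed max_instances")
    return max(min_instances, min(desired, max_instances))


def exceeds_account_limit(max_instances: int, account_limit: int, instances_in_use: int) -> bool:
    """Return True if this rule alone could exhaust the Account's Instance limit.

    Hypothetical check: instances_in_use counts Instances the Account already
    runs outside this load balancing rule.
    """
    headroom = max(account_limit - instances_in_use, 0)
    return max_instances > headroom


# A rule configured with Min Instance = 2 and Max Instance = 5.
print(clamp_instance_count(0, 2, 5))   # never drops below the minimum
print(clamp_instance_count(9, 2, 5))   # never grows past the maximum
print(exceeds_account_limit(10, account_limit=20, instances_in_use=15))
```

 A check like ``exceeds_account_limit`` is one way to sanity-check a Max Instance
 value before applying it; the actual limit enforcement is done by CloudStack.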

 Specify the following scale-up and scale-down policies:

 -  Duration: The duration, in seconds, for which the conditions you specify
    must be true to trigger a scaleup action. The conditions defined should
    hold true for the entire duration you specify for an AutoScale action to be
    invoked.

 -  Counter: The performance counters expose the state of the monitored
    Instances. Two counters were added to work with this feature:

    -  Linux User CPU [native] - percentage
    -  Linux User RAM [native] - percentage

    Choose one of these two counters. If you choose any other counter,
    AutoScaling will not work.

 -  Operator: The following five relational operators are supported by the
    AutoScale feature: Greater than, Less than, Less than or equal to, Greater
    than or equal to, and Equal to.

 -  Threshold: Threshold value to be used for the counter. Once the counter
    defined above breaches the threshold value, the AutoScale feature initiates
    a scaleup or scaledown action.

 -  Add: Click Add to add the condition.

    Additionally, if you want to configure the advanced settings, click Show
    advanced settings, and specify the following:

 -  Polling interval: The frequency at which the conditions (each a combination
    of counter, operator, and threshold) are evaluated before a scaleup or
    scaledown action is taken. The default polling interval is 30 seconds.

 -  Quiet Time: The cool-down period after an AutoScale action is
    initiated. It includes the time taken to complete provisioning an
    Instance from its Template and the time taken by the application to be ready
    to serve traffic. The quiet time allows the fleet to reach a stable
    state before any further action takes place. The default is 300 seconds.

 -  Destroy Instance Grace Period: The time, in seconds, to wait after a
    scaledown action is initiated before the Instance is destroyed. This
    allows any pending sessions or transactions being served by the Instance
    marked for destruction to close gracefully. The default is 120
    seconds.

 -  Apply: Click Apply to create the AutoScale configuration.
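
 How the policy settings above interact can be sketched in Python. This is an
 illustrative model only, not CloudStack's evaluation code. It assumes that all
 conditions in a policy must hold together, that one sample is taken per polling
 interval, and that the duration is covered by the most recent samples. The
 names ``Condition`` and ``should_trigger`` are hypothetical.

```python
import operator
from dataclasses import dataclass

# The five relational operators named above, mapped to Python comparisons.
OPERATORS = {
    "Greater than": operator.gt,
    "Less than": operator.lt,
    "Less than or equal to": operator.le,
    "Greater than or equal to": operator.ge,
    "Equal to": operator.eq,
}


@dataclass
class Condition:
    counter: str        # e.g. "Linux User CPU [native] - percentage"
    operator_name: str  # one of the keys in OPERATORS
    threshold: float

    def holds(self, sample: dict) -> bool:
        """Compare the sampled counter value against the threshold."""
        return OPERATORS[self.operator_name](sample[self.counter], self.threshold)


def should_trigger(conditions, samples, polling_interval, duration,
                   seconds_since_last_action, quiet_time):
    """Decide whether a scaleup or scaledown action may be invoked.

    samples is a list of dicts (counter name -> value), oldest first,
    with one entry per polling interval. Assumption: every condition
    must hold for every sample that falls within the duration.
    """
    # Still inside the quiet time of the previous action: do nothing.
    if seconds_since_last_action < quiet_time:
        return False
    # Number of consecutive samples needed to cover the duration (ceiling division).
    needed = max(1, -(-duration // polling_interval))
    if len(samples) < needed:
        return False
    recent = samples[-needed:]
    return all(c.holds(s) for s in recent for c in conditions)


CPU = "Linux User CPU [native] - percentage"
scale_up = [Condition(CPU, "Greater than", 80.0)]
history = [{CPU: 85.0}, {CPU: 90.0}, {CPU: 95.0}]
print(should_trigger(scale_up, history, polling_interval=30, duration=60,
                     seconds_since_last_action=400, quiet_time=300))
```

 With the default polling interval of 30 seconds and a duration of 60 seconds,
 this model requires the two most recent samples to satisfy every condition.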


 Disabling and Enabling an AutoScale Configuration
 -------------------------------------------------

 If you want to perform any maintenance operation on the AutoScale Instances,
 disable the AutoScale configuration. When the AutoScale configuration is
 disabled, no scaleup or scaledown action is performed. You can use this
 downtime for the maintenance activities. To disable the AutoScale
 configuration, click the Disable AutoScale button.

 The button toggles between enable and disable, depending on whether AutoScale
 is currently enabled. After the maintenance operations are done, you
 can re-enable the AutoScale configuration. To do so, open the AutoScale
 configuration page again, then click the Enable AutoScale button.


 Updating an AutoScale Configuration
 -----------------------------------

 You can update the various parameters and add or delete the conditions in a
 scaleup or scaledown rule. Before you update an AutoScale configuration,
 ensure that you disable the AutoScale load balancer rule by clicking the
 Disable AutoScale button.

 After you modify the required AutoScale parameters, click Apply. To apply the
 new AutoScale policies, open the AutoScale configuration page again, then
 click the Enable AutoScale button.
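
 When AutoScale is managed through the API rather than the UI, the same
 disable, update, enable order applies. The Python sketch below only builds the
 request parameters for each step. It assumes the CloudStack API commands
 ``disableAutoScaleVmGroup``, ``updateAutoScaleVmGroup``, and
 ``enableAutoScaleVmGroup`` and the ``maxmembers`` parameter; verify these
 against the API reference for your CloudStack version. The helper names and
 the ``GROUP-UUID`` placeholder are hypothetical, and request signing and the
 endpoint URL are deliberately omitted.

```python
from urllib.parse import urlencode


def update_sequence(group_id, **changes):
    """Return the API calls, in order, for updating an AutoScale group.

    changes holds the group parameters to modify, for example maxmembers=6.
    """
    return [
        {"command": "disableAutoScaleVmGroup", "id": group_id},
        {"command": "updateAutoScaleVmGroup", "id": group_id, **changes},
        {"command": "enableAutoScaleVmGroup", "id": group_id},
    ]


def to_query(params):
    """Encode one call as an unsigned query string with sorted parameters."""
    return urlencode(sorted(params.items()))


for call in update_sequence("GROUP-UUID", maxmembers=6):
    print(to_query(call))
```

 Each step should complete before the next one is issued, so that the update is
 never applied to a group that is still enabled.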


 Runtime Considerations
 ----------------------

 An administrator should not assign an Instance to a load balancing rule which is
 configured for AutoScale.

 Making API calls outside the context of AutoScale, such as destroyVM, on an
 autoscaled Instance leaves the load balancing configuration in an inconsistent state.
 Even though the Instance is destroyed from the load balancer rule, it continues to be shown
 as a service assigned to a rule inside the context of AutoScale.