chart/docs/keda.rst - airflow - Git at Google

  .. Licensed to the Apache Software Foundation (ASF) under one
     or more contributor license agreements.  See the NOTICE file
     distributed with this work for additional information
     regarding copyright ownership.  The ASF licenses this file
     to you under the Apache License, Version 2.0 (the
     "License"); you may not use this file except in compliance
     with the License.  You may obtain a copy of the License at

  ..   http://www.apache.org/licenses/LICENSE-2.0

  .. Unless required by applicable law or agreed to in writing,
     software distributed under the License is distributed on an
     "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
     KIND, either express or implied.  See the License for the
     specific language governing permissions and limitations
     under the License.

 Autoscaling with KEDA
 =====================

 KEDA stands for Kubernetes Event Driven Autoscaling.
 `KEDA <https://github.com/kedacore/keda>`__ is a custom controller that
 allows users to create custom bindings to the Kubernetes `Horizontal Pod
 Autoscaler <https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/>`__.
 The autoscaler will adjust the number of active Celery workers based on the number
 of tasks in ``queued`` or ``running`` state.

 One advantage of KEDA is that it allows you to scale your application to/from 0 workers, meaning no workers are idle when there are no tasks.

 KEDA Installation and usage
 ---------------------------

 To install KEDA in your Kubernetes cluster, run the following commands:

 .. code-block:: bash

    helm repo add kedacore https://kedacore.github.io/charts
    helm repo update
    kubectl create namespace keda
    helm install keda kedacore/keda \
      --namespace keda \
      --version "v2.0.0"

 To enable KEDA for the Airflow instance, it has to be enabled by setting ``workers.celery.keda.enabled=true``
 in your Helm command or in the ``values.yaml`` like:

 .. code-block:: bash

    kubectl create namespace airflow
    helm repo add apache-airflow https://airflow.apache.org
    helm install airflow apache-airflow/airflow \
      --namespace airflow \
      --set executor=CeleryExecutor \
      --set workers.celery.keda.enabled=true

 .. note::

    Make sure ``values.yaml`` shows that either KEDA or HPA is enabled, but not both. It is recommended not
    to use both KEDA and HPA to scale the same workload. They will compete with each other resulting in odd scaling behavior.

 After installation, the KEDA ``ScaledObject`` and an ``HPA`` will be created in the Airflow namespace.

 In the default configuration, KEDA will derive the desired number of Celery workers by querying Airflow metadata database with following SQL statement:

 .. code-block:: none

    SELECT
      ceil(COUNT(*)::decimal / {{ .Values.config.celery.worker_concurrency }})
    FROM
      task_instance
    WHERE
      (state='running' OR state='queued')
      AND queue IN <queue names>

 where ``<queue names>`` is a list of queue names used by
 `Celery worker queues <https://airflow.apache.org/docs/apache-airflow-providers-celery/stable/celery_executor.html#queues>`_
 mechanism (with default configuration it has one element ``default``).

 .. note::

    Set Celery worker concurrency through the Helm Chart value
    ``config.celery.worker_concurrency`` (e.g. instead of airflow.cfg or
    environment variables), so that the KEDA trigger will be consistent with
    the worker concurrency setting.

 Triggers (aka Scalers)
 ----------------------

 Triggers refer to the metrics (or formulae) that KEDA should refer to when scaling workers.

 It is recommended to use multiple triggers within a ScaledObject, rather than creating different objects for different triggers.
 This keeps all your rules and formulae in one place, and it avoids multiple ScaledObjects being created by the same workload.

 ScaledObject
 ------------

 To configure KEDA's triggers and scaling behaviors, you need to create a ScaledObject. Below ScaledObject parameters:

 * ``cooldownPeriod`` specifies the number of seconds to wait before downscaling to 0 workers, does not apply to downscaling to n workers while n >= 1.
 * ``idleReplicaCount`` can be set to any number less than ``minReplicaCount``, but it must be set to 0, otherwise KEDA will not work. Change ``minReplicaCount`` to n > 0 if you need idle workers.

 Triggerers value ``targetQueryValue`` is used as ``TargetValue`` of workers, which must be between ScaledObject ``minReplicaCount`` and ``maxReplicaCount`` values.

 .. note::

    To avoid strange behavior, best practice is to set ``cooldownPeriod`` to an integer slightly larger than ``terminationGracePeriodSeconds`` so that your cluster does not downscale to 0 workers before cleanup is finished.

 Metrics
 -------

 The HPA controller, refreshes metrics defined in triggers every ``--horizontal-pod-autoscaler-sync-period`` and the values are routed to
 KEDA Metrics Server directly. To reduce the load on the KEDA Scaler, you can set ``useCachedMetrics`` to true, to enabling reading metrics
 from cache first. Cache is updated periodically every ``pollingInterval``.

 .. note::

    When number of workers = 0, KEDA will still poll for metrics using ``pollingInterval``.
    When number of workers >= 1, both KEDA and the HPA will poll your defined triggers.

 KEDA offers two ``metricTypes`` that provide more granular scaling control than the standard HPA ``Target`` metric:

 * AverageValue (default) controls a per-worker average.
 * Value controls total system load.
	.. Licensed to the Apache Software Foundation (ASF) under one
	or more contributor license agreements. See the NOTICE file
	distributed with this work for additional information
	regarding copyright ownership. The ASF licenses this file
	to you under the Apache License, Version 2.0 (the
	"License"); you may not use this file except in compliance
	with the License. You may obtain a copy of the License at

	.. http://www.apache.org/licenses/LICENSE-2.0

	.. Unless required by applicable law or agreed to in writing,
	software distributed under the License is distributed on an
	"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
	KIND, either express or implied. See the License for the
	specific language governing permissions and limitations
	under the License.

	Autoscaling with KEDA
	=====================

	KEDA stands for Kubernetes Event Driven Autoscaling.
	`KEDA <https://github.com/kedacore/keda>`__ is a custom controller that
	allows users to create custom bindings to the Kubernetes `Horizontal Pod
	Autoscaler <https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/>`__.
	The autoscaler will adjust the number of active Celery workers based on the number
	of tasks in ``queued`` or ``running`` state.

	One advantage of KEDA is that it allows you to scale your application to/from 0 workers, meaning no workers are idle when there are no tasks.

	KEDA Installation and usage
	---------------------------

	To install KEDA in your Kubernetes cluster, run the following commands:

	.. code-block:: bash

	helm repo add kedacore https://kedacore.github.io/charts
	helm repo update
	kubectl create namespace keda
	helm install keda kedacore/keda \
	--namespace keda \
	--version "v2.0.0"

	To enable KEDA for the Airflow instance, it has to be enabled by setting ``workers.celery.keda.enabled=true``
	in your Helm command or in the ``values.yaml`` like:

	.. code-block:: bash

	kubectl create namespace airflow
	helm repo add apache-airflow https://airflow.apache.org
	helm install airflow apache-airflow/airflow \
	--namespace airflow \
	--set executor=CeleryExecutor \
	--set workers.celery.keda.enabled=true

	.. note::

	Make sure ``values.yaml`` shows that either KEDA or HPA is enabled, but not both. It is recommended not
	to use both KEDA and HPA to scale the same workload. They will compete with each other resulting in odd scaling behavior.

	After installation, the KEDA ``ScaledObject`` and an ``HPA`` will be created in the Airflow namespace.

	In the default configuration, KEDA will derive the desired number of Celery workers by querying Airflow metadata database with following SQL statement:

	.. code-block:: none

	SELECT
	ceil(COUNT(*)::decimal / {{ .Values.config.celery.worker_concurrency }})
	FROM
	task_instance
	WHERE
	(state='running' OR state='queued')
	AND queue IN <queue names>

	where ``<queue names>`` is a list of queue names used by
	`Celery worker queues <https://airflow.apache.org/docs/apache-airflow-providers-celery/stable/celery_executor.html#queues>`_
	mechanism (with default configuration it has one element ``default``).

	.. note::

	Set Celery worker concurrency through the Helm Chart value
	``config.celery.worker_concurrency`` (e.g. instead of airflow.cfg or
	environment variables), so that the KEDA trigger will be consistent with
	the worker concurrency setting.

	Triggers (aka Scalers)
	----------------------

	Triggers refer to the metrics (or formulae) that KEDA should refer to when scaling workers.

	It is recommended to use multiple triggers within a ScaledObject, rather than creating different objects for different triggers.
	This keeps all your rules and formulae in one place, and it avoids multiple ScaledObjects being created by the same workload.

	ScaledObject
	------------

	To configure KEDA's triggers and scaling behaviors, you need to create a ScaledObject. Below ScaledObject parameters:

	* ``cooldownPeriod`` specifies the number of seconds to wait before downscaling to 0 workers, does not apply to downscaling to n workers while n >= 1.
	* ``idleReplicaCount`` can be set to any number less than ``minReplicaCount``, but it must be set to 0, otherwise KEDA will not work. Change ``minReplicaCount`` to n > 0 if you need idle workers.

	Triggerers value ``targetQueryValue`` is used as ``TargetValue`` of workers, which must be between ScaledObject ``minReplicaCount`` and ``maxReplicaCount`` values.

	.. note::

	To avoid strange behavior, best practice is to set ``cooldownPeriod`` to an integer slightly larger than ``terminationGracePeriodSeconds`` so that your cluster does not downscale to 0 workers before cleanup is finished.

	Metrics
	-------

	The HPA controller, refreshes metrics defined in triggers every ``--horizontal-pod-autoscaler-sync-period`` and the values are routed to
	KEDA Metrics Server directly. To reduce the load on the KEDA Scaler, you can set ``useCachedMetrics`` to true, to enabling reading metrics
	from cache first. Cache is updated periodically every ``pollingInterval``.

	.. note::

	When number of workers = 0, KEDA will still poll for metrics using ``pollingInterval``.
	When number of workers >= 1, both KEDA and the HPA will poll your defined triggers.

	KEDA offers two ``metricTypes`` that provide more granular scaling control than the standard HPA ``Target`` metric:

	* AverageValue (default) controls a per-worker average.
	* Value controls total system load.