blob: b32fc128baa606e177175c084c8b18db70299d0f [file] [log] [blame]
:py:mod:`airflow.providers.amazon.aws.operators.glue_crawler`
=============================================================
.. py:module:: airflow.providers.amazon.aws.operators.glue_crawler
Module Contents
---------------
Classes
~~~~~~~
.. autoapisummary::
airflow.providers.amazon.aws.operators.glue_crawler.GlueCrawlerOperator
airflow.providers.amazon.aws.operators.glue_crawler.AwsGlueCrawlerOperator
.. py:class:: GlueCrawlerOperator(config, aws_conn_id='aws_default', poll_interval = 5, wait_for_completion = True, **kwargs)
Bases: :py:obj:`airflow.models.BaseOperator`
Creates, updates and triggers an AWS Glue Crawler. AWS Glue Crawler is a serverless
service that manages a catalog of metadata tables that contain the inferred
schema, format and data types of data stores within the AWS cloud.
.. seealso::
For more information on how to use this operator, take a look at the guide:
:ref:`howto/operator:GlueCrawlerOperator`
:param config: Configurations for the AWS Glue crawler
:param aws_conn_id: aws connection to use
:param poll_interval: Time (in seconds) to wait between two consecutive calls to check crawler status
:param wait_for_completion: Whether or not wait for crawl execution completion. (default: True)
.. py:attribute:: ui_color
:annotation: = #ededed
.. py:method:: hook(self)
Create and return an GlueCrawlerHook.
.. py:method:: execute(self, context)
Executes AWS Glue Crawler from Airflow
:return: the name of the current glue crawler.
.. py:class:: AwsGlueCrawlerOperator(*args, **kwargs)
Bases: :py:obj:`GlueCrawlerOperator`
This operator is deprecated.
Please use :class:`airflow.providers.amazon.aws.operators.glue_crawler.GlueCrawlerOperator`.