| Customizing Execution |
| ---------------------------------- |
| |
| |
| The Driver |
| ---------------------------------- |
| The Hamilton Driver by default has the following behaviors: |
| |
#. It is single threaded, and runs on the machine you call ``execute()`` from.
#. It is limited to the memory available on your machine.
#. ``execute()`` by default returns a pandas DataFrame (when you construct the ``Driver`` directly; ``driver.Builder`` defaults to a dictionary -- see below).
#. Nothing besides the node's function is executed when a node runs.
| |
| To change these behaviors, we need to introduce two concepts: |
| |
| #. A Result Builder -- this is how we tell Hamilton what kind of object we want to return when we call ``execute()``. |
#. A Graph Adapter -- this is how we tell Hamilton where and how functions should be executed.
| |
| Customizing Execution |
| ##################### |
| |
| The following tools are all part of the lifecycle customization capability in Hamilton. This is a very powerful set of features, |
| a few of which are presented below. See the reference for :doc:`../reference/customizing-execution/index` for more information. |
| |
You can mix and match lifecycle capabilities, although some may not be compatible. To do so, pass a variable number of adapters into the ``with_adapters(...)`` function
of the ``driver.Builder``, as sketched below.
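
For example, here is a minimal sketch of passing multiple adapters at once; ``my_module``, ``my_hook``, and ``my_result_builder`` are hypothetical placeholders for your own module and adapters (see the sections below for how to write them):

.. code-block:: python

    from hamilton import driver

    import my_module  # hypothetical module containing your Hamilton functions

    # my_hook and my_result_builder stand in for any lifecycle adapters --
    # see the "Result Builders" and "Execution Hooks" sections below.
    dr = (
        driver.Builder()
        .with_modules(my_module)
        .with_adapters(my_hook, my_result_builder)  # pass any number of adapters
        .build()
    )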
| |
| Result Builders |
| ############### |
| |
In effect, a Result Builder is a class with a static function that takes a dictionary of computed results and turns it into
something else.
| |
| .. code-block:: python |
| |
    import abc
    import typing


    class ResultMixin(object):
        """Base class housing the static function.

        Why a static function? Because certain frameworks can only pickle a static function, not an entire
        object.
        """

        @staticmethod
        @abc.abstractmethod
        def build_result(**outputs: typing.Dict[str, typing.Any]) -> typing.Any:
            """This function builds the result given the computed values."""
            pass
| |
Hamilton comes with a few implementations; see :doc:`../../reference/result-builders/index` for the list. These can be used independently,
or paired with a Graph Adapter -- see the next section for details!
| |
| Graph Adapters |
| ############## |
| |
| Graph Adapters `adapt` the Hamilton DAG, and change how it is executed. They all implement a single interface called |
| ``base.HamiltonGraphAdapter``. They are called internally by Hamilton at the right points in time to make execution |
work. The link with Result Builders is that Graph Adapters (currently) need to implement a ``build_result()`` function
themselves.
| |
| .. code-block:: python |
| |
    from hamilton import lifecycle


    class HamiltonGraphAdapter(lifecycle.ResultBuilder):
        """Any GraphAdapter should implement this interface to adapt the Hamilton graph for that particular context.

        Note: since it inherits ``lifecycle.ResultBuilder``, a HamiltonGraphAdapter needs a ``build_result()`` function too.
        """

        # four functions not shown
| |
Should you find aspects you wish to customize, reach out to the maintainers; the odds are high
that we already have the right abstraction and have not yet exposed it via documentation & examples.
The default behavior (with ``driver.Builder``) is to join all data in a dictionary, with node names as keys and their
results as values; if you bypass ``driver.Builder`` and construct the ``Driver`` directly, it will return a pandas DataFrame.
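
For example, with the default dictionary behavior, requesting two (hypothetical) nodes returns a dictionary keyed by node name:

.. code-block:: python

    # dr is a driver built via driver.Builder(); the node names are hypothetical
    results = dr.execute(["acquisition_cost", "conversion_rate"])
    # results == {"acquisition_cost": ..., "conversion_rate": ...}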
| |
| |
If you want to tell Hamilton to return something else, we suggest writing a simple class that implements the
``lifecycle.ResultBuilder`` interface, and passing an instance into the driver, e.g. via ``with_adapters(result_builder)`` on the builder or as ``adapter=[result_builder]``. See
:doc:`../reference/customizing-execution/index` and
:doc:`../reference/result-builders/index` for options.
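
As an illustration, here is a minimal sketch of a custom Result Builder that returns the outputs as a tuple, ordered by node name; ``TupleResult`` and ``my_module`` are hypothetical:

.. code-block:: python

    import typing

    from hamilton import driver, lifecycle

    import my_module  # hypothetical module containing your Hamilton functions


    class TupleResult(lifecycle.ResultBuilder):
        """Hypothetical builder: returns the computed outputs as a tuple, ordered by node name."""

        def build_result(self, **outputs: typing.Any) -> tuple:
            return tuple(outputs[name] for name in sorted(outputs))


    dr = (
        driver.Builder()
        .with_modules(my_module)
        .with_adapters(TupleResult())
        .build()
    )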
| |
| Execution Hooks |
| ############### |
| |
| You can do anything you want pre/post execution by implementing the interface ``hamilton.lifecycle.NodeExecutionHook``, which |
| gives you two methods: |
| |
| 1. ``run_before_node_execution(node_name, node_tags, node_kwargs, node_return_type, **future_kwargs)`` |
| 2. ``run_after_node_execution(node_name, node_tags, node_kwargs, node_return_type, result, error, success, **future_kwargs)`` |
| |
These both have ``**future_kwargs`` to ensure backwards compatibility. You can use these to do anything you want, including
logging, sending metrics, telemetry, etc. For example, here is a minimal sketch of a hook that logs node execution; the ``LoggingHook`` name is hypothetical:
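
.. code-block:: python

    import logging

    from hamilton import lifecycle

    logger = logging.getLogger(__name__)


    class LoggingHook(lifecycle.NodeExecutionHook):
        """Hypothetical hook that logs before and after every node executes."""

        def run_before_node_execution(self, *, node_name: str, **future_kwargs):
            logger.info("Running node: %s", node_name)

        def run_after_node_execution(self, *, node_name: str, success: bool, **future_kwargs):
            logger.info("Finished node: %s (success=%s)", node_name, success)

You would then pass ``LoggingHook()`` into ``with_adapters(...)`` on the builder. Next, let's quickly walk through some options on how to execute a Hamilton DAG.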
| |
| Dynamic DAGs/Parallel Execution |
| ---------------------------------- |
| |
| Hamilton now has pluggable execution, which allows for the following: |
| |
1. Grouping of nodes into "tasks" (discrete execution units between serialization boundaries)
| 2. Executing the tasks in parallel, using any executor of your choice |
| |
You can enable this execution mode using the ``Builder``, a utility class that allows you to build a driver piece by piece.
Note that you currently have to call ``enable_dynamic_execution(allow_experimental_mode=True)``,
which will toggle the driver to use the ``V2`` executor. Then, you can:

1. Add task executors to specify how to run the tasks
2. Add node grouping strategies
3. Add modules to crawl for functions
4. Add a result builder to shape the results
| |
Either constructing the driver directly, or using the builder and *not* calling ``enable_dynamic_execution``, will give you the standard executor.
We highly recommend you use the builder pattern -- while the constructor of the ``Driver`` will be fully
backwards compatible according to the rules of semantic versioning, we may change it in the future (for 2.0).

Note that the new executor is required to handle dynamic creation of nodes (e.g. using ``Parallelizable[]`` and ``Collect[]``).
| |
| Let's look at an example of the driver: |
| |
| .. code-block:: python |
| |
    from my_code import foo_module  # a hypothetical module containing your Hamilton functions

    from hamilton import driver
    from hamilton.execution import executors

    dr = (
        driver.Builder()
        .with_modules(foo_module)
        .enable_dynamic_execution(allow_experimental_mode=True)
        .with_config({"config_key": "config_value"})
        .with_local_executor(executors.SynchronousLocalTaskExecutor())
        .with_remote_executor(executors.MultiProcessingExecutor(max_tasks=5))
        .build()
    )

    dr.execute(["my_variable"], inputs={...}, overrides={...})
| |
Note that we set both a *remote* executor and a *local* executor. You can bypass these and instead set an ``execution_manager``
in the builder call (see :doc:`../reference/drivers/Driver` for documentation on the ``Builder``). The local/remote split goes along
with the default grouping strategy, which is to place each node in its own group, except for
dynamically generated (``Parallelizable[]``) blocks, which are each made into one group and executed on the remote executor.
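
If you want finer-grained control over which executor handles which task, you can instead construct an execution manager; here is a minimal sketch, assuming the ``DefaultExecutionManager`` from ``hamilton.execution.executors``:

.. code-block:: python

    from hamilton import driver
    from hamilton.execution import executors

    from my_code import foo_module  # the same hypothetical module as above

    # An execution manager decides which executor each task runs on.
    execution_manager = executors.DefaultExecutionManager(
        local_executor=executors.SynchronousLocalTaskExecutor(),
        remote_executor=executors.MultiProcessingExecutor(max_tasks=5),
    )

    dr = (
        driver.Builder()
        .with_modules(foo_module)
        .enable_dynamic_execution(allow_experimental_mode=True)
        .with_execution_manager(execution_manager)
        .build()
    )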
| |
Given the default grouping strategy, when you write a DAG like this (a simple map-reduce pattern):
| |
| .. code-block:: python |
| |
    from hamilton.htypes import Collect, Parallelizable

    def url() -> Parallelizable[str]:
        # expands into one parallel branch per URL; _list_all_urls is a helper, not a node
        for url_ in _list_all_urls():
            yield url_

    def url_loaded(url: str) -> str:
        # loads the contents of a single URL; _load is a helper, not a node
        return _load(url)

    def counts(url_loaded: str) -> int:
        # counts the words in the loaded contents
        return len(url_loaded.split(" "))

    def total_words(counts: Collect[int]) -> int:
        # collects the per-URL counts and sums them
        return sum(counts)
| |
The block containing ``url_loaded`` and ``counts`` will get marked as one task, repeated for each URL yielded by ``url``,
and run on the remote executor (which in this case is the ``MultiProcessingExecutor``).
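
Putting it together with the driver built above, executing the DAG is just a matter of requesting the collected node:

.. code-block:: python

    # "total_words" triggers the whole expand/compute/collect pipeline;
    # results come back as a dictionary keyed by node name.
    total = dr.execute(["total_words"])["total_words"]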
| |
Note that we currently have the following caveats:

1. No nested ``Parallelizable[]``/``Collect[]`` blocks -- we only allow one level of parallelization.
2. Serialization under multiprocessing is suboptimal -- we currently use the default ``pickle`` serializer, which breaks in certain cases. Ray, Dask, etc. all work well, and we plan to add support for joblib + cloudpickle serialization.
3. ``Collect[]`` input types are limited to one per function -- this is another caveat that we intend to get rid of, but for now you'll want to merge everything into one function before collecting, as sketched below.
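
For the third caveat, here is a minimal sketch of the workaround, extending the map-reduce example above with hypothetical ``url_stats``/``all_stats`` nodes -- merge everything you need into one node inside the parallel block, then collect that single node:

.. code-block:: python

    from hamilton.htypes import Collect

    def url_stats(url: str, counts: int) -> dict:
        # merges everything we need about one URL into a single node
        return {"url": url, "count": counts}

    def all_stats(url_stats: Collect[dict]) -> list:
        # collects just the merged node -- only one Collect[] input per function
        return list(url_stats)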