| <table class="table"> |
| <thead> |
| <tr> |
| <th style="width:25%">Function</th> |
| <th>Description</th> |
| </tr> |
| </thead> |
| <tbody> |
| <tr> |
| <td>aggregate(expr, start, merge, finish)</td> |
| <td>Applies a binary operator to an initial state and all |
| elements in the array, and reduces this to a single state. The final state is converted |
| into the final result by applying a finish function.</td> |
| </tr> |
| <tr> |
| <td>array_sort(expr, func)</td> |
<td>Sorts the input array. If func is omitted, the array is sorted
in ascending order; the elements of the input array must be orderable.
For double/float types, NaN is considered greater than any non-NaN element.
Null elements are placed at the end of the returned array.
Since 3.0.0 this function can also sort and return the array based on a
given comparator function. The comparator takes two arguments representing
two elements of the array.
It returns a negative integer, 0, or a positive integer as the first element is less than,
equal to, or greater than the second element. If the comparator function returns null,
the function fails and raises an error.</td>
| </tr> |
| <tr> |
| <td>cardinality(expr)</td> |
<td>Returns the size of an array or a map.
The function returns -1 for null input only if spark.sql.ansi.enabled is false and
spark.sql.legacy.sizeOfNull is true; otherwise it returns null for null input,
which is also the behavior under the default settings.</td>
| </tr> |
| <tr> |
| <td>concat(col1, col2, ..., colN)</td> |
| <td>Returns the concatenation of col1, col2, ..., colN.</td> |
| </tr> |
| <tr> |
| <td>element_at(array, index)</td> |
<td>Returns the element of the array at the given (1-based) index. If index is 0,
Spark throws an error. If index &lt; 0, it accesses elements from the last to the first.
The function returns NULL if the index exceeds the length of the array and
spark.sql.ansi.enabled is set to false.
If spark.sql.ansi.enabled is set to true, it throws ArrayIndexOutOfBoundsException
for invalid indices.</td>
| </tr> |
| <tr> |
<td>element_at(map, key)</td>
<td>Returns the value for the given key. The function returns NULL if the key is not
contained in the map.</td>
| </tr> |
| <tr> |
| <td>exists(expr, pred)</td> |
| <td>Tests whether a predicate holds for one or more elements in the array.</td> |
| </tr> |
| <tr> |
| <td>filter(expr, func)</td> |
| <td>Filters the input array using the given predicate.</td> |
| </tr> |
| <tr> |
| <td>forall(expr, pred)</td> |
| <td>Tests whether a predicate holds for all elements in the array.</td> |
| </tr> |
| <tr> |
| <td>map_filter(expr, func)</td> |
| <td>Filters entries in a map using the function.</td> |
| </tr> |
| <tr> |
| <td>map_zip_with(map1, map2, function)</td> |
<td>Merges two given maps into a single map by applying the
function to the pair of values with the same key. For keys present in only one map,
NULL is passed as the value for the missing key. If an input map contains duplicate
keys, only the first entry for the duplicate key is passed into the lambda function.</td>
| </tr> |
| <tr> |
| <td>reduce(expr, start, merge, finish)</td> |
| <td>Applies a binary operator to an initial state and all |
| elements in the array, and reduces this to a single state. The final state is converted |
| into the final result by applying a finish function.</td> |
| </tr> |
| <tr> |
| <td>reverse(array)</td> |
<td>Returns a reversed string, or an array with its elements in reverse order.</td>
| </tr> |
| <tr> |
| <td>size(expr)</td> |
<td>Returns the size of an array or a map.
The function returns -1 for null input only if spark.sql.ansi.enabled is false and
spark.sql.legacy.sizeOfNull is true; otherwise it returns null for null input,
which is also the behavior under the default settings.</td>
| </tr> |
| <tr> |
| <td>transform(expr, func)</td> |
| <td>Transforms elements in an array using the function.</td> |
| </tr> |
| <tr> |
| <td>transform_keys(expr, func)</td> |
<td>Transforms keys in a map using the function.</td>
| </tr> |
| <tr> |
| <td>transform_values(expr, func)</td> |
| <td>Transforms values in the map using the function.</td> |
| </tr> |
| <tr> |
| <td>try_element_at(array, index)</td> |
<td>Returns the element of the array at the given (1-based) index. If index is 0,
Spark throws an error. If index &lt; 0, it accesses elements from the last to the first.
The function always returns NULL if the index exceeds the length of the array.</td>
| </tr> |
| <tr> |
<td>try_element_at(map, key)</td>
<td>Returns the value for the given key. The function always returns NULL
if the key is not contained in the map.</td>
| </tr> |
| <tr> |
| <td>zip_with(left, right, func)</td> |
<td>Merges the two given arrays, element-wise, into a single array using the function. If one array is shorter, nulls are appended at the end to match the length of the longer array before the function is applied.</td>
| </tr> |
| </tbody> |
| </table> |
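To make the fold and zip semantics above concrete, here is a minimal plain-Python sketch (not Spark code) that mimics the behavior of `aggregate`/`reduce` and `zip_with` as described in the table; `None` stands in for SQL NULL, and the function names are local illustrations, not PySpark APIs:

```python
from itertools import zip_longest

def aggregate(arr, start, merge, finish=lambda state: state):
    # Mimics aggregate()/reduce(): fold all elements into a single
    # state with the binary merge operator, then apply finish.
    state = start
    for element in arr:
        state = merge(state, element)
    return finish(state)

def zip_with(left, right, func):
    # Mimics zip_with(): combine the arrays element-wise; the shorter
    # array is padded with None (NULL) to the longer array's length.
    return [func(a, b) for a, b in zip_longest(left, right)]

# Sum the elements starting from 0, then double the final state.
print(aggregate([1, 2, 3], 0, lambda acc, x: acc + x, lambda s: s * 2))  # 12

# The shorter array is padded with None before func is applied.
print(zip_with([1, 2], [10, 20, 30], lambda a, b: (a or 0) + b))  # [11, 22, 30]
```

In Spark SQL itself the merge and finish arguments are lambda expressions, e.g. `SELECT aggregate(array(1, 2, 3), 0, (acc, x) -> acc + x, acc -> acc * 2)`.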