docs/source/user-guide/sql/window_functions.md - datafusion-sandbox - Git at Google

 <!---
   Licensed to the Apache Software Foundation (ASF) under one
   or more contributor license agreements.  See the NOTICE file
   distributed with this work for additional information
   regarding copyright ownership.  The ASF licenses this file
   to you under the Apache License, Version 2.0 (the
   "License"); you may not use this file except in compliance
   with the License.  You may obtain a copy of the License at

     http://www.apache.org/licenses/LICENSE-2.0

   Unless required by applicable law or agreed to in writing,
   software distributed under the License is distributed on an
   "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
   KIND, either express or implied.  See the License for the
   specific language governing permissions and limitations
   under the License.
 -->

 <!---
 This file was generated by the dev/update_function_docs.sh script.
 Do not edit it manually as changes will be overwritten.
 Instead, edit the WindowUDFImpl's documentation() function to
 update documentation for an individual UDF or the
 dev/update_function_docs.sh file for updating surrounding text.
 -->

 # Window Functions

 A _window function_ performs a calculation across a set of table rows that are somehow related to the current row.
 This is comparable to the type of calculation that can be done with an aggregate function.
 However, window functions do not cause rows to become grouped into a single output row like non-window aggregate calls would.
 Instead, the rows retain their separate identities. Behind the scenes, the window function is able to access more than just the current row of the query result

 Here is an example that shows how to compare each employee's salary with the average salary in his or her department:

 ```sql
 SELECT depname, empno, salary, avg(salary) OVER (PARTITION BY depname) FROM empsalary;

 +-----------+-------+--------+-------------------+
 | depname   | empno | salary | avg               |
 +-----------+-------+--------+-------------------+
 | personnel | 2     | 3900   | 3700.0            |
 | personnel | 5     | 3500   | 3700.0            |
 | develop   | 8     | 6000   | 5020.0            |
 | develop   | 10    | 5200   | 5020.0            |
 | develop   | 11    | 5200   | 5020.0            |
 | develop   | 9     | 4500   | 5020.0            |
 | develop   | 7     | 4200   | 5020.0            |
 | sales     | 1     | 5000   | 4866.666666666667 |
 | sales     | 4     | 4800   | 4866.666666666667 |
 | sales     | 3     | 4800   | 4866.666666666667 |
 +-----------+-------+--------+-------------------+
 ```

 A window function call always contains an OVER clause directly following the window function's name and argument(s). This is what syntactically distinguishes it from a normal function or non-window aggregate. The OVER clause determines exactly how the rows of the query are split up for processing by the window function. The PARTITION BY clause within OVER divides the rows into groups, or partitions, that share the same values of the PARTITION BY expression(s). For each row, the window function is computed across the rows that fall into the same partition as the current row. The previous example showed how to count the average of a column per partition.

 You can also control the order in which rows are processed by window functions using ORDER BY within OVER. (The window ORDER BY does not even have to match the order in which the rows are output.) Here is an example:

 ```sql
 SELECT depname, empno, salary,
        rank() OVER (PARTITION BY depname ORDER BY salary DESC)
 FROM empsalary;

 +-----------+-------+--------+--------+
 | depname   | empno | salary | rank   |
 +-----------+-------+--------+--------+
 | personnel | 2     | 3900   | 1      |
 | develop   | 8     | 6000   | 1      |
 | develop   | 10    | 5200   | 2      |
 | develop   | 11    | 5200   | 2      |
 | develop   | 9     | 4500   | 4      |
 | develop   | 7     | 4200   | 5      |
 | sales     | 1     | 5000   | 1      |
 | sales     | 4     | 4800   | 2      |
 | personnel | 5     | 3500   | 2      |
 | sales     | 3     | 4800   | 2      |
 +-----------+-------+--------+--------+
 ```

 There is another important concept associated with window functions: for each row, there is a set of rows within its partition called its window frame. Some window functions act only on the rows of the window frame, rather than of the whole partition. Here is an example of using window frames in queries:

 ```sql
 SELECT depname, empno, salary,
     avg(salary) OVER(ORDER BY salary ASC ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS avg,
     min(salary) OVER(ORDER BY empno ASC ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS cum_min
 FROM empsalary
 ORDER BY empno ASC;

 +-----------+-------+--------+--------------------+---------+
 | depname   | empno | salary | avg                | cum_min |
 +-----------+-------+--------+--------------------+---------+
 | sales     | 1     | 5000   | 5000.0             | 5000    |
 | personnel | 2     | 3900   | 3866.6666666666665 | 3900    |
 | sales     | 3     | 4800   | 4700.0             | 3900    |
 | sales     | 4     | 4800   | 4866.666666666667  | 3900    |
 | personnel | 5     | 3500   | 3700.0             | 3500    |
 | develop   | 7     | 4200   | 4200.0             | 3500    |
 | develop   | 8     | 6000   | 5600.0             | 3500    |
 | develop   | 9     | 4500   | 4500.0             | 3500    |
 | develop   | 10    | 5200   | 5133.333333333333  | 3500    |
 | develop   | 11    | 5200   | 5466.666666666667  | 3500    |
 +-----------+-------+--------+--------------------+---------+
 ```

 When a query involves multiple window functions, it is possible to write out each one with a separate OVER clause, but this is duplicative and error-prone if the same windowing behavior is wanted for several functions. Instead, each windowing behavior can be named in a WINDOW clause and then referenced in OVER. For example:

 ```sql
 SELECT sum(salary) OVER w, avg(salary) OVER w
 FROM empsalary
 WINDOW w AS (PARTITION BY depname ORDER BY salary DESC);
 ```

 ## Syntax

 The syntax for the OVER-clause is

 ```sql
 function([expr])
   OVER(
     [PARTITION BY expr[, …]]
     [ORDER BY expr [ ASC | DESC ][, …]]
     [ frame_clause ]
     )
 ```

 where **frame_clause** is one of:

 ```sql
   { RANGE | ROWS | GROUPS } frame_start
   { RANGE | ROWS | GROUPS } BETWEEN frame_start AND frame_end
 ```

 and **frame_start** and **frame_end** can be one of

 ```sql
 UNBOUNDED PRECEDING
 offset PRECEDING
 CURRENT ROW
 offset FOLLOWING
 UNBOUNDED FOLLOWING
 ```

 where **offset** is an non-negative integer.

 RANGE and GROUPS modes require an ORDER BY clause (with RANGE the ORDER BY must specify exactly one column).

 ## Filter clause for aggregate window functions

 Aggregate window functions support the SQL `FILTER (WHERE ...)` clause to include only rows that satisfy the predicate from the window frame in the aggregation.

 ```sql
 sum(salary) FILTER (WHERE salary > 0)
   OVER (PARTITION BY depname ORDER BY salary ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW)
 ```

 If no rows in the frame satisfy the filter for a given output row, `COUNT` yields `0` while `SUM`/`AVG`/`MIN`/`MAX` yield `NULL`.

 ## Aggregate functions

 All [aggregate functions](aggregate_functions.md) can be used as window functions.

 ## Ranking Functions

 - [cume_dist](#cume_dist)
 - [dense_rank](#dense_rank)
 - [ntile](#ntile)
 - [percent_rank](#percent_rank)
 - [rank](#rank)
 - [row_number](#row_number)

 ### `cume_dist`

 Relative rank of the current row: (number of rows preceding or peer with the current row) / (total rows).

 ```sql
 cume_dist()
 ```

 #### Example

 ```sql
 -- Example usage of the cume_dist window function:
 SELECT salary,
     cume_dist() OVER (ORDER BY salary) AS cume_dist
 FROM employees;

 +--------+-----------+
 | salary | cume_dist |
 +--------+-----------+
 | 30000  | 0.33      |
 | 50000  | 0.67      |
 | 70000  | 1.00      |
 +--------+-----------+
 ```

 ### `dense_rank`

 Returns the rank of the current row without gaps. This function ranks rows in a dense manner, meaning consecutive ranks are assigned even for identical values.

 ```sql
 dense_rank()
 ```

 #### Example

 ```sql
 -- Example usage of the dense_rank window function:
 SELECT department,
     salary,
     dense_rank() OVER (PARTITION BY department ORDER BY salary DESC) AS dense_rank
 FROM employees;

 +-------------+--------+------------+
 | department  | salary | dense_rank |
 +-------------+--------+------------+
 | Sales       | 70000  | 1          |
 | Sales       | 50000  | 2          |
 | Sales       | 50000  | 2          |
 | Sales       | 30000  | 3          |
 | Engineering | 90000  | 1          |
 | Engineering | 80000  | 2          |
 +-------------+--------+------------+
 ```

 ### `ntile`

 Integer ranging from 1 to the argument value, dividing the partition as equally as possible

 ```sql
 ntile(expression)
 ```

 #### Arguments

 - **expression**: An integer describing the number groups the partition should be split into

 #### Example

 ```sql
 -- Example usage of the ntile window function:
 SELECT employee_id,
     salary,
     ntile(4) OVER (ORDER BY salary DESC) AS quartile
 FROM employees;

 +-------------+--------+----------+
 | employee_id | salary | quartile |
 +-------------+--------+----------+
 | 1           | 90000  | 1        |
 | 2           | 85000  | 1        |
 | 3           | 80000  | 2        |
 | 4           | 70000  | 2        |
 | 5           | 60000  | 3        |
 | 6           | 50000  | 3        |
 | 7           | 40000  | 4        |
 | 8           | 30000  | 4        |
 +-------------+--------+----------+
 ```

 ### `percent_rank`

 Returns the percentage rank of the current row within its partition. The value ranges from 0 to 1 and is computed as `(rank - 1) / (total_rows - 1)`.

 ```sql
 percent_rank()
 ```

 #### Example

 ```sql
     -- Example usage of the percent_rank window function:
 SELECT employee_id,
     salary,
     percent_rank() OVER (ORDER BY salary) AS percent_rank
 FROM employees;

 +-------------+--------+---------------+
 | employee_id | salary | percent_rank  |
 +-------------+--------+---------------+
 | 1           | 30000  | 0.00          |
 | 2           | 50000  | 0.50          |
 | 3           | 70000  | 1.00          |
 +-------------+--------+---------------+
 ```

 ### `rank`

 Returns the rank of the current row within its partition, allowing gaps between ranks. This function provides a ranking similar to `row_number`, but skips ranks for identical values.

 ```sql
 rank()
 ```

 #### Example

 ```sql
 -- Example usage of the rank window function:
 SELECT department,
     salary,
     rank() OVER (PARTITION BY department ORDER BY salary DESC) AS rank
 FROM employees;

 +-------------+--------+------+
 | department  | salary | rank |
 +-------------+--------+------+
 | Sales       | 70000  | 1    |
 | Sales       | 50000  | 2    |
 | Sales       | 50000  | 2    |
 | Sales       | 30000  | 4    |
 | Engineering | 90000  | 1    |
 | Engineering | 80000  | 2    |
 +-------------+--------+------+
 ```

 ### `row_number`

 Number of the current row within its partition, counting from 1.

 ```sql
 row_number()
 ```

 #### Example

 ```sql
 -- Example usage of the row_number window function:
 SELECT department,
   salary,
   row_number() OVER (PARTITION BY department ORDER BY salary DESC) AS row_num
 FROM employees;

 +-------------+--------+---------+
 | department  | salary | row_num |
 +-------------+--------+---------+
 | Sales       | 70000  | 1       |
 | Sales       | 50000  | 2       |
 | Sales       | 50000  | 3       |
 | Sales       | 30000  | 4       |
 | Engineering | 90000  | 1       |
 | Engineering | 80000  | 2       |
 +-------------+--------+---------+
 ```

 ## Analytical Functions

 - [first_value](#first_value)
 - [lag](#lag)
 - [last_value](#last_value)
 - [lead](#lead)
 - [nth_value](#nth_value)

 ### `first_value`

 Returns value evaluated at the row that is the first row of the window frame.

 ```sql
 first_value(expression)
 ```

 #### Arguments

 - **expression**: Expression to operate on

 #### Example

 ```sql
 -- Example usage of the first_value window function:
 SELECT department,
   employee_id,
   salary,
   first_value(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS top_salary
 FROM employees;

 +-------------+-------------+--------+------------+
 | department  | employee_id | salary | top_salary |
 +-------------+-------------+--------+------------+
 | Sales       | 1           | 70000  | 70000      |
 | Sales       | 2           | 50000  | 70000      |
 | Sales       | 3           | 30000  | 70000      |
 | Engineering | 4           | 90000  | 90000      |
 | Engineering | 5           | 80000  | 90000      |
 +-------------+-------------+--------+------------+
 ```

 ### `lag`

 Returns value evaluated at the row that is offset rows before the current row within the partition; if there is no such row, instead return default (which must be of the same type as value).

 ```sql
 lag(expression, offset, default)
 ```

 #### Arguments

 - **expression**: Expression to operate on
 - **offset**: Integer. Specifies how many rows back the value of expression should be retrieved. Defaults to 1.
 - **default**: The default value if the offset is not within the partition. Must be of the same type as expression.

 #### Example

 ```sql
 -- Example usage of the lag window function:
 SELECT employee_id,
     salary,
     lag(salary, 1, 0) OVER (ORDER BY employee_id) AS prev_salary
 FROM employees;

 +-------------+--------+-------------+
 | employee_id | salary | prev_salary |
 +-------------+--------+-------------+
 | 1           | 30000  | 0           |
 | 2           | 50000  | 30000       |
 | 3           | 70000  | 50000       |
 | 4           | 60000  | 70000       |
 +-------------+--------+-------------+
 ```

 ### `last_value`

 Returns value evaluated at the row that is the last row of the window frame.

 ```sql
 last_value(expression)
 ```

 #### Arguments

 - **expression**: Expression to operate on

 #### Example

 ```sql
 -- SQL example of last_value:
 SELECT department,
        employee_id,
        salary,
        last_value(salary) OVER (PARTITION BY department ORDER BY salary) AS running_last_salary
 FROM employees;

 +-------------+-------------+--------+---------------------+
 | department  | employee_id | salary | running_last_salary |
 +-------------+-------------+--------+---------------------+
 | Sales       | 1           | 30000  | 30000               |
 | Sales       | 2           | 50000  | 50000               |
 | Sales       | 3           | 70000  | 70000               |
 | Engineering | 4           | 40000  | 40000               |
 | Engineering | 5           | 60000  | 60000               |
 +-------------+-------------+--------+---------------------+
 ```

 ### `lead`

 Returns value evaluated at the row that is offset rows after the current row within the partition; if there is no such row, instead return default (which must be of the same type as value).

 ```sql
 lead(expression, offset, default)
 ```

 #### Arguments

 - **expression**: Expression to operate on
 - **offset**: Integer. Specifies how many rows forward the value of expression should be retrieved. Defaults to 1.
 - **default**: The default value if the offset is not within the partition. Must be of the same type as expression.

 #### Example

 ```sql
 -- Example usage of lead window function:
 SELECT
     employee_id,
     department,
     salary,
     lead(salary, 1, 0) OVER (PARTITION BY department ORDER BY salary) AS next_salary
 FROM employees;

 +-------------+-------------+--------+--------------+
 | employee_id | department  | salary | next_salary  |
 +-------------+-------------+--------+--------------+
 | 1           | Sales       | 30000  | 50000        |
 | 2           | Sales       | 50000  | 70000        |
 | 3           | Sales       | 70000  | 0            |
 | 4           | Engineering | 40000  | 60000        |
 | 5           | Engineering | 60000  | 0            |
 +-------------+-------------+--------+--------------+
 ```

 ### `nth_value`

 Returns the value evaluated at the nth row of the window frame (counting from 1). Returns NULL if no such row exists.

 ```sql
 nth_value(expression, n)
 ```

 #### Arguments

 - **expression**: The column from which to retrieve the nth value.
 - **n**: Integer. Specifies the row number (starting from 1) in the window frame.

 #### Example

 ```sql
 -- Sample employees table:
 CREATE TABLE employees (id INT, salary INT);
 INSERT INTO employees (id, salary) VALUES
 (1, 30000),
 (2, 40000),
 (3, 50000),
 (4, 60000),
 (5, 70000);

 -- Example usage of nth_value:
 SELECT nth_value(salary, 2) OVER (
   ORDER BY salary
   ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
 ) AS nth_value
 FROM employees;

 +-----------+
 | nth_value |
 +-----------+
 | 40000     |
 | 40000     |
 | 40000     |
 | 40000     |
 | 40000     |
 +-----------+
 ```
	<!---
	Licensed to the Apache Software Foundation (ASF) under one
	or more contributor license agreements. See the NOTICE file
	distributed with this work for additional information
	regarding copyright ownership. The ASF licenses this file
	to you under the Apache License, Version 2.0 (the
	"License"); you may not use this file except in compliance
	with the License. You may obtain a copy of the License at

	http://www.apache.org/licenses/LICENSE-2.0

	Unless required by applicable law or agreed to in writing,
	software distributed under the License is distributed on an
	"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
	KIND, either express or implied. See the License for the
	specific language governing permissions and limitations
	under the License.
	-->

	<!---
	This file was generated by the dev/update_function_docs.sh script.
	Do not edit it manually as changes will be overwritten.
	Instead, edit the WindowUDFImpl's documentation() function to
	update documentation for an individual UDF or the
	dev/update_function_docs.sh file for updating surrounding text.
	-->

	# Window Functions

	A _window function_ performs a calculation across a set of table rows that are somehow related to the current row.
	This is comparable to the type of calculation that can be done with an aggregate function.
	However, window functions do not cause rows to become grouped into a single output row like non-window aggregate calls would.
	Instead, the rows retain their separate identities. Behind the scenes, the window function is able to access more than just the current row of the query result

	Here is an example that shows how to compare each employee's salary with the average salary in his or her department:

	```sql
	SELECT depname, empno, salary, avg(salary) OVER (PARTITION BY depname) FROM empsalary;

	+-----------+-------+--------+-------------------+
	\| depname \| empno \| salary \| avg \|
	+-----------+-------+--------+-------------------+
	\| personnel \| 2 \| 3900 \| 3700.0 \|
	\| personnel \| 5 \| 3500 \| 3700.0 \|
	\| develop \| 8 \| 6000 \| 5020.0 \|
	\| develop \| 10 \| 5200 \| 5020.0 \|
	\| develop \| 11 \| 5200 \| 5020.0 \|
	\| develop \| 9 \| 4500 \| 5020.0 \|
	\| develop \| 7 \| 4200 \| 5020.0 \|
	\| sales \| 1 \| 5000 \| 4866.666666666667 \|
	\| sales \| 4 \| 4800 \| 4866.666666666667 \|
	\| sales \| 3 \| 4800 \| 4866.666666666667 \|
	+-----------+-------+--------+-------------------+
	```

	A window function call always contains an OVER clause directly following the window function's name and argument(s). This is what syntactically distinguishes it from a normal function or non-window aggregate. The OVER clause determines exactly how the rows of the query are split up for processing by the window function. The PARTITION BY clause within OVER divides the rows into groups, or partitions, that share the same values of the PARTITION BY expression(s). For each row, the window function is computed across the rows that fall into the same partition as the current row. The previous example showed how to count the average of a column per partition.

	You can also control the order in which rows are processed by window functions using ORDER BY within OVER. (The window ORDER BY does not even have to match the order in which the rows are output.) Here is an example:

	```sql
	SELECT depname, empno, salary,
	rank() OVER (PARTITION BY depname ORDER BY salary DESC)
	FROM empsalary;

	+-----------+-------+--------+--------+
	\| depname \| empno \| salary \| rank \|
	+-----------+-------+--------+--------+
	\| personnel \| 2 \| 3900 \| 1 \|
	\| develop \| 8 \| 6000 \| 1 \|
	\| develop \| 10 \| 5200 \| 2 \|
	\| develop \| 11 \| 5200 \| 2 \|
	\| develop \| 9 \| 4500 \| 4 \|
	\| develop \| 7 \| 4200 \| 5 \|
	\| sales \| 1 \| 5000 \| 1 \|
	\| sales \| 4 \| 4800 \| 2 \|
	\| personnel \| 5 \| 3500 \| 2 \|
	\| sales \| 3 \| 4800 \| 2 \|
	+-----------+-------+--------+--------+
	```

	There is another important concept associated with window functions: for each row, there is a set of rows within its partition called its window frame. Some window functions act only on the rows of the window frame, rather than of the whole partition. Here is an example of using window frames in queries:

	```sql
	SELECT depname, empno, salary,
	avg(salary) OVER(ORDER BY salary ASC ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS avg,
	min(salary) OVER(ORDER BY empno ASC ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS cum_min
	FROM empsalary
	ORDER BY empno ASC;

	+-----------+-------+--------+--------------------+---------+
	\| depname \| empno \| salary \| avg \| cum_min \|
	+-----------+-------+--------+--------------------+---------+
	\| sales \| 1 \| 5000 \| 5000.0 \| 5000 \|
	\| personnel \| 2 \| 3900 \| 3866.6666666666665 \| 3900 \|
	\| sales \| 3 \| 4800 \| 4700.0 \| 3900 \|
	\| sales \| 4 \| 4800 \| 4866.666666666667 \| 3900 \|
	\| personnel \| 5 \| 3500 \| 3700.0 \| 3500 \|
	\| develop \| 7 \| 4200 \| 4200.0 \| 3500 \|
	\| develop \| 8 \| 6000 \| 5600.0 \| 3500 \|
	\| develop \| 9 \| 4500 \| 4500.0 \| 3500 \|
	\| develop \| 10 \| 5200 \| 5133.333333333333 \| 3500 \|
	\| develop \| 11 \| 5200 \| 5466.666666666667 \| 3500 \|
	+-----------+-------+--------+--------------------+---------+
	```

	When a query involves multiple window functions, it is possible to write out each one with a separate OVER clause, but this is duplicative and error-prone if the same windowing behavior is wanted for several functions. Instead, each windowing behavior can be named in a WINDOW clause and then referenced in OVER. For example:

	```sql
	SELECT sum(salary) OVER w, avg(salary) OVER w
	FROM empsalary
	WINDOW w AS (PARTITION BY depname ORDER BY salary DESC);
	```

	## Syntax

	The syntax for the OVER-clause is

	```sql
	function([expr])
	OVER(
	[PARTITION BY expr[, …]]
	[ORDER BY expr [ ASC \| DESC ][, …]]
	[ frame_clause ]
	)
	```

	where frame_clause is one of:

	```sql
	{ RANGE \| ROWS \| GROUPS } frame_start
	{ RANGE \| ROWS \| GROUPS } BETWEEN frame_start AND frame_end
	```

	and frame_start and frame_end can be one of

	```sql
	UNBOUNDED PRECEDING
	offset PRECEDING
	CURRENT ROW
	offset FOLLOWING
	UNBOUNDED FOLLOWING
	```

	where offset is an non-negative integer.

	RANGE and GROUPS modes require an ORDER BY clause (with RANGE the ORDER BY must specify exactly one column).

	## Filter clause for aggregate window functions

	Aggregate window functions support the SQL `FILTER (WHERE ...)` clause to include only rows that satisfy the predicate from the window frame in the aggregation.

	```sql
	sum(salary) FILTER (WHERE salary > 0)
	OVER (PARTITION BY depname ORDER BY salary ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW)
	```

	If no rows in the frame satisfy the filter for a given output row, `COUNT` yields `0` while `SUM`/`AVG`/`MIN`/`MAX` yield `NULL`.

	## Aggregate functions

	All [aggregate functions](aggregate_functions.md) can be used as window functions.

	## Ranking Functions

	- [cume_dist](#cume_dist)
	- [dense_rank](#dense_rank)
	- [ntile](#ntile)
	- [percent_rank](#percent_rank)
	- [rank](#rank)
	- [row_number](#row_number)

	### `cume_dist`

	Relative rank of the current row: (number of rows preceding or peer with the current row) / (total rows).

	```sql
	cume_dist()
	```

	#### Example

	```sql
	-- Example usage of the cume_dist window function:
	SELECT salary,
	cume_dist() OVER (ORDER BY salary) AS cume_dist
	FROM employees;

	+--------+-----------+
	\| salary \| cume_dist \|
	+--------+-----------+
	\| 30000 \| 0.33 \|
	\| 50000 \| 0.67 \|
	\| 70000 \| 1.00 \|
	+--------+-----------+
	```

	### `dense_rank`

	Returns the rank of the current row without gaps. This function ranks rows in a dense manner, meaning consecutive ranks are assigned even for identical values.

	```sql
	dense_rank()
	```

	#### Example

	```sql
	-- Example usage of the dense_rank window function:
	SELECT department,
	salary,
	dense_rank() OVER (PARTITION BY department ORDER BY salary DESC) AS dense_rank
	FROM employees;

	+-------------+--------+------------+
	\| department \| salary \| dense_rank \|
	+-------------+--------+------------+
	\| Sales \| 70000 \| 1 \|
	\| Sales \| 50000 \| 2 \|
	\| Sales \| 50000 \| 2 \|
	\| Sales \| 30000 \| 3 \|
	\| Engineering \| 90000 \| 1 \|
	\| Engineering \| 80000 \| 2 \|
	+-------------+--------+------------+
	```

	### `ntile`

	Integer ranging from 1 to the argument value, dividing the partition as equally as possible

	```sql
	ntile(expression)
	```

	#### Arguments

	- expression: An integer describing the number groups the partition should be split into

	#### Example

	```sql
	-- Example usage of the ntile window function:
	SELECT employee_id,
	salary,
	ntile(4) OVER (ORDER BY salary DESC) AS quartile
	FROM employees;

	+-------------+--------+----------+
	\| employee_id \| salary \| quartile \|
	+-------------+--------+----------+
	\| 1 \| 90000 \| 1 \|
	\| 2 \| 85000 \| 1 \|
	\| 3 \| 80000 \| 2 \|
	\| 4 \| 70000 \| 2 \|
	\| 5 \| 60000 \| 3 \|
	\| 6 \| 50000 \| 3 \|
	\| 7 \| 40000 \| 4 \|
	\| 8 \| 30000 \| 4 \|
	+-------------+--------+----------+
	```

	### `percent_rank`

	Returns the percentage rank of the current row within its partition. The value ranges from 0 to 1 and is computed as `(rank - 1) / (total_rows - 1)`.

	```sql
	percent_rank()
	```

	#### Example

	```sql
	-- Example usage of the percent_rank window function:
	SELECT employee_id,
	salary,
	percent_rank() OVER (ORDER BY salary) AS percent_rank
	FROM employees;

	+-------------+--------+---------------+
	\| employee_id \| salary \| percent_rank \|
	+-------------+--------+---------------+
	\| 1 \| 30000 \| 0.00 \|
	\| 2 \| 50000 \| 0.50 \|
	\| 3 \| 70000 \| 1.00 \|
	+-------------+--------+---------------+
	```

	### `rank`

	Returns the rank of the current row within its partition, allowing gaps between ranks. This function provides a ranking similar to `row_number`, but skips ranks for identical values.

	```sql
	rank()
	```

	#### Example

	```sql
	-- Example usage of the rank window function:
	SELECT department,
	salary,
	rank() OVER (PARTITION BY department ORDER BY salary DESC) AS rank
	FROM employees;

	+-------------+--------+------+
	\| department \| salary \| rank \|
	+-------------+--------+------+
	\| Sales \| 70000 \| 1 \|
	\| Sales \| 50000 \| 2 \|
	\| Sales \| 50000 \| 2 \|
	\| Sales \| 30000 \| 4 \|
	\| Engineering \| 90000 \| 1 \|
	\| Engineering \| 80000 \| 2 \|
	+-------------+--------+------+
	```

	### `row_number`

	Number of the current row within its partition, counting from 1.

	```sql
	row_number()
	```

	#### Example

	```sql
	-- Example usage of the row_number window function:
	SELECT department,
	salary,
	row_number() OVER (PARTITION BY department ORDER BY salary DESC) AS row_num
	FROM employees;

	+-------------+--------+---------+
	\| department \| salary \| row_num \|
	+-------------+--------+---------+
	\| Sales \| 70000 \| 1 \|
	\| Sales \| 50000 \| 2 \|
	\| Sales \| 50000 \| 3 \|
	\| Sales \| 30000 \| 4 \|
	\| Engineering \| 90000 \| 1 \|
	\| Engineering \| 80000 \| 2 \|
	+-------------+--------+---------+
	```

	## Analytical Functions

	- [first_value](#first_value)
	- [lag](#lag)
	- [last_value](#last_value)
	- [lead](#lead)
	- [nth_value](#nth_value)

	### `first_value`

	Returns value evaluated at the row that is the first row of the window frame.

	```sql
	first_value(expression)
	```

	#### Arguments

	- expression: Expression to operate on

	#### Example

	```sql
	-- Example usage of the first_value window function:
	SELECT department,
	employee_id,
	salary,
	first_value(salary) OVER (PARTITION BY department ORDER BY salary DESC) AS top_salary
	FROM employees;

	+-------------+-------------+--------+------------+
	\| department \| employee_id \| salary \| top_salary \|
	+-------------+-------------+--------+------------+
	\| Sales \| 1 \| 70000 \| 70000 \|
	\| Sales \| 2 \| 50000 \| 70000 \|
	\| Sales \| 3 \| 30000 \| 70000 \|
	\| Engineering \| 4 \| 90000 \| 90000 \|
	\| Engineering \| 5 \| 80000 \| 90000 \|
	+-------------+-------------+--------+------------+
	```

	### `lag`

	Returns value evaluated at the row that is offset rows before the current row within the partition; if there is no such row, instead return default (which must be of the same type as value).

	```sql
	lag(expression, offset, default)
	```

	#### Arguments

	- expression: Expression to operate on
	- offset: Integer. Specifies how many rows back the value of expression should be retrieved. Defaults to 1.
	- default: The default value if the offset is not within the partition. Must be of the same type as expression.

	#### Example

	```sql
	-- Example usage of the lag window function:
	SELECT employee_id,
	salary,
	lag(salary, 1, 0) OVER (ORDER BY employee_id) AS prev_salary
	FROM employees;

	+-------------+--------+-------------+
	\| employee_id \| salary \| prev_salary \|
	+-------------+--------+-------------+
	\| 1 \| 30000 \| 0 \|
	\| 2 \| 50000 \| 30000 \|
	\| 3 \| 70000 \| 50000 \|
	\| 4 \| 60000 \| 70000 \|
	+-------------+--------+-------------+
	```

	### `last_value`

	Returns value evaluated at the row that is the last row of the window frame.

	```sql
	last_value(expression)
	```

	#### Arguments

	- expression: Expression to operate on

	#### Example

	```sql
	-- SQL example of last_value:
	SELECT department,
	employee_id,
	salary,
	last_value(salary) OVER (PARTITION BY department ORDER BY salary) AS running_last_salary
	FROM employees;

	+-------------+-------------+--------+---------------------+
	\| department \| employee_id \| salary \| running_last_salary \|
	+-------------+-------------+--------+---------------------+
	\| Sales \| 1 \| 30000 \| 30000 \|
	\| Sales \| 2 \| 50000 \| 50000 \|
	\| Sales \| 3 \| 70000 \| 70000 \|
	\| Engineering \| 4 \| 40000 \| 40000 \|
	\| Engineering \| 5 \| 60000 \| 60000 \|
	+-------------+-------------+--------+---------------------+
	```

	### `lead`

	Returns value evaluated at the row that is offset rows after the current row within the partition; if there is no such row, instead return default (which must be of the same type as value).

	```sql
	lead(expression, offset, default)
	```

	#### Arguments

	- expression: Expression to operate on
	- offset: Integer. Specifies how many rows forward the value of expression should be retrieved. Defaults to 1.
	- default: The default value if the offset is not within the partition. Must be of the same type as expression.

	#### Example

	```sql
	-- Example usage of lead window function:
	SELECT
	employee_id,
	department,
	salary,
	lead(salary, 1, 0) OVER (PARTITION BY department ORDER BY salary) AS next_salary
	FROM employees;

	+-------------+-------------+--------+--------------+
	\| employee_id \| department \| salary \| next_salary \|
	+-------------+-------------+--------+--------------+
	\| 1 \| Sales \| 30000 \| 50000 \|
	\| 2 \| Sales \| 50000 \| 70000 \|
	\| 3 \| Sales \| 70000 \| 0 \|
	\| 4 \| Engineering \| 40000 \| 60000 \|
	\| 5 \| Engineering \| 60000 \| 0 \|
	+-------------+-------------+--------+--------------+
	```

	### `nth_value`

	Returns the value evaluated at the nth row of the window frame (counting from 1). Returns NULL if no such row exists.

	```sql
	nth_value(expression, n)
	```

	#### Arguments

	- expression: The column from which to retrieve the nth value.
	- n: Integer. Specifies the row number (starting from 1) in the window frame.

	#### Example

	```sql
	-- Sample employees table:
	CREATE TABLE employees (id INT, salary INT);
	INSERT INTO employees (id, salary) VALUES
	(1, 30000),
	(2, 40000),
	(3, 50000),
	(4, 60000),
	(5, 70000);

	-- Example usage of nth_value:
	SELECT nth_value(salary, 2) OVER (
	ORDER BY salary
	ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
	) AS nth_value
	FROM employees;

	+-----------+
	\| nth_value \|
	+-----------+
	\| 40000 \|
	\| 40000 \|
	\| 40000 \|
	\| 40000 \|
	\| 40000 \|
	+-----------+
	```