The VARIANCE function calculates the statistical variance of the specified expression. It measures how far a set of numbers are spread out from their arithmetic mean.
VARIANCE(<expr>)
| Parameter | Description |
|---|---|
<expr> | The column or expression to calculate variance for, supports type Double. |
Returns a Double value representing the calculated variance. If there is no valid data in the group, returns NULL.
-- Create sample table CREATE TABLE student_scores ( student_id INT, score DECIMAL(4,1) ) DISTRIBUTED BY HASH(student_id) PROPERTIES ( "replication_num" = "1" ); -- Insert test data INSERT INTO student_scores VALUES (1, 85.5), (2, 92.0), (3, 78.5), (4, 88.0), (5, 95.5), (6, 82.0), (7, 90.0), (8, 87.5); -- Calculate variance of student scores SELECT VARIANCE(score) as score_variance FROM student_scores;
+-------------------+ | score_variance | +-------------------+ | 25.73437499999998 | +-------------------+