Window function ignore nulls not working in Databricks

I am new to Databricks and need to port some Snowflake code to it.

The Snowflake table, query, and output look like this:

table:

id	col1	hn
ee1	null	1
ee1	null	2
ee1	test	3
ee1	test	4
ee1	test2	5

Query used:

SELECT ID,
 FIRST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn) AS first_value,
 LAST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn) AS last_value
FROM table

Output:

id	first_value	last_value
ee1	test	test2
ee1	test	test2
ee1	test	test2
ee1	test	test2
ee1	test	test2

When I tried the same query in Databricks using Spark SQL, ignore nulls did not work properly.

Can anyone provide the equivalent query for this in Databricks?

edited Oct 12, 2023 at 15:16 by Mike Walton

asked Oct 12, 2023 at 15:12 by VarYaz

To handle null value comparisons, you can refer to stackoverflow.com/questions/70394130/…

– Karthikeyan Rasipalay Durairaj, commented Oct 12, 2023 at 15:33

1 Answer

The key point is the window frame specification:

SELECT ID, 
 FIRST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn 
 ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS first_value, 
 LAST_VALUE(col1) ignore nulls OVER (PARTITION BY ID ORDER BY hn 
 ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS last_value 
FROM table;

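If the frame is not defined explicitly, the default with an ORDER BY is RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW, so each row only sees values up to and including its own position. Snowflake, by contrast, defaults FIRST_VALUE and LAST_VALUE to the whole partition, which is why the same query returns different results there.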
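To see why the frame matters, here is a small pure-Python simulation of the two frames on the sample data from the question. It is only an illustration, not how Spark or Snowflake evaluate windows; the helper names (`first_non_null`, `last_non_null`, `window`) are made up for this sketch, and it assumes the `hn` values are unique, so RANGE and ROWS up to the current row cover the same rows.

```python
# Illustration only: simulates FIRST_VALUE / LAST_VALUE ... IGNORE NULLS
# over one partition under two different window frames.

rows = [  # (id, col1, hn) from the question, already ordered by hn
    ("ee1", None, 1),
    ("ee1", None, 2),
    ("ee1", "test", 3),
    ("ee1", "test", 4),
    ("ee1", "test2", 5),
]


def first_non_null(values):
    """First non-null value in the frame, or None if all are null."""
    return next((v for v in values if v is not None), None)


def last_non_null(values):
    """Last non-null value in the frame, or None if all are null."""
    return next((v for v in reversed(values) if v is not None), None)


def window(rows, whole_partition):
    """Return (first_value, last_value) per row for the chosen frame."""
    col1 = [r[1] for r in rows]
    out = []
    for i in range(len(rows)):
        # whole_partition=True : UNBOUNDED PRECEDING .. UNBOUNDED FOLLOWING
        # whole_partition=False: UNBOUNDED PRECEDING .. CURRENT ROW
        frame = col1 if whole_partition else col1[: i + 1]
        out.append((first_non_null(frame), last_non_null(frame)))
    return out


default_frame = window(rows, whole_partition=False)
full_frame = window(rows, whole_partition=True)

for label, result in (("default frame", default_frame), ("full frame", full_frame)):
    print(label)
    for (id_, _, hn), (fv, lv) in zip(rows, result):
        print(f"  {id_} hn={hn} first_value={fv} last_value={lv}")
```

With the frame that stops at the current row, the rows with hn = 1 and hn = 2 get null for both columns, and last_value only reaches test2 on the final row. With the full-partition frame, every row gets test and test2, matching the Snowflake output in the question.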

answered Oct 12, 2023 at 15:20 by Lukasz Szozda
