
I'm trying to dump a CSV of a distinct varchar(43) column to a file. The table has around a billion rows, and SELECT DISTINCT either runs out of memory (the command is aborted) or the server just closes all connections. Does PostgreSQL have a built-in method for such a task?

psql (10.9 (Ubuntu 10.9-0ubuntu0.18.10.1)), 16 GB RAM

asked Jan 4, 2020 at 21:14
  • Please show the command you ran, and the full error message it generated. Commented Jan 4, 2020 at 21:21
  • It's just ERROR: out of memory. I ran copy (select distinct x from y) to 'path'; but it's the SELECT DISTINCT part that's causing the problem. Commented Jan 4, 2020 at 21:23
  • Try doing a set enable_hashagg=off first. Inappropriate hash aggregates are what usually cause this, in my experience. The planner should know better than to use them with very large result sizes, but if the estimates are way off it might not realize the result is that large. Commented Jan 4, 2020 at 21:30
  • Can you show us an EXPLAIN SELECT DISTINCT x FROM y? (A way to check the plan is sketched below.) Commented Jan 4, 2020 at 21:31
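
A quick way to check which strategy the planner is choosing for this query (a sketch; the identifiers x and y are taken from the comments above, and the actual plan depends on the table's statistics):

    -- Show the chosen plan without executing the query. A HashAggregate node
    -- would confirm the suspicion above; a Sort plus Unique (or GroupAggregate)
    -- node means the sort-based strategy, which can spill to disk, is used.
    EXPLAIN SELECT DISTINCT x FROM y;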

2 Answers


The solution suggested by jjanes worked: run set enable_hashagg=off in the psql session before the COPY.
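
Roughly, the session looks like this (a minimal sketch; the table, column, and output path are placeholders, and a server-side COPY needs a path the server process can write to):

    -- Disable hash aggregation for this session only, so the planner falls
    -- back to a sort-based DISTINCT that can spill to disk instead of keeping
    -- every distinct value in memory.
    SET enable_hashagg = off;

    -- Server-side COPY writes the distinct values out as CSV.
    COPY (SELECT DISTINCT x FROM y) TO '/tmp/distinct_x.csv' WITH (FORMAT csv);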

answered Jan 5, 2020 at 8:20

You can dump it without DISTINCT and then use the Linux "uniq" command. Just make sure that you have a swap file defined, which is a good idea in any case if your system doesn't use one constantly.
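
A sketch of that approach (database name, identifiers, and output path are placeholders; note that uniq only collapses adjacent duplicate lines, so the data has to be sorted first, and sort -u does both steps while spilling to temporary files on disk rather than RAM):

    # Stream the raw column out of PostgreSQL (one value per line, no DISTINCT)
    # and deduplicate on the client side.
    psql -d mydb -Atc "SELECT x FROM y" | LC_ALL=C sort -u > distinct_x.csv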

answered Jan 5, 2020 at 9:20
