When administering Linux systems I often find myself struggling to track down the culprit after a partition goes full. I normally use du / | sort -nr, but on a large filesystem this takes a long time before any results are returned. Also, while this is usually successful in highlighting the worst offender, I've often found myself resorting to du without the sort in more subtle cases and then had to trawl through the output.
I'd prefer a command-line solution which relies on standard Linux commands, since I have to administer quite a few systems and installing new software is a hassle (especially when out of disk space!).
40 Answers
Try ncdu, an excellent command-line disk usage analyser.
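A typical invocation might look like this (the path is just an example; -x keeps the scan on a single filesystem). Once the scan finishes you can browse the directories interactively, sorted by size:
ncdu -x /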
- Typically, I hate being asked to install something to solve a simple issue, but this is just great. – jds, Jul 5, 2016
- sudo apt install ncdu on Ubuntu gets it easily. It's great. – Orion Edwards, Jul 19, 2017
- You quite probably know which filesystem is short of space. In which case you can use ncdu -x to only count files and directories on the same filesystem as the directory being scanned. – Luke Cousins, Jul 21, 2017
- Best answer. Also: sudo ncdu -rx / should give a clean read on the biggest dirs/files ONLY on the root drive. (-r = read-only, -x = stay on the same filesystem, i.e. do not traverse other filesystem mounts.) – B. Shea, Sep 21, 2017
- I have so little space that I can't install ncdu. – Chris, Jun 14, 2018
Don't go straight to du /. Use df to find the partition that's hurting you, and then try du commands.
One I like to try is:
# U.S. and other locales that use a decimal point
du -h <dir> | grep '[0-9\.]\+G'
# Locales that use a decimal comma
du -h <dir> | grep '[0-9,円]\+G'
because it prints sizes in "human readable form". Unless you've got really small partitions, grepping for directories in the gigabytes is a pretty good filter for what you want. This will take you some time, but unless you have quotas set up, I think that's just the way it's going to be.
As @jchavannes points out in the comments, the expression can get more precise if you're finding too many false positives. I incorporated the suggestion, which does make it better, but there are still false positives, so there are just tradeoffs (a simpler expression gives worse results; a more complex, longer expression gives better results). If you have too many little directories showing up in your output, adjust your regex accordingly. For example, grep '^\s*[0-9\.]\+G' is even more accurate (no directories under 1 GB will be listed).
If you do have quotas, you can use quota -v to find users that are hogging the disk.
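Putting those steps together, a session might look something like this (/var and the gigabyte threshold are only examples):
df -h                              # spot the partition that is nearly full
du -h /var | grep '[0-9\.]\+G'     # then hunt for gigabyte-sized directories on it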
- This is very quick, simple and practical. – zzapper, Oct 29, 2012
- grep '[0-9]G' contained a lot of false positives and also omitted any decimals. This worked better for me: sudo du -h / | grep -P '^[0-9\.]+G' – jchavannes, Aug 14, 2014
- @jchavannes -P is unnecessary for this expression because there's nothing specific to Perl there. Also, -P isn't portable to systems that don't have the GNU implementation. – Ben Collins, Aug 14, 2014
- In case you have really big directories, you'll want [GT] instead of just G. – Vitruvie, Mar 28, 2015
- I like to use du -h | sort -hr | head – augurar, Jun 13, 2016
For a first look, use the "summary" view of du:
du -s /*
The effect is to print the size of each of its arguments, i.e. every root folder in the case above.
Furthermore, both GNU du and BSD du can be depth-restricted (but POSIX du cannot!):
GNU (Linux, ...): du --max-depth 3
BSD (macOS, ...): du -d 3
This will limit the output display to depth 3. The calculated and displayed size is still the total of the full depth, of course. But despite this, restricting the display depth drastically speeds up the calculation.
Another helpful option is -h (works on both GNU and BSD but, once again, not on POSIX-only du) for "human-readable" output (i.e. using KiB, MiB, etc.).
- If du complains about -d, try --max-depth 5 instead. – ReactiveRaven, Jul 2, 2013
- Great answer. Seems correct for me. I suggest du -hcd 1 /directory: -h for human readable, c for total and d for depth. – Thales Ceolin, Feb 4, 2014
- I use du -hd 1 <folder to inspect> | sort -hr | head – jonathanccalixto, Jan 10, 2017
- du --max-depth 5 -h /* 2>&1 | grep '[0-9\.]\+G' | sort -hr | head to filter out the "Permission denied" messages. – srghma, Sep 1, 2017
You can also run the following command using du:
~# du -Pshx /* 2>/dev/null
- The -s option summarizes and displays a total for each argument.
- -h prints human-readable sizes (MiB, GiB, etc.).
- -x = stay on one filesystem (very useful).
- -P = don't follow symlinks (which could cause files to be counted twice, for instance).
Be careful with -x, which will not show the /root directory if it is on a different filesystem. In that case, you have to run du -Pshx /root 2>/dev/null to show it (I once struggled for a while because I hadn't realised that my /root directory had gone full).
- du -Pshx .* * 2>/dev/null + hidden/system directories – Mykhaylo Adamovych, Feb 15, 2016
- /root/ shows without issues. Why would it not be shown? – Atralb, Jan 20, 2020
Finding the biggest files on the filesystem is always going to take a long time. By definition you have to traverse the whole filesystem looking for big files. The only solution is probably to run a cron job on all your systems to have the file ready ahead of time.
One other thing: the -x option of du is useful to keep du from following mount points into other filesystems, i.e.:
du -x [path]
The full command I usually run is:
sudo du -xm / | sort -rn > usage.txt
The -m means return results in megabytes, and sort -rn will sort the results largest number first. You can then open usage.txt in an editor, and the biggest folders (starting with /) will be at the top.
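If you want the report waiting for you, as suggested above, a root crontab entry along these lines would do it (the schedule and output path are merely examples):
# refresh the usage report every night at 03:00
0 3 * * * du -xm / 2>/dev/null | sort -rn > /var/tmp/usage.txt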
- Thanks for pointing out the -x flag! – SamB, Jun 2, 2010
- "Finding the biggest takes a long time..." - well, it depends, but I tend to disagree: it doesn't take that long with utilities like ncdu - at least quicker than du or find (depending on depth and arguments). – B. Shea, Sep 21, 2017
- Since I prefer not to be root, I had to adapt where the file is written: sudo du -xm / | sort -rn > ~/usage.txt – Bruno, Sep 14, 2018
I use this for the top 25 worst offenders below the current directory
# -S to not include subdir size, sorted and limited to top 25
du -S . | sort -nr | head -25
- This command did the trick to find a hidden folder that seemed to be increasing in size over time. Thanks! – thegreendroid, Jun 20, 2013
- By default, on my system, 'du -S' gives a nice human readable output. You get a plain number of bytes for small files, then a number with a 'KB' or 'MB' suffix for bigger files. – serg10, Sep 17, 2014
- @Siddhartha If you add -h, it will likely change the effect of the sort -nr command - meaning the sort will no longer work, and then the head command will also no longer work. – Clare Macrae, Dec 4, 2017
- On Ubuntu, I need to use -h with du for human readable numbers, as well as sort -h for human-numeric sort. The list is sorted in reverse, so either use tail or change the order. – oarfish, Aug 30, 2018
I always use du -sm * | sort -n, which gives you a sorted list of how much the subdirectories of the current working directory use up, in mebibytes.
You can also try Konqueror, which has a "size view" mode, which is similar to what WinDirStat does on Windows: it gives you a visual representation of which files/directories use up most of your space.
Update: on more recent versions, you can also use du -sh * | sort -h, which will show human-readable file sizes and sort by those (numbers will be suffixed with K, M, G, ...).
People looking for an alternative to KDE3's Konqueror file size view may take a look at Filelight, though it's not quite as nice.
- That's only Konqueror 3.x though - the file size view still hasn't been ported to KDE4. – flussence, Feb 9, 2009
- 'du -sh * | sort -h' works perfectly on my Linux (CentOS distro) box. Thanks! – pahariayogi, Apr 25, 2017
At a previous company we used to have a cron job that was run overnight and identified any files over a certain size, e.g.
find / -size +10000k
You may want to be more selective about the directories that you are searching, and watch out for any remotely mounted drives which might go offline.
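A more selective variant that also stays off other (possibly remote) mounts might look like this (the paths and size threshold are only examples; -printf assumes GNU find):
find /home /var -xdev -type f -size +100M -printf '%s\t%p\n' 2>/dev/null | sort -nr | head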
- You can use the -xdev option of find to make sure you don't find files on other devices than the start point of your find command. This fixes the remotely mounted drives issue. – rjmunro, Jun 29, 2015
I use du -ch --max-depth=2 . and I change the max-depth to suit my needs. The "c" option prints totals for the folders and the "h" option prints the sizes in K, M, or G as appropriate. As others have said, it still scans all the directories, but it limits the output in a way that makes it easier to find the large directories.
One option would be to run your du/sort command as a cron job, and output to a file, so it's already there when you need it.
For the command line I think the du/sort method is the best. If you're not on a server, you should take a look at Baobab - Disk Usage Analyzer. This program also takes some time to run, but you can easily find the subdirectory deep, deep down where all the old Linux ISOs are.
- It can also scan remote folders via SSH, FTP, SMB and WebDAV. – Colonel Sponsz, Dec 2, 2008
- This is great. Some things just work better with a GUI to visualize them, and this is one of them! I need an X server on my server anyways for CrashPlan, so it works on that too. – timelmer, Jun 25, 2016
I'm going to second xdiskusage. But I'm going to add in the note that it is actually a du frontend and can read the du output from a file. So you can run du -ax /home > ~/home-du on your server, scp the file back, and then analyze it graphically. Or pipe it through ssh.
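Concretely, the file-based workflow described above could look like this (the hostname and paths are placeholders):
ssh user@server 'du -ax /home > ~/home-du'   # generate the du listing on the server
scp user@server:home-du .                    # copy it back to your workstation
xdiskusage home-du                           # browse the listing graphically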
Maybe worth noting that mc (Midnight Commander, a classic text-mode file manager) by default shows only the size of the directory inodes (usually 4096), but with Ctrl-Space or via the Tools menu you can see the space occupied by the selected directory in a human-readable format (e.g., something like 103151M).
For instance, mc can show the full size of the vanilla TeX Live distributions of 2018 and 2017, while for the 2015 and 2016 versions it would show only the size of the inode (even though they are really nearly 5 GB each).
That is, Ctrl-Space must be done one directory at a time, only for the current directory level, but it is so fast and handy when you are navigating with mc that maybe you will not need ncdu (which, indeed, is better for this particular purpose). Otherwise, you can also run ncdu from mc, without exiting mc or launching another terminal.
- Related question: serverfault.com/questions/246840/… – StackzOfZtuff, Dec 9, 2022
Try feeding the output of du into a simple awk script that checks to see if the size of the directory is larger than some threshold, and prints it if so. You don't have to wait for the entire tree to be traversed before you start getting info (unlike many of the other answers).
For example, the following displays any directories that consume more than about 500 MB.
du -kx / | awk '{ if (1ドル > 500000) { print 0ドル} }'
To make the above a little more reusable, you can define a function in your .bashrc (or you could make it into a standalone script).
dubig() {
[ -z "1ドル" ] && echo "usage: dubig sizethreshMB [dir]" && return
du -kx 2ドル | awk '{ if (1ドル > '1ドル'*1024) { print 0ドル} }'
}
So dubig 200 ~/ looks under the home directory (without crossing onto other filesystems) for directories that use more than 200 MB.
- It's a pity that a dozen grep hacks are more upvoted. Oh, and du -k will make it absolutely certain that du is using KB units. – ndemou, Nov 23, 2016
- Good idea about the -k. Edited. – Mark Borgerding, Nov 24, 2016
- Even simpler and more robust: du -kx 2ドル | awk '1ドル>'$((1ドル*1024)) (if you specify only a condition, aka pattern, to awk, the default action is print 0ドル). – dave_thompson_085, Nov 27, 2016
- Good point @dave_thompson_085. That's true for all versions of awk I know of (net/free-BSD & GNU). @mark-borgerding, this means you can greatly simplify your first example to just du -kx / | awk '1ドル > 500000' – ndemou, Dec 13, 2016
- @mark-borgerding: If you have just a few kBytes left somewhere, you can also keep the whole output of du like this: du -kx / | tee /tmp/du.log | awk '1ドル > 500000'. This is very helpful because if your first filtering turns out to be fruitless you can try other values like awk '1ドル > 200000' /tmp/du.log or inspect the complete output with sort -nr /tmp/du.log | less without re-scanning the whole filesystem. – ndemou, Dec 13, 2016
From the terminal, you can get a visual representation of disk usage with dutree. It is very fast and light because it is implemented in Rust.
$ dutree -h
Usage: dutree [options] <path> [<path>..]
Options:
-d, --depth [DEPTH] show directories up to depth N (def 1)
-a, --aggr [N[KMG]] aggregate smaller than N B/KiB/MiB/GiB (def 1M)
-s, --summary equivalent to -da, or -d1 -a1M
-u, --usage report real disk usage instead of file size
-b, --bytes print sizes in bytes
-f, --files-only skip directories for a fast local overview
-x, --exclude NAME exclude matching files or directories
-H, --no-hidden exclude hidden files
-A, --ascii ASCII characters only, no colors
-h, --help show help
-v, --version print version number
See all the usage details on the website.
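As a quick illustration of the options listed above (the path and thresholds are just examples):
dutree -d 2 -a 500M /var     # two levels deep, aggregating anything under 500 MiB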
- You should mention that you're the author of the tool. – bfontaine, Feb 24, 2023
- Seems your site is down? github.com/nachoparker/dutree – yurenchen, Apr 4, 2023
I prefer to use the following to get an overview and drill down from there...
cd /folder_to_check
du -shx */
This will display results in human-readable output such as GB, MB. It will also prevent traversing through remote filesystems. The -s option only shows a summary of each folder found, so you can drill down further if interested in more details of a folder. Keep in mind that this solution will only show folders, so you will want to omit the / after the asterisk if you want files too.
Not mentioned here, but you should also check lsof in case of deleted/hanging files. I had a 5.9 GB deleted tmp file from a runaway cron job.
https://serverfault.com/questions/207100/how-can-i-find-phantom-storage-usage helped me find the process owner of said file (cron); I was then able to go to /proc/{cron id}/fd/{file handle #}, less the file in question to see where the runaway output started, resolve that, and then echo "" > file to clear up the space and let cron gracefully close itself up.
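A sketch of that lsof check (lsof's +L1 lists open files with a link count below one, i.e. deleted but still held open; <pid> is a placeholder for the process you find):
lsof +L1                 # show deleted-but-open files and the processes holding them
ls -l /proc/<pid>/fd     # then inspect that process's file descriptors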
I like the good old xdiskusage as a graphical alternative to du(1).
- Note this part of the question: "I'd prefer a command line solution which relies on standard Linux commands since..." – ndemou, Jul 4, 2017
You can use standard tools like find and sort to analyze your disk space usage.
List directories sorted by their size:
find / -mount -type d -exec du -s "{}" \; | sort -n
List files sorted by their size:
find / -mount -printf "%k\t%p\n" | sort -n
- I find this to be the best answer for detecting the largest items in sorted order. – vimal krishna, Oct 1, 2017
I have used this command to find files bigger than 100 MB:
find / -size +100M -exec ls -l {} \;
Still here? Or perhaps this answer has been upvoted...
While there are various graphical tools described in other answers, they don't do much to address the underlying issue of identifying how you may be able to free up space.
I am currently researching the same issue and came across agedu, which reports on access times as well as size. I've not had a chance to play with it yet - it's written by Simon Tatham (you may have heard of PuTTY) so it is probably sensible/reliable.
However, like all the tools listed here, it collects data on demand. Even the most efficient code on the fastest hardware will take time to walk a multi-terabyte filesystem.
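If agedu does fit, its basic workflow is a scan followed by a report, roughly like this (the path is an example; check the man page for the exact options):
agedu -s /home     # scan and build the index file
agedu -w           # then serve an interactive report over HTTP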
- If you can't use a GUI (like you're on a remote server), ncdu -e works nicely. Once the display opens up, use m then M to display and sort by mtime, while the (admittedly small) percentage graph is still there to give you an idea of the size. – user339730, Aug 24, 2019
- "If you can't use a GUI (like you're on a remote server)" - why does a remote server prevent you from using a GUI? – symcbean, Aug 24, 2019
- ncdu -e is wrong because it requires an argument – Dennis, Jan 17, 2021
For the command line, du (and its options) seems to be the best way. DiskHog looks like it uses du/df info from a cron job too, so Peter's suggestion is probably the best combination of simple and effective.
- There are tons of user-friendly options for the terminal that are way more effective than du scripting. The author of ncdu lists a long list (ncdu, gdu, dua-cli, ...) at the bottom of the ncdu homepage: dev.yorhel.nl/ncdu – oligofren, Dec 26, 2022
First, I check the size of directories, like so:
du -sh /var/cache/*/
Here is a tiny app that uses deep sampling to find tumors in any disk or directory. It walks the directory tree twice, once to measure it, and the second time to print out the paths to 20 "random" bytes under the directory.
#!/usr/bin/env python3
# Deep-sampling scan: pass 1 measures the total size, pass 2 prints the path
# owning every (total/20)-th byte, so the biggest space hogs show up most often.
import os
import sys

def walk(sdir, ipass, state, step):
    # state[0] = bytes seen so far (n), state[1] = next sample point (n1)
    try:
        entries = sorted(os.listdir(sdir))
    except OSError:
        return
    for name in entries:                          # recurse into subdirectories first
        path = os.path.join(sdir, name)
        if os.path.isdir(path) and not os.path.islink(path):
            walk(path, ipass, state, step)
    for name in entries:                          # then account for the files
        path = os.path.join(sdir, name)
        if not os.path.isfile(path) or os.path.islink(path):
            continue
        try:
            length = os.path.getsize(path)
        except OSError:
            continue
        if ipass == 2:
            while state[1] <= state[0] + length:  # this file contains a sample point
                print(path)
                state[1] += step
        state[0] += length

def dscan(root="."):
    state = [0, 0]
    walk(root, 1, state, 0)                       # pass 1: measure
    total = state[0]
    print(total)
    step = total // 20 or 1
    state = [0, step // 2]
    walk(root, 2, state, step)                    # pass 2: print ~20 sampled paths
    print(state[0])

if __name__ == "__main__":
    dscan(sys.argv[1] if len(sys.argv) > 1 else ".")
The output looks like this for my Program Files directory:
7,908,634,694
.\ArcSoft\PhotoStudio 2000\Samples3円.jpg
.\Common Files\Java\Update\Base Images\j2re1.4.2-b28\core1.zip
.\Common Files\Wise Installation Wizard\WISDED53B0BB67C4244AE6AD6FD3C28D1EF_7_0_2_7.MSI
.\Insightful\splus62\java\jre\lib\jaws.jar
.\Intel\Compiler\Fortran9円.1\em64t\bin\tselect.exe
.\Intel\Download\IntelFortranProCompiler91\Compiler\Itanium\Data1.cab
.\Intel\MKL8円.0.1\em64t\bin\mkl_lapack32.dll
.\Java\jre1.6.0\bin\client\classes.jsa
.\Microsoft SQL Server90円\Setup Bootstrap\sqlsval.dll
.\Microsoft Visual Studio\DF98\DOC\TAPI.CHM
.\Microsoft Visual Studio .NET 2003\CompactFrameworkSDK\v1.0.5000\Windows CE\sqlce20sql2ksp1.exe
.\Microsoft Visual Studio .NET 2003\SDK\v1.1\Tool Developers Guide\docs\Partition II Metadata.doc
.\Microsoft Visual Studio .NET 2003\Visual Studio .NET Enterprise Architect 2003 - English\Logs\VSMsiLog0A34.txt
.\Microsoft Visual Studio 8\Microsoft Visual Studio 2005 Professional Edition - ENU\Logs\VSMsiLog1A9E.txt
.\Microsoft Visual Studio 8\SmartDevices\SDK\CompactFramework2円.0\v2.0\WindowsCE\wce500\mipsiv\NETCFv2.wce5.mipsiv.cab
.\Microsoft Visual Studio 8\VC\ce\atlmfc\lib\armv4i\UafxcW.lib
.\Microsoft Visual Studio 8\VC\ce\Dll\mipsii\mfc80ud.pdb
.\Movie Maker\MUI0409円\moviemk.chm
.\TheCompany\TheProduct\docs\TheProduct User's Guide.pdf
.\VNI\CTT6.0\help\StatV1.pdf
7,908,634,694
It tells me that the directory is 7.9 GB, of which
- ~15% goes to the Intel Fortran compiler
- ~15% goes to VS .NET 2003
- ~20% goes to VS 8
It is simple enough to ask if any of these can be unloaded.
It also points out file types that are distributed across the file system but, taken together, represent an opportunity for space saving:
- ~15% roughly goes to .cab and .MSI files
- ~10% roughly goes to logging text files
It shows plenty of other things in there also, that I could probably do without, like "SmartDevices" and "ce" support (~15%).
It does take linear time, but it doesn't have to be done often.
Examples of things it has found:
- backup copies of DLLs in many saved code repositories, that don't really need to be saved
- a backup copy of someone's hard drive on the server, under an obscure directory
- voluminous temporary internet files
- ancient doc and help files long past being needed
If you know that the large files have been added in the last few days (say, 3), then you can use a find command in conjunction with "ls -lart" to discover those recently added files:
find /some/dir -type f -mtime -3 -exec ls -lart {} \;
This will give you just the files ("-type f"), not directories; just the files with a modification time within the last 3 days ("-mtime -3"); and it will execute "ls -lart" against each found file (the "-exec" part).
To understand disproportionate disk space usage, it's often useful to start at the root directory and walk down through some of its largest children.
We can do this by
- saving the output of du into a file
- grepping through the result iteratively
That is:
# sum up the size of all files and directories under the root filesystem
du -a -h -x / > disk_usage.txt
# display the size of root items
grep $'\t/[^/]*$' disk_usage.txt
now let's say /usr appears too large
# display the size of /usr items
grep $'\t/usr/[^/]*$' disk_usage.txt
now if /usr/local is suspiciously large
# display the size /usr/local items
grep $'\t/usr/local/[^/]*$' disk_usage.txt
and so on...
I had a similar issue, but the answers on this page weren't enough. I found the following command to be the most useful for the listing:
du -a / | sort -n -r | head -n 20
Which would show me the 20 biggest offenders. However, even though I ran this, it did not show me the real issue, because I had already deleted the file. The catch was that there was a process still running that was referencing the deleted log file... so I had to kill that process first, and then the disk space showed up as free.
- Good point, but this should be a comment and not an answer by itself - this question suffers from too many answers. – ndemou, Jun 9, 2017
Another one is duc, sort of a collection of command line tools which are indeed scalable, fast and versatile. It also features some GUI/TUI options.
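By way of illustration, duc separates indexing from querying; a session could look like this (the paths are examples; see the duc documentation for details):
duc index /usr       # build the usage database
duc ls -Fg /usr      # list it with an inline size graph
duc ui /usr          # or browse it in the ncurses UI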
I've had success tracking down the worst offender(s) by piping the du output in human readable form to egrep and matching against a regular expression.
For example:
du -h | egrep "[0-9]+G.*|[5-9][0-9][0-9]M.*"
which should give you back everything 500 megs or higher.
- Don't use grep for arithmetic operations - use awk instead: du -k | awk '1ドル > 500000'. It is much easier to understand, edit and get correct on the first try. – ndemou, Jul 4, 2017
If you want speed, you can enable quotas on the filesystems you want to monitor (you need not set quotas for any user), and use a script that uses the quota command to list the disk space being used by each user. For instance:
quota -v $user | grep $filesystem | awk '{ print 2ドル }'
would give you the disk usage in blocks for the particular user on the particular filesystem. You should be able to check usages in a matter of seconds this way.
To enable quotas you will need to add usrquota to the filesystem options in your /etc/fstab file and then probably reboot so that quotacheck can be run on an idle filesystem before quotaon is called.
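A minimal sketch of that setup, assuming an ext4 /home and the standard quota tools (the device and mount point are placeholders):
# /etc/fstab entry with user quotas enabled
UUID=xxxx-xxxx  /home  ext4  defaults,usrquota  0  2

# after remounting (or rebooting), build the quota files and turn quotas on
quotacheck -cu /home
quotaon /home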