
I have a Python program where I continuously read the output of another program launched via subprocess.Popen and connected via subprocess.PIPE.

The problem I am facing is that it sometimes loses a significant portion of the output from the launched program.

For example, monitoring for inotify events via a pipe to inotifywait loses many events.

This is the relevant code:

 import select
 import subprocess

 process = subprocess.Popen(["inotifywait", "-q", "-r", "-m",
                             "--format", "%e:::::%w%f", srcroot],
                            stdout=subprocess.PIPE, stderr=subprocess.PIPE)
 polling = select.poll()
 polling.register(process.stdout)
 process.stdout.flush()
 while True:
     process.stdout.flush()
     if polling.poll(max_seconds * 1000):
         line = process.stdout.readline()
         if len(line) > 0:
             print line[:-1]

Executing the command inotifywait -q -r -m --format %e:::::%w%f /opt/fileserver/ > /tmp/log1 and moving some files around (to generate inotify events) gives a file with more than 8000 lines. On the other hand, running my script with ./pscript.py > /tmp/log2 gives a file with only about 5000 lines.

Anand S Kumar
asked Aug 2, 2015 at 7:06
  • Try getting the line from stderr as well, and printing it, to check whether the lost data is actually there: print process.stderr.read() Commented Aug 2, 2015 at 7:29
  • Unfortunately the above example was somewhat simplified; I was already checking stderr. Thank you anyway. Commented Aug 2, 2015 at 10:41

1 Answer


You're ignoring stderr completely in your example. Try to create the process like this:

process = subprocess.Popen(["inotifywait", "-q", "-r", "-m",
                            "--format", "%e:::::%w%f", srcroot],
                           stdout=subprocess.PIPE, stderr=subprocess.STDOUT)
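With stderr=subprocess.STDOUT both streams share one pipe, so a full, unread stderr buffer can no longer stall the child. A minimal Python 3 sketch of reading the merged output, using a throwaway child process (the child command here is purely illustrative):

```python
import subprocess
import sys

# Spawn a child that writes to both stdout and stderr.
proc = subprocess.Popen(
    [sys.executable, "-c",
     "import sys; print('out'); print('err', file=sys.stderr)"],
    stdout=subprocess.PIPE,
    stderr=subprocess.STDOUT,  # merge stderr into the stdout pipe
    text=True)

# Iterating over proc.stdout reads the merged stream line by line;
# both the stdout and the stderr line arrive on the same pipe.
lines = [line.rstrip("\n") for line in proc.stdout]
proc.wait()
```

The relative ordering of the two lines is not guaranteed (it depends on the child's buffering), but neither line can be silently dropped.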

Furthermore, I'd use inotify directly with one of its Python bindings rather than spawning a process with inotifywait.
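For illustration, a hedged sketch of what that could look like with the third-party pyinotify binding (the format string mirrors the one from the question; pyinotify must be installed separately, and this loop blocks forever):

```python
import pyinotify

srcroot = "/opt/fileserver"  # path taken from the question

class Handler(pyinotify.ProcessEvent):
    def process_default(self, event):
        # Print events in the same EVENT:::::path shape as the question.
        print("%s:::::%s" % (event.maskname, event.pathname))

wm = pyinotify.WatchManager()
notifier = pyinotify.Notifier(wm, Handler())
# rec=True walks existing subdirectories; auto_add=True also watches
# directories created after startup.
wm.add_watch(srcroot, pyinotify.ALL_EVENTS, rec=True, auto_add=True)
notifier.loop()
```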

answered Aug 2, 2015 at 10:13

3 Comments

Unfortunately the above example was somewhat simplified; I was already checking stderr. Moreover, I cannot use pyinotify due to its slow performance (I tried it; it is OK for a few thousand files, but in my case it cannot even create a sufficient number of watches)... Thank you anyway.
You don't have to set thousands of watches. You set a watch per folder you want to monitor, and the application will call your callback. If you need to watch subfolders as well, you just need to specify it when you create the watch.
At a high level, yes. But at a low level, pyinotify needs to establish a watch for each distinct file/directory. This process is time-consuming and has significant scalability problems. Pure C implementations are much faster, mainly because all the code is compiled.
