>>>>> "Nicholas" == Nicholas Young <su...@su...> writes: Nicholas> I've attempted to implement this code myself (see Nicholas> attached patch to src/_image.cpp) but I'm not a regular Nicholas> c++ or even c programmer so it's fairly likely there Nicholas> will be memory leaks in the code. For a 1024x2048 array Nicholas> using the GTKAgg backend and with plenty of memory free Nicholas> this change results in show() taking <0.7s rather than Nicholas> >4.6s; if there is a memory shortage and swapping Nicholas> becomes involved the change is much more noticeable. I Nicholas> haven't made any decent Python wrapping code yet - but Nicholas> would be happy do do so if someone familiar with c++ Nicholas> could tidy up my attachment. Hi Nicholas, Thanks for the suggestions and patch. I incorporated frombuffer and have been testing it. I've been testing the performance of frombuffer vs fromarray, and have seen some 2-3x speedups but nothing like the numbers you are reporting. [Also, I don't see any detectable memory leaks so I don't think you have any worries there] Here is the test script I am using - does this look like a fair test? You can uncomment report_memory on unix like systems to get a memory report on each pass through the loop, and switch out fromarray vs frombuffer to compare your function with mine On a related note, below I'm pasting in a representative section the code I am currently using in fromarray for MxNx3 and MxNx4 arrays -- any obvious performance gains to be had here numerix gurus? Another suggestion for Nicholas -- perhaps you want to support MxN, MxNx3 and MxNx4 arrays in your frombuffer function? And a final question -- how are you getting your function into the matplotlib image pipeline. Did you alter the image.py AxesImage.set_data function to test whether A is a buffer object? If so, you might want to post these changes to the codebase as well. // some fromarray code //PyArrayObject *A = (PyArrayObject *) PyArray_ContiguousFromObject(x.ptr(), PyArray_DOUBLE, 2, 3); PyArrayObject *A = (PyArrayObject *) PyArray_FromObject(x.ptr(), PyArray_DOUBLE, 2, 3); int rgba = A->dimensions[2]==4; double r,g,b,alpha; int offset =0; for (size_t rownum=0; rownum<imo->rowsIn; rownum++) { for (size_t colnum=0; colnum<imo->colsIn; colnum++) { offset = rownum*A->strides[0] + colnum*A->strides[1]; r = *(double *)(A->data + offset); g = *(double *)(A->data + offset + A->strides[2] ); b = *(double *)(A->data + offset + 2*A->strides[2] ); if (rgba) alpha = *(double *)(A->data + offset + 3*A->strides[2] ); else alpha = 1.0; *buffer++ = int(255*r); // red *buffer++ = int(255*g); // green *buffer++ = int(255*b); // blue *buffer++ = int(255*alpha); // alpha } } ## ... and here is the profile script .... import sys, os, time, gc from matplotlib._image import fromarray, fromarray2, frombuffer from matplotlib.numerix.mlab import rand from matplotlib.numerix import UInt8 def report_memory(i): pid = os.getpid() a2 = os.popen('ps -p %d -o rss,sz' % pid).readlines() print i, ' ', a2[1], return int(a2[1].split()[1]) N = 1024 #X2 = rand(N,N) #X3 = rand(N,N,3) X4 = rand(N,N,4) start = time.time() b4 = (X4*255).astype(UInt8).tostring() for i in range(50): im = fromarray(X4, 0) #im = frombuffer(b4, N, N, 0) #val = report_memory(i) end = time.time() print 'elapsed: %1.3f'%(end-start)
Hi,

I'm a fairly heavy user of matplotlib (to plot results from plasma
physics simulations), and my use requires the display of fairly large
images. Having done some testing (after bypassing anything slow in the
Python code), I've discovered that for large images, where the image
size approaches the available memory, the main performance bottleneck
seems to be the conversion of the raw data to the _image.Image class.
The way the conversion takes place -- with data being read
non-sequentially from many points in a floating-point source array and
then converted to a 1-byte integer -- is slow, and even slower if
swapping becomes involved.

To overcome this problem, I suggest implementing C++ code to allow the
creation of the image from a buffer (with each rgba pixel as 4 bytes)
rather than from a floating-point array. Where image data is being
generated elsewhere (in my case in Fortran code), it's trivial to
output to a different format; doing so means the input data can be
significantly smaller and that the source array is accessed
sequentially (a compiler is also likely to optimise a copy of this data
more effectively). The image can then be scaled and overplotted like
any existing image.

I've attempted to implement this code myself (see attached patch to
src/_image.cpp) but I'm not a regular C++ or even C programmer, so it's
fairly likely there will be memory leaks in the code. For a 1024x2048
array using the GTKAgg backend and with plenty of memory free, this
change results in show() taking <0.7s rather than >4.6s; if there is a
memory shortage and swapping becomes involved, the change is much more
noticeable. I haven't made any decent Python wrapping code yet, but I'd
be happy to do so if someone familiar with C++ could tidy up my
attachment.

Hope this is useful to others,

Nicholas Young
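On the Python side, the workflow described above might look something
like the sketch below, assuming the frombuffer entry point discussed in
this thread. The file name is made up for illustration, and the image is
kept square because the square test image in the profile script above
doesn't pin down frombuffer's width/height argument order.

## hypothetical caller-side sketch of the buffer-based workflow
from matplotlib._image import frombuffer

N = 1024

# Raw, headerless RGBA dump written sequentially by the simulation
# (e.g. Fortran) code: each pixel is 4 bytes (r, g, b, a), row-major,
# exactly N*N*4 bytes in total.  'field.rgba' is a made-up name.
buf = open('field.rgba', 'rb').read()
assert len(buf) == N * N * 4

# Build the _image.Image directly from the bytes: no floating-point
# source array and no non-sequential per-pixel conversion.
im = frombuffer(buf, N, N, 0)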