matplotlib

matplotlib-devel Mailing List for matplotlib

Brought to you by: cjgohlke, dsdale, efiring, heeres, and 8 others

matplotlib-devel — matplotlib developers

You can subscribe to this list here.

2003	_Jan	_Feb	_Mar	_Apr	_May	_Jun	_Jul	_Aug	_Sep	_Oct (1)	_Nov (33)	_Dec (20)
2004	_Jan (7)	_Feb (44)	_Mar (51)	_Apr (43)	_May (43)	_Jun (36)	_Jul (61)	_Aug (44)	_Sep (25)	_Oct (82)	_Nov (97)	_Dec (47)
2005	_Jan (77)	_Feb (143)	_Mar (42)	_Apr (31)	_May (93)	_Jun (93)	_Jul (35)	_Aug (78)	_Sep (56)	_Oct (44)	_Nov (72)	_Dec (75)
2006	_Jan (116)	_Feb (99)	_Mar (181)	_Apr (171)	_May (112)	_Jun (86)	_Jul (91)	_Aug (111)	_Sep (77)	_Oct (72)	_Nov (57)	_Dec (51)
2007	_Jan (64)	_Feb (116)	_Mar (70)	_Apr (74)	_May (53)	_Jun (40)	_Jul (519)	_Aug (151)	_Sep (132)	_Oct (74)	_Nov (282)	_Dec (190)
2008	_Jan (141)	_Feb (67)	_Mar (69)	_Apr (96)	_May (227)	_Jun (404)	_Jul (399)	_Aug (96)	_Sep (120)	_Oct (205)	_Nov (126)	_Dec (261)
2009	_Jan (136)	_Feb (136)	_Mar (119)	_Apr (124)	_May (155)	_Jun (98)	_Jul (136)	_Aug (292)	_Sep (174)	_Oct (126)	_Nov (126)	_Dec (79)
2010	_Jan (109)	_Feb (83)	_Mar (139)	_Apr (91)	_May (79)	_Jun (164)	_Jul (184)	_Aug (146)	_Sep (163)	_Oct (128)	_Nov (70)	_Dec (73)
2011	_Jan (235)	_Feb (165)	_Mar (147)	_Apr (86)	_May (74)	_Jun (118)	_Jul (65)	_Aug (75)	_Sep (162)	_Oct (94)	_Nov (48)	_Dec (44)
2012	_Jan (49)	_Feb (40)	_Mar (88)	_Apr (35)	_May (52)	_Jun (69)	_Jul (90)	_Aug (123)	_Sep (112)	_Oct (120)	_Nov (105)	_Dec (116)
2013	_Jan (76)	_Feb (26)	_Mar (78)	_Apr (43)	_May (61)	_Jun (53)	_Jul (147)	_Aug (85)	_Sep (83)	_Oct (122)	_Nov (18)	_Dec (27)
2014	_Jan (58)	_Feb (25)	_Mar (49)	_Apr (17)	_May (29)	_Jun (39)	_Jul (53)	_Aug (52)	_Sep (35)	_Oct (47)	_Nov (110)	_Dec (27)
2015	_Jan (50)	_Feb (93)	_Mar (96)	_Apr (30)	_May (55)	_Jun (83)	_Jul (44)	_Aug (8)	_Sep (5)	_Oct	_Nov (1)	_Dec (1)
2016	_Jan	_Feb	_Mar (1)	_Apr	_May	_Jun (2)	_Jul	_Aug (3)	_Sep (1)	_Oct (3)	_Nov	_Dec
2017	_Jan	_Feb (5)	_Mar	_Apr	_May	_Jun	_Jul (3)	_Aug	_Sep (7)	_Oct	_Nov	_Dec
2018	_Jan	_Feb	_Mar	_Apr	_May	_Jun	_Jul (2)	_Aug	_Sep	_Oct	_Nov	_Dec

S	M	T	W	T	F	S
		1 (2)	2 (5)	3	4	5 (1)
6	7	8	9	10 (2)	11 (3)	12
13 (1)	14	15 (3)	16 (6)	17 (4)	18 (4)	19 (5)
20 (2)	21 (9)	22 (3)	23 (1)	24 (1)	25 (2)	26
27	28 (10)	29 (6)	30 (5)	31 (4)

Flat | Threaded

Re: [matplotlib-devel] boxplot notch

From: Fernando P. <fpe...@gm...> - 2009年12月15日 22:26:27

On Tue, Dec 15, 2009 at 9:57 AM, Andrew Straw <str...@as...> wrote:
>
>  notch_max = med + 1.57*iq/np.sqrt(row)
>  notch_min = med - 1.57*iq/np.sqrt(row)
>
> Is this code actually calculating a meaningful value? If so, what?
>
>From the statistics ignoramus in the room, so take this with a grain
of salt... I'd write that code as
notch_max = med + (iq/2) * (pi/np.sqrt(row))
and it makes more sense. The notch limits are an estimate of the
interval of the median, which is (one-half, for each up/down) the
q3-q1 range times a normalization factor which is pi/sqrt(n), where
n==row=len(d). The 1/sqrt(n) makes some sense, as it's the usual
statistical error normalization factor. The multiplication by pi, I'm
not so sure, and I can't find that exact formula in any quick stats
reference, but I'm sure someone who actually knows stats can point out
where it comes from.
Note that the code below does:
 if notch_max > q3:
 notch_max = q3
 if notch_min < q1:
 notch_min = q1
though matlab explicitly states in:
http://www.mathworks.com/access/helpdesk/help/toolbox/stats/boxplot.html
that
"""
Interval endpoints are the extremes of the notches or the centers of
the triangular markers. When the sample size is small, notches may
extend beyond the end of the box.
"""
So it seems to me that the more principled thing to do would be to
leave those notch markers outside the box if they land there, because
that's a warning of the robustness of the estimation. Clipping them to
q1/q3 is effectively hiding a problem...
cheers,
f

[matplotlib-devel] boxplot notch

From: Andrew S. <str...@as...> - 2009年12月15日 17:58:06

Hi,
I've been reading about box plots and examining the source code for 
boxplot() lately. While there doesn't seem to be a convention about what 
the notch specifies, I can't find any justification (or text describing) 
what exactly the MPL notch is. The source code is:
 # get median and quartiles
 q1, med, q3 = mlab.prctile(d,[25,50,75])
 iq = q3 - q1
 notch_max = med + 1.57*iq/np.sqrt(row)
 notch_min = med - 1.57*iq/np.sqrt(row)
Is this code actually calculating a meaningful value? If so, what?
The original commit was r1098, which doesn't offer a useful comment 
either (only "aaplied several sf patches" ... looking through the SF bug 
tracker, I couldn't find anything relevant from before the commit date 
of 2005年03月28日).

[matplotlib-devel] should mlab.prctile(x,50) == np.median(x)?

From: Andrew S. <str...@as...> - 2009年12月15日 17:23:08

The following (uncommitted) test currently fails. The reason is that 
mlab.prctile(x,50) doesn't handle even length sequences according to the 
numpy and wikipedia convention for the definition of median. Do we agree 
that it should pass?
Not only would I commit the test, but I also have a fix to make it pass, 
derived from scipy.stats.scoreatpercentile().
This would affect boxplot, if not more.
def test_prctile():
 # test odd lengths
 x=[1,2,3]
 assert mlab.prctile(x,50)==np.median(x)
 # test even lengths
 x=[1,2,3,4]
 assert mlab.prctile(x,50)==np.median(x)
 # derived from email sent by jason-sage to MPL-user on 20090914
 ob1=[1,1,2,2,1,2,4,3,2,2,2,3,4,5,6,7,8,9,7,6,4,5,5]
 p = [75]
 expected = [5.5]
 # test vectorized
 actual = mlab.prctile(ob1,p)
 assert np.allclose( expected, actual )
 # test scalar
 for pi, expectedi in zip(p,expected):
 actuali = mlab.prctile(ob1,pi)
 assert np.allclose( expectedi, actuali )

Flat | Threaded

Thanks for helping keep SourceForge clean.