Jump to content
Wikipedia The Free Encyclopedia

Talk:Data binning

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia
This article is rated Stub-class on Wikipedia's content assessment scale.
It is of interest to the following WikiProjects:
WikiProject icon Statistics High‐importance
WikiProject icon This article is within the scope of WikiProject Statistics , a collaborative effort to improve the coverage of statistics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.StatisticsWikipedia:WikiProject StatisticsTemplate:WikiProject StatisticsStatistics
High This article has been rated as High-importance on the importance scale.
WikiProject icon This article is within the scope of WikiProject Mathematics , a collaborative effort to improve the coverage of mathematics on Wikipedia. If you would like to participate, please visit the project page, where you can join the discussion and see a list of open tasks.MathematicsWikipedia:WikiProject MathematicsTemplate:WikiProject Mathematicsmathematics
Low This article has been rated as Low-priority on the project's priority scale.

I'm no expert on data binning, although an academic statistician. It seems to me that at least the comparison with a histogram is wrong. They form completely different notions. Data binning seems to me to be a kind of smoothing method. Nijdam (talk) 21:56, 22 July 2010 (UTC) [reply ]

I am currently studying data binning in the context of PET image reconstruction and it seems that it is extensively used (SSRB Single Slice Rebinning, MSRB, Fourier Rebinning (FORE), etc). Seems to be too much in-depth for such an article, at least without first explaining some key concepts. Will add anyway unless told otherwise. --Garuh knight (talk) 23:09, 28 June 2011 (UTC) [reply ]
So, normally ``binning" is just making a histogram, but there are more advanced methods which involve some type of data smoothing. One simple way (which I've seen called ``Lever smoothing" is to split a data point between two adjacent bins based on how far it is from the midpoint of each bin. Right now I'm trying to find information on how to implement ``Gaussian binning" (my term), which is where you bin data into bins weighted by a Gaussian. It is definitely a technique used, but all the information on it seems to be buried in technical publications. If that could be described in the article it would be a very valuable contribution. Danski14 (talk) 18:43, 28 October 2014 (UTC) [reply ]
LightGBM by Microsoft and its scikit-learn similar Histogram-based Gradient Boosting algorithm are applications of binning too. --Dnavarro (talk) 15:31, 18 December 2019 (UTC) [reply ]

Without a supporting reference, I am not so sure about the comments regarding binning as being useful for GC/MS or NMR spectra. Barcodeplane (talk) 16:03, 7 May 2020 (UTC) [reply ]

AltStyle によって変換されたページ (->オリジナル) /