bcolz

A columnar data container that can be compressed.

[画像:Blosc logo]

bcolz.blosc.org DISCONTINUED. You can find some alternatives below.

Suggest Changes

Popularity

4.7

Stable

Activity

0.0

Stable

Stars 955

Watchers 61

Forks 149

Last Commit about 3 years ago

Description

bcolz provides columnar, chunked data containers that can be compressed either in-memory and on-disk. Column storage allows for efficiently querying tables, as well as for cheap column addition and removal. It is based on NumPy, and uses it as the standard data container to communicate with bcolz objects, but it also comes with support for import/export facilities to/from HDF5/PyTables tables and pandas dataframes.

bcolz objects are compressed by default not only for reducing memory/disk storage, but also to improve I/O speed. The compression process is carried out internally by Blosc, a high-performance, multithreaded meta-compressor that is optimized for binary data (although it works with text data just fine too).

bcolz can also use numexpr internally (it does that by default if it detects numexpr installed) so as to accelerate many vector and query operations (although it can use pure NumPy for doing so too). numexpr can optimize the memory usage and use multithreading for doing the computations, so it is blazing fast. This, in combination with carray/ctable disk-based, compressed containers, can be used for performing out-of-core computations efficiently, but most importantly transparently.

Just to whet your appetite, here it is an example with real data, where bcolz is already fulfilling the promise of accelerating memory I/O by using compression:

Programming language: C

License: BSD 3-clause "New" or "Revised" License

Tags: Science And Data Analysis High Performance Data Analysis

Latest version: v1.2.1

bcolz alternatives and similar packages

Based on the "Science and Data Analysis" category.
Alternatively, view bcolz alternatives based on common mentions on social networks and blogs.

Pandas

9.9 9.9 L2 bcolz VS Pandas

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

pandas-dev logo
NumPy

9.8 10.0 L1 bcolz VS NumPy

The fundamental package for scientific computing with Python.

numpy logo

InfluxDB – Built for High-Performance Time Series Workloads

InfluxDB 3 OSS is now GA. Transform, enrich, and act on time series data directly in the database. Automate critical tasks and eliminate the need to move data externally. Download now.

Promo www.influxdata.com

[画像:InfluxDB Logo]

SciPy

9.4 10.0 L2 bcolz VS SciPy

SciPy library main repository

scipy logo
SymPy

9.4 9.9 L2 bcolz VS SymPy

A computer algebra system written in pure Python

sympy logo
NetworkX

9.3 9.6 L3 bcolz VS NetworkX

Network Analysis in Python

networkx logo
Dask

9.2 9.4 L2 bcolz VS Dask

Parallel computing with task scheduling

dask logo
statsmodels

9.2 9.5 L3 bcolz VS statsmodels

Statsmodels: statistical modeling and econometrics in Python

statsmodels logo
Getting Started

9.1 5.9 bcolz VS Getting Started

PyGWalker: Turn your dataframe into an interactive UI for visual analysis

Kanaries logo
PyMC

8.9 9.3 L4 bcolz VS PyMC

Bayesian Modeling and Probabilistic Programming in Python

pymc-devs logo
Numba

8.8 9.8 L3 bcolz VS Numba

NumPy aware dynamic Python compiler using LLVM

numba logo
astropy

8.4 9.9 L2 bcolz VS astropy

Astronomy and astrophysics core library

astropy logo
Biopython

8.3 9.1 L2 bcolz VS Biopython

Official git repository for Biopython (originally converted from CVS)

biopython logo
orange

8.2 9.6 L2 bcolz VS orange

🍊 :bar_chart: :bulb: Orange: Interactive data analysis

biolab logo
RDKit

7.5 9.6 L1 bcolz VS RDKit

The official sources for the RDKit library

rdkit logo
Statsforecast

7.5 7.4 bcolz VS Statsforecast

Lightning ⚡️ fast forecasting with statistical and econometric models.

Nixtla logo
Interactive Parallel Computing with IPython

7.3 8.0 L3 bcolz VS Interactive Parallel Computing with IPython

IPython Parallel: Interactive Parallel Computing in Python

ipython logo
blaze

7.1 0.0 L4 bcolz VS blaze

NumPy and Pandas interface to Big Data

blaze logo
Cubes

5.8 0.0 L3 bcolz VS Cubes

[NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis

DataBrewery logo
#<Sawyer::Resource:0x00007f547e829e00>

5.8 5.5 bcolz VS #<Sawyer::Resource:0x00007f547e829e00>

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

fugue-project logo
Open Mining

5.7 0.0 L3 bcolz VS Open Mining

DISCONTINUED. Business Intelligence (BI) in Python, OLAP

mining logo
NIPY

5.4 6.7 L3 bcolz VS NIPY

Workflows and interfaces for neuroimaging packages

nipy logo
bcbio-nextgen

5.4 6.2 L3 bcolz VS bcbio-nextgen

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

bcbio logo
bccb

4.5 4.4 L4 bcolz VS bccb

Incubator for useful bioinformatics code, primarily in Python and R

chapmanb logo
Neupy

4.4 0.0 L5 bcolz VS Neupy

NeuPy is a Tensorflow based python library for prototyping and building neural networks

itdxer logo
Bubbles

3.7 0.0 L5 bcolz VS Bubbles

[NOT MAINTAINED] Bubbles – Python ETL framework

Stiivi logo
PyDy

3.6 9.0 L3 bcolz VS PyDy

Multibody dynamics tool kit.

pydy logo
harold

2.5 1.8 L2 bcolz VS harold

An open-source systems and controls toolbox for Python3

ilayn logo
signac

2.5 8.4 bcolz VS signac

Manage large and heterogeneous data spaces on the file system.

glotzerlab logo
PatZilla

2.3 1.8 bcolz VS PatZilla

PatZilla is a modular patent information research platform and data integration toolkit with a modern user interface and access to multiple data sources.

ip-tools logo
LynxKite

2.2 7.1 bcolz VS LynxKite

The complete graph data science platform

lynxkite logo
Kotori

2.1 2.0 bcolz VS Kotori

A flexible data historian based on InfluxDB, Grafana, MQTT, and more. Free, open, simple.

daq-tools logo
Terkin

1.8 0.0 bcolz VS Terkin

Datalogger for MicroPython and CPython.

hiveeyes logo
cclib

0.9 bcolz VS cclib

A library for parsing and interpreting the results of computational chemistry packages.
dask-memusage

0.9 0.0 bcolz VS dask-memusage

A low-impact profiler to figure out how much memory each task in Dask is using

itamarst logo
ElasticBatch

0.9 0.0 bcolz VS ElasticBatch

Elasticsearch tool for easily collecting and batch inserting Python data and pandas DataFrames

dkaslovsky logo
Open Babel

- bcolz VS Open Babel

A chemical toolbox designed to speak the many languages of chemical data.

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

Do you think we are missing an alternative of bcolz or a related project?

Add another 'Science and Data Analysis' Package

Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.

featured getstream.io

Popular Comparisons

SaaSHub - Software Alternatives and Reviews

featured www.saashub.com

Do not miss the trending, packages, news and articles with our weekly report.

Awesome Python is part of the LibHunt network. Terms. Privacy Policy.

(CC)

BY-SA

We recommend Spin The Wheel Of Names for a cryptographically secure random name picker.