redframes

General Purpose Data Manipulation Library

Popularity: 2.8 (Stable)

Activity: 1.4

Stars: 325

Watchers: 5

Forks: 5

Last Commit: almost 3 years ago

Description

redframes (rectangular data frames) is a data manipulation library for ML and visualization. It is fully interoperable with pandas, compatible with scikit-learn, and works great with matplotlib!

redframes prioritizes syntax over flexibility and scope, and minimizes the number-of-googles-per-lines-of-code™ so that you can focus on the work that matters most.

"What is redframes?" would be the answer to the Jeopardy! clue "A pythonic dplyr".

Programming language: Python

License: BSD 2-clause "Simplified" License

Tags: Machine Learning, Visualization, Pandas, Data

redframes alternatives and similar packages

Based on the "Machine Learning" category.

scikit-learn

10.0 9.9 L3 redframes VS scikit-learn

scikit-learn: machine learning in Python

tensorflow

10.0 10.0 L1 redframes VS tensorflow

An Open Source Machine Learning Framework for Everyone

Keras

9.9 9.8 L2 redframes VS Keras

Deep Learning for humans

gym

9.8 0.0 redframes VS gym

A toolkit for developing and comparing reinforcement learning algorithms.

xgboost

9.7 9.6 L1 redframes VS xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

MindsDB

9.7 9.9 redframes VS MindsDB

Federated Query Engine for AI - The only MCP Server you'll ever need

dspy

9.6 9.8 redframes VS dspy

DSPy: The framework for programming—not prompting—language models

MLflow

9.6 10.0 redframes VS MLflow

The open source developer platform to build AI agents and models with confidence. Enhance your AI applications with end-to-end tracking, observability, and evaluations, all in one integrated platform.

PaddlePaddle

9.6 10.0 L1 redframes VS PaddlePaddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

Prophet

9.5 6.2 redframes VS Prophet

Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.

CNTK

9.5 0.0 L1 redframes VS CNTK

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

gensim

9.4 7.9 L3 redframes VS gensim

Topic Modelling for Humans

TFLearn

9.0 0.0 L3 redframes VS TFLearn

Deep learning library featuring a higher-level API for TensorFlow.

NuPIC

8.8 0.0 L3 redframes VS NuPIC

DISCONTINUED. Numenta Platform for Intelligent Computing is an implementation of Hierarchical Temporal Memory (HTM), a theory of intelligence based strictly on the neuroscience of the neocortex.

H2O

8.7 8.7 redframes VS H2O

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.

Pyro.ai

8.6 7.0 redframes VS Pyro.ai

Deep universal probabilistic programming with Python and PyTorch

Surprise

8.4 0.0 L4 redframes VS Surprise

A Python scikit for building and analyzing recommender systems

srez

8.3 0.0 L5 redframes VS srez

DISCONTINUED. Image super-resolution through deep learning
LightFM

7.9 0.0 L4 redframes VS LightFM

A Python implementation of LightFM, a hybrid recommendation algorithm.

Atomic Agents

7.9 9.5 redframes VS Atomic Agents

Building AI agents, atomically

Pylearn2

7.7 0.0 L2 redframes VS Pylearn2

Warning: This project does not have any current developer. See below.

skflow

7.6 1.3 L4 redframes VS skflow

DISCONTINUED. Simplified interface for TensorFlow (mimicking Scikit Learn) for Deep Learning

Sacred

7.5 3.1 redframes VS Sacred

Sacred is a tool to help you configure, organize, log and reproduce experiments developed at IDSIA.

PyBrain

7.4 0.0 L4 redframes VS PyBrain

Another Python Machine Learning Library.

Clairvoyant

7.0 2.1 L3 redframes VS Clairvoyant

Software designed to identify and monitor social/historical cues for short term stock movement

garak, LLM vulnerability scanner

6.3 9.8 redframes VS garak, LLM vulnerability scanner

DISCONTINUED. the LLM vulnerability scanner [Moved to: https://github.com/NVIDIA/garak]

karateclub

6.2 6.7 redframes VS karateclub

Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs (CIKM 2020)

Metrics

6.1 0.0 redframes VS Metrics

Machine learning evaluation metrics, implemented in Python, R, Haskell, and MATLAB / Octave

python-recsys

6.0 0.0 L4 redframes VS python-recsys

A python library for implementing a recommender system

awesome-embedding-models

5.9 0.0 redframes VS awesome-embedding-models

A curated list of awesome embedding models tutorials, projects and communities.

pydeep

5.8 0.0 L3 redframes VS pydeep

Deep learning in Python

Crab

5.5 0.0 L2 redframes VS Crab

Crab is a flexible, fast recommender engine for Python that integrates classic information filtering recommendation algorithms in the world of scientific Python packages (numpy, scipy, matplotlib).

hebel

5.0 0.0 L2 redframes VS hebel

GPU-Accelerated Deep Learning Library in Python

seqeval

4.8 2.6 redframes VS seqeval

A Python framework for sequence labeling evaluation (named-entity recognition, POS tagging, etc.)

Xorbits

4.7 2.7 redframes VS Xorbits

Scalable Python DS & ML, in an API compatible & lightning fast way.

adaptive

4.6 4.8 redframes VS adaptive

📈 Adaptive: parallel active learning of mathematical functions

TrueSkill, the video game rating system

4.3 1.4 redframes VS TrueSkill, the video game rating system

An implementation of the TrueSkill rating system for Python

pdpipe

3.9 7.0 redframes VS pdpipe

Easy pipelines for pandas DataFrames.

SciKit-Learn Laboratory

3.9 4.3 redframes VS SciKit-Learn Laboratory

SciKit-Learn Laboratory (SKLL) makes it easy to run machine learning experiments.

Robocorp Action Server

3.9 7.5 redframes VS Robocorp Action Server

Create 🐍 Python AI Actions and 🤖 Automations, and deploy & operate them anywhere

rwa

3.8 0.0 L5 redframes VS rwa

Machine Learning on Sequential Data Using a Recurrent Weighted Average

nptyping

3.6 0.0 redframes VS nptyping

💡 Type hints for Numpy and Pandas

Feature Forge

3.5 0.0 L4 redframes VS Feature Forge

A set of tools for creating and testing machine learning features, with a scikit-learn compatible API

Data Flow Facilitator for Machine Learning (dffml)

3.3 9.3 redframes VS Data Flow Facilitator for Machine Learning (dffml)

DISCONTINUED. The easiest way to use Machine Learning. Mix and match underlying ML libraries and data set sources. Generate new datasets or modify existing ones with ease.

brew

3.2 0.0 L4 redframes VS brew

DISCONTINUED. Multiple Classifier Systems and Ensemble Learning Library in Python.
bodywork

3.1 0.0 redframes VS bodywork

DISCONTINUED. ML pipeline orchestration and model deployments on Kubernetes.

openskill.py

2.9 7.3 redframes VS openskill.py

Multiplayer Rating System. No Friction.

OptaPy

2.9 5.5 redframes VS OptaPy

OptaPy is an AI constraint solver for Python to optimize planning and scheduling problems.

MLP Classifier

2.8 0.0 L4 redframes VS MLP Classifier

A handwritten multilayer perceptron classifier using numpy.

vowpal_porpoise

2.6 0.0 L3 redframes VS vowpal_porpoise

lightweight python wrapper for vowpal wabbit

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

README

About

redframes (rectangular data frames) is a general purpose data manipulation library that prioritizes syntax, simplicity, and speed (to a solution). Importantly, the library is fully interoperable with pandas, compatible with scikit-learn, and works great with matplotlib.

Install & Import

pip install redframes

import redframes as rf

Quickstart

Copy-and-paste this to get started:

import redframes as rf
df = rf.DataFrame({
    'bear': ['Brown bear', 'Polar bear', 'Asian black bear', 'American black bear', 'Sun bear', 'Sloth bear', 'Spectacled bear', 'Giant panda'],
    'genus': ['Ursus', 'Ursus', 'Ursus', 'Ursus', 'Helarctos', 'Melursus', 'Tremarctos', 'Ailuropoda'],
    'weight (male, lbs)': ['300-860', '880-1320', '220-440', '125-500', '60-150', '175-310', '220-340', '190-275'],
    'weight (female, lbs)': ['205-455', '330-550', '110-275', '90-300', '45-90', '120-210', '140-180', '155-220']
})
# | bear | genus | weight (male, lbs) | weight (female, lbs) |
# |:--------------------|:-----------|:---------------------|:-----------------------|
# | Brown bear | Ursus | 300-860 | 205-455 |
# | Polar bear | Ursus | 880-1320 | 330-550 |
# | Asian black bear | Ursus | 220-440 | 110-275 |
# | American black bear | Ursus | 125-500 | 90-300 |
# | Sun bear | Helarctos | 60-150 | 45-90 |
# | Sloth bear | Melursus | 175-310 | 120-210 |
# | Spectacled bear | Tremarctos | 220-340 | 140-180 |
# | Giant panda | Ailuropoda | 190-275 | 155-220 |
(
    df
    .rename({"weight (male, lbs)": "male", "weight (female, lbs)": "female"})
    .gather(["male", "female"], into=("sex", "weight"))
    .split("weight", into=["min", "max"], sep="-")
    .gather(["min", "max"], into=("stat", "weight"))
    .mutate({"weight": lambda row: float(row["weight"])})
    .group(["genus", "sex"])
    .rollup({"weight": ("weight", rf.stat.mean)})
    .spread("sex", using="weight")
    .mutate({"dimorphism": lambda row: round(row["male"] / row["female"], 2)})
    .drop(["male", "female"])
    .sort("dimorphism", descending=True)
)
# | genus | dimorphism |
# |:-----------|-------------:|
# | Ursus | 2.01 |
# | Tremarctos | 1.75 |
# | Helarctos | 1.56 |
# | Melursus | 1.47 |
# | Ailuropoda | 1.24 |

For comparison, here's the equivalent pandas:

import pandas as pd
# df = pd.DataFrame({...})
df = df.rename(columns={"weight (male, lbs)": "male", "weight (female, lbs)": "female"})
df = pd.melt(df, id_vars=['bear', 'genus'], value_vars=['male', 'female'], var_name='sex', value_name='weight')
df[["min", "max"]] = df["weight"].str.split("-", expand=True)
df = df.drop("weight", axis=1)
df = pd.melt(df, id_vars=['bear', 'genus', 'sex'], value_vars=['min', 'max'], var_name='stat', value_name='weight')
df['weight'] = df["weight"].astype('float')
df = df.groupby(["genus", "sex"])["weight"].mean()
df = df.reset_index()
df = pd.pivot_table(df, index=['genus'], columns=['sex'], values='weight')
df = df.reset_index()
df = df.rename_axis(None, axis=1)
df["dimorphism"] = round(df["male"] / df["female"], 2)
df = df.drop(["female", "male"], axis=1)
df = df.sort_values("dimorphism", ascending=False)
df = df.reset_index(drop=True)
# 🤮
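
To see where the numbers in the dimorphism table come from, the same calculation can be sketched with the standard library alone. This is only an illustration of the arithmetic, not how redframes or pandas implement it; the variable names (`bears`, `weights`, `ranked`) are made up for this example.

```python
from collections import defaultdict
from statistics import mean

# Same data as the Quickstart: (genus, male range, female range) per bear.
bears = [
    ("Ursus", "300-860", "205-455"),
    ("Ursus", "880-1320", "330-550"),
    ("Ursus", "220-440", "110-275"),
    ("Ursus", "125-500", "90-300"),
    ("Helarctos", "60-150", "45-90"),
    ("Melursus", "175-310", "120-210"),
    ("Tremarctos", "220-340", "140-180"),
    ("Ailuropoda", "190-275", "155-220"),
]

# Gather + split: collect every min/max endpoint under its (genus, sex) group.
weights = defaultdict(list)
for genus, male, female in bears:
    for sex, weight_range in (("male", male), ("female", female)):
        weights[(genus, sex)].extend(float(w) for w in weight_range.split("-"))

# Rollup + spread + mutate: mean weight per group, then the male/female ratio.
genera = dict.fromkeys(genus for genus, _, _ in bears)  # unique, first-seen order
dimorphism = {
    genus: round(mean(weights[(genus, "male")]) / mean(weights[(genus, "female")]), 2)
    for genus in genera
}

# Sort: largest ratio first.
ranked = sorted(dimorphism.items(), key=lambda item: item[1], reverse=True)
for genus, ratio in ranked:
    print(f"{genus:<12}{ratio}")
```

Each bear contributes both endpoints of its range, so the group mean is taken over all min and max values in a genus, which is what gathering `min` and `max` before the rollup does in the chain above.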

IO

Save, load, and convert rf.DataFrame objects:

# save .csv
rf.save(df, "bears.csv")
# load .csv
df = rf.load("bears.csv")
# convert redframes → pandas
pandas_df = rf.unwrap(df)
# convert pandas → redframes
df = rf.wrap(pandas_df)

Verbs

Verbs are pure and "chain-able" methods that manipulate rf.DataFrame objects. Here is the complete list (see docstrings for examples and more details):

Verb	Description
`accumulate`‡	Run a cumulative sum over a column
`append`	Append rows from another DataFrame
`combine`	Combine multiple columns into a single column (opposite of `split`)
`cross`	Cross join columns from another DataFrame
`dedupe`	Remove duplicate rows
`denix`	Remove rows with missing values
`drop`	Drop entire columns (opposite of `select`)
`fill`	Fill missing values "down", "up", or with a constant
`filter`	Keep rows matching specific conditions
`gather`‡	Gather columns into rows (opposite of `spread`)
`group`	Prepare groups for compatible verbs‡
`join`	Join columns from another DataFrame
`mutate`	Create a new, or overwrite an existing column
`pack`‡	Collate and concatenate row values for a target column (opposite of `unpack`)
`rank`‡	Rank order values in a column
`rename`	Rename column keys
`replace`	Replace matching values within columns
`rollup`‡	Apply summary functions and/or statistics to target columns
`sample`	Randomly sample any number of rows
`select`	Select specific columns (opposite of `drop`)
`shuffle`	Shuffle the order of all rows
`sort`	Sort rows by specific columns
`split`	Split a single column into multiple columns (opposite of `combine`)
`spread`	Spread rows into columns (opposite of `gather`)
`take`‡	Take any number of rows (from the top/bottom)
`unpack`	"Explode" concatenated row values into multiple rows (opposite of `pack`)
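
Several verbs are listed as opposites of one another. As a rough illustration of what that means for `gather` and `spread`, here is a standard-library sketch that works on a list of dict rows. The two functions below are hypothetical stand-ins written for this example; they borrow the verb names and argument shapes from the Quickstart but are not the redframes implementation, and the row order they produce may differ from the library's.

```python
def gather(rows, columns, into):
    """Turn the named columns into (key, value) pairs, one output row per pair."""
    key_name, value_name = into
    out = []
    for row in rows:
        kept = {k: v for k, v in row.items() if k not in columns}
        for column in columns:
            out.append({**kept, key_name: column, value_name: row[column]})
    return out


def spread(rows, column, using):
    """Turn the distinct values of `column` back into columns filled from `using`."""
    wide = {}
    for row in rows:
        # Everything except the key/value pair identifies the output row.
        ident = tuple((k, v) for k, v in row.items() if k not in (column, using))
        wide.setdefault(ident, dict(ident))[row[column]] = row[using]
    return list(wide.values())


wide_rows = [
    {"genus": "Tremarctos", "male": 280.0, "female": 160.0},
    {"genus": "Ailuropoda", "male": 232.5, "female": 187.5},
]

# Wide to long: one row per (genus, sex) pair.
long_rows = gather(wide_rows, ["male", "female"], into=("sex", "weight"))

# Long back to wide: the round trip recovers the original rows.
round_trip = spread(long_rows, "sex", using="weight")
print(round_trip == wide_rows)
```

The same pairing idea applies to `split`/`combine` and `pack`/`unpack`: applying one and then the other should return data equivalent to what you started with.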

Properties

In addition to the verbs, several properties are attached to each DataFrame object:

df["genus"] 
# ['Ursus', 'Ursus', 'Ursus', 'Ursus', 'Helarctos', 'Melursus', 'Tremarctos', 'Ailuropoda']
df.columns 
# ['bear', 'genus', 'weight (male, lbs)', 'weight (female, lbs)']
df.dimensions
# {'rows': 8, 'columns': 4}
df.empty
# False
df.memory
# '2 KB'
df.types
# {'bear': object, 'genus': object, 'weight (male, lbs)': object, 'weight (female, lbs)': object}

matplotlib

rf.DataFrame objects integrate seamlessly with matplotlib:

import redframes as rf
import matplotlib.pyplot as plt
football = rf.DataFrame({
    'position': ['TE', 'K', 'RB', 'WR', 'QB'],
    'avp': [116.98, 131.15, 180, 222.22, 272.91]
})
df = (
    football
    .mutate({"color": lambda row: row["position"] in ["WR", "RB"]})
    .replace({"color": {False: "orange", True: "red"}})
)
plt.barh(df["position"], df["avp"], color=df["color"]);

scikit-learn

rf.DataFrame objects are fully compatible with sklearn functions, estimators, and transformers:

import redframes as rf
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
df = rf.DataFrame({
    "touchdowns": [15, 19, 5, 7, 9, 10, 12, 22, 16, 10],
    "age": [21, 22, 21, 24, 26, 28, 30, 35, 28, 21],
    "mvp": [1, 1, 0, 0, 0, 0, 0, 1, 0, 0]
})
target = "touchdowns"
y = df[target]
X = df.drop(target)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=1)
model = LinearRegression()
model.fit(X_train, y_train)
model.score(X_test, y_test)
# 0.5083194901655527
print(X_train.take(1))
# rf.DataFrame({'age': [21], 'mvp': [0]})
X_new = rf.DataFrame({'age': [22], 'mvp': [1]})
model.predict(X_new)
# array([19.])
