data
今日热榜项目TopList的Python实现,异步爬取微博热榜,知乎,V2EX,GIthub,通过Flask展示。
Select, put and delete data from JSON, TOML, YAML, XML, INI, HCL and CSV files with a single tool. Also available as a go mod.
LakeSoul is an end-to-end, realtime and cloud native Lakehouse framework with fast data ingestion, concurrent update and incremental data analytics on cloud storages for both BI and AI applications.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Make stream processing easier! Easy-to-use streaming application development framework and operation platform.
DataX集成可视化页面,选择数据源即可一键生成数据同步任务,支持RDBMS、Hive、HBase、ClickHouse、MongoDB等数据源,批量创建RDBMS数据同步任务,集成开源调度系统,支持分布式、增量同步数据、实时查看运行日志、监控执行器资源、KILL运行进程、数据源信息加密等。
lakeFS - Data version control for your data lake | Git for data
The flexible backend for all your projects 🐰 Turn your DB into a headless CMS, admin panels, or apps with a custom UI, instant APIs, auth & more.
🔥🔥 AllData可定义数据中台,以数据平台为底座,以数据中台为桥梁,以机器学习平台为工厂,以大模型应用为上游产品,提供全链路数字化解决方案。产品正式演示体验、社群咨询、商务采购:https://docs.qq.com/doc/DVHlkSEtvVXVCdEFo
A curated list of awesome MySQL software, libraries, tools and resources
Interactive Redis: A Terminal Client for Redis with AutoCompletion and Syntax Highlighting.
Python cluster client for the official redis cluster. Redis 3.0+.
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
Federated Query Engine for AI - The only MCP Server you'll ever need
Docker compose and Google Colab demo to build a CDC with Delta Lake
Broadcast, Presence, and Postgres Changes via WebSockets
Apache Doris is an easy-to-use, high performance and unified analytics database.
Automatically generate a RESTful API service for your legacy database. No code required!
Compare tables within or across databases
An open source multi-tool for exploring and publishing data
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊
pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (...
Commandline tool for running SQL queries against JSON, CSV, Excel, Parquet, and more.
CLI tool that can execute SQL queries on CSV, LTSV, JSON, YAML and TBLN. Can output to various formats.
World's most advanced database DevSecOps solution for Developer, Security, DBA and Platform Engineering teams. The GitHub/GitLab for database DevSecOps.