Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

March 06, 2026: Weekly Status Update in Gluten #11714

GlutenPerfBot started this conversation in General
Discussion options

This weekly update is generated by LLMs. You're welcome to join our Github for in-depth discussions.

Overall Activity Summary

The past 7 days have seen intense activity across the Gluten project with 60+ pull requests and 20+ active issues. The community is actively preparing for the upcoming 1.6.0 release while simultaneously advancing major features like ANSI mode support, Parquet type widening, and dynamic filtering optimizations. The Velox backend continues to dominate development focus with significant performance improvements and bug fixes.

Key Ongoing Projects

Dynamic Filter Pushdown & Performance Optimizations

ANSI Mode Support Expansion

Parquet Type Widening & Schema Evolution

Release Preparation

Priority Items

Critical Bug Fixes Needed:

Performance Critical:

Notable Discussions

#11585: Useful Velox PRs Tracking - @FelixYBW maintains a comprehensive tracker of 100+ Velox PRs submitted by the Gluten community that haven't been merged upstream, including critical fixes for ANSI mode, Parquet reading, and performance optimizations.

#11713: Apache Gluten Graduation Tasks - @weiting-chen coordinates Gluten's transition from Apache Incubator to Top Level Project, involving repository renaming, documentation updates, and process changes.

#8429: Gluten Slack Channel - @zhouyuan announced the new ASF workspace Slack channel for real-time community discussions.

Emerging Trends

  1. ANSI Mode as Default: With Spark 4.0 enabling ANSI by default, the community is rapidly implementing ANSI-compliant functions and type conversions
  2. Dynamic Filtering Revolution: Multiple PRs focus on pushing filters closer to storage for significant performance gains
  3. Release Quality Focus: Extensive test suite fixes and infrastructure improvements ahead of 1.6.0 release
  4. Cross-Backend Compatibility: Increased attention to ensuring features work across Velox, ClickHouse, and other backends

Good First Issues

#11699: S3 IMDS Configuration - Add support for Velox's new S3 IMDS configuration options. Good for contributors familiar with cloud storage configurations.

#11703: Iceberg Configuration Mapping - Map Iceberg writer configurations to Velox equivalents. Requires understanding of both Iceberg and Velox configuration systems.

#11513: Fix input_file_name() for Iceberg - Resolve the issue where input_file_name() returns empty strings on Iceberg tables. Good introduction to Iceberg integration.

#10134: ANSI Mode Support - Contribute to the comprehensive ANSI mode implementation. Multiple sub-tasks available for different type casting functions, suitable for contributors wanting to learn Spark's type system.

#11622: TIMESTAMP_NTZ Type Support - Implement support for Spark's TIMESTAMP_NTZ type in Velox backend. Good for learning type system integration between Spark and native engines.

You must be logged in to vote

Replies: 0 comments

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
1 participant

AltStyle によって変換されたページ (->オリジナル) /