Gluten Flink Pending tasks #12164

KevinyhZou started this conversation in General

Currently, Gluten Flink supports the Nexmark SQL tasks q0 to q22, but some operators still need to be refined. We plan to prioritize support for stateless operators first, then add support for stateful operators.

Stateless operators

  • Connector support: kafka, pulsar, hdfs
    1. the kafka ([Flink][Feature]Support kafka connector bigo-sg/velox#2) and hdfs ([FLINK]Support filesystem sink bigo-sg/velox#6) connectors are already implemented, but need more tests and refinement
    2. the pulsar connector is not implemented yet
  • Format support: json, pb, parquet, orc
    1. json format is done; pb, parquet, and orc are not implemented yet
  • state management: pulsar/kafka offsets, hive write part counter, ...
  • checkpoint: periodic snapshot and restore
    1. interfaces have been defined
  • failover: exceptions or errors need to be caught by velox, and flink controls the failover process
  • support multiple parallelism
    currently only a parallelism of 1 is supported
  • basic data type
    1. primitive and complex types already supported: Boolean, Int, Bigint, Double, Varchar, Char, Timestamp, Decimal, Date, Row, Array, Map
    2. types not fully supported: Timestamp with precision or time zone, e.g. timestamp(n), timestamp_ltz
  • expression support
    FROM_UNIXTIME, DATE_FORMAT, UNIX_TIMESTAMP, HOUR, DAY, YEAR, TO_TIMESTAMP, PROCTIME, CONCAT, JSON_VALUE, TO_DATE, TO_TIMESTAMP_LTZ, SUBSTRING, CURRENT_DATE, FROM_BASE64, COALESCE, IFNULL, TIMESTAMPADD, LOCALTIMESTAMP
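
To make the state management and checkpoint items above more concrete, here is a minimal sketch of what "snapshot and restore" means for connector offsets. It is only an illustration and not the Gluten, Velox, or Flink API: the class `OffsetTrackingSource` and its methods `poll`, `snapshot_state`, and `restore_state` are hypothetical names, and the partitions are plain in-memory lists standing in for kafka/pulsar partitions.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class OffsetTrackingSource:
    """Hypothetical source; partitions are in-memory lists, not real kafka/pulsar."""

    partitions: Dict[str, List[str]]
    # Next offset to read, per partition. This is the only state to checkpoint.
    offsets: Dict[str, int] = field(default_factory=dict)

    def poll(self, partition: str) -> Optional[str]:
        """Return the next record of the partition, or None when it is exhausted."""
        pos = self.offsets.get(partition, 0)
        records = self.partitions[partition]
        if pos >= len(records):
            return None
        self.offsets[partition] = pos + 1
        return records[pos]

    def snapshot_state(self) -> Dict[str, int]:
        """Periodic snapshot: return a copy so later reads do not mutate it."""
        return dict(self.offsets)

    def restore_state(self, state: Dict[str, int]) -> None:
        """Restore after failover: resume reading from the snapshotted offsets."""
        self.offsets = dict(state)


source = OffsetTrackingSource({"p0": ["a", "b", "c"]})
first = source.poll("p0")            # reads "a"
checkpoint = source.snapshot_state() # offsets at checkpoint time
lost = source.poll("p0")             # reads "b", then assume the task fails
source.restore_state(checkpoint)     # failover: roll back to the checkpoint
replayed = source.poll("p0")         # "b" is read again after the restore
```

Records read after the last snapshot are replayed after a restore, which is why the offsets have to be part of the checkpointed state.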

Stateful operators

Feature list


Replies: 2 comments


Great! We have to make it happen. Let's start with stateless SQL; please open some issues for the stateless computing work.


bigo-sg/velox4j#35 pulsar connector
support multiple parallelism
