Name	Name	Last commit message	Last commit date
Latest commit History 11 Commits
bcht_bigdata_obsever	bcht_bigdata_obsever
conf	conf
README.md	README.md
pom.xml	pom.xml

Name

Last commit message

Last commit date

Latest commit

History

EsIndexer

使用hbase协处理器将hbase数据写入elaticsearch中字段以及更新规则可配置

该工具主要分为2部分: 1.hbase协处理器,需要将项目打包成jar后加载到需要建索的hbase表上。加载: disable 'mytable' alter 'mytable', METHOD => 'table_att','coprocessor'=>'hdfs://bd1.baht|com.bcht.obsever.DataObserver||1001' enable 'mytable'

卸载: disable 'mytable' alter 'mytable',METHOD=>'table_att_unset',NAME=>'coprocessor1ドル' enable 'mytable 注意事项:每次协处理器更新后重新加载需要jar包位置不同或者jar包名称做修改。因为协处理器会有缓存,需要保证读取的class路径不同,以免无法生效。

2.es建索程序,项目打包后需要将conf文件夹也放在与jar包平级的文件夹下。直接执行java -jar com.bcht.es.EsIndexer 即可,可以根据需要指定输出日志路径。停止程序的时候请使用kill -15命令触发程序钩子,确保正在建索的消息正常消费完成。

3.reindex程序暂时没有写,思路就是将指定的hbase表数据全部scan出来后发送到kafka中即可。

4.该工具需要kafka当作消息处理中间件,目的是为了批量建立es索引,提高消费吞吐量。在测试机器上面进行简单的表建索实验可以达到 10000条/s以上的速度,测试机器配置较差,理论上生产环境大部分情景下的数据处理都能保证较低的数据延迟。

About

使用hbase协处理器将hbase数据写入elaticsearch中字段以及更新规则可配置

Resources

Stars

Watchers

Forks

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

world1256/EsIndexer

Folders and files

Latest commit

History

Repository files navigation

EsIndexer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

EsIndexer

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages