好文档 - 专业文书写作范文服务资料分享网站

Efficient histogram-based range query estimation for dirty data

天下 分享 时间: 加入收藏 我要投稿 点赞

Efficient histogram-based range query estimation

for dirty data

Yan ZHANG;Hongzhi WANG;Long YANG;Jianzhong LI

【期刊名称】《中国高等学校学术文摘·计算机科学》 【年(卷),期】2018(012)005

【摘要】In recent years,data quality issues have attracted wide attentions.Data quality problems are mainly caused by dirty data.Currently,many methods for dirty data management have been proposed,and one of them is entity-based relational database in which one tuple represents an entity.The traditional query optimizations are not suitable for the new entity-based model.Then new query optimizations need to be developed.In this paper,we propose a new query selectivity estimation strategy based on histogram,and focus on solving the overestimation which traditional methods lead to.We prove our approaches are unbiased.The experimental results on both real and synthetic data sets show that our approaches can give good estimates with low error.

【总页数】16页(984-999) 【关键词】

【作者】Yan ZHANG;Hongzhi WANG;Long YANG;Jianzhong LI

【作者单位】Department of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001, China;Department of Computer

Efficient histogram-based range query estimation for dirty data

Efficienthistogram-basedrangequeryestimationfordirtydataYanZHANG;HongzhiWANG;LongYANG;JianzhongLI【期刊名称】《中国高等学校学术文摘·计算机科学》【年(卷),期】2018(012)005【摘要】In
推荐度:
点击下载文档文档为doc格式
4f8bp4pdsu6j6mw9sjhs44p5c1cp9m00dwz
领取福利

微信扫码领取福利

微信扫码分享