hive 如何处理科学计数法

时间：2015-06-27 10:06:40 阅读：354 评论：0 收藏：0 [点我收藏+]

说明：Hive中int , float , double这些数值类型在存储大额度数字时，在前端展现上总是使用科学计数法来表示，
这样搞的挺烦。举例说明

样例数据：
select lte_drop_rate from t_table limit 10;
输出结果：
5.0E-4
7.1E-4
5.41E-4
...
...

处理方案：
select
(case
--处理非科学计数法表示的字符串
when length(regexp_extract(lte_drop_rate,‘([0-9]+\\.)([0-9]+)(E-*[0-9]+)‘,2))=0
then lte_drop_rate
--处理整数
when length(regexp_extract(lte_drop_rate,‘([0-9]+\\.)([0-9]+)(E[0-9]+)‘,2))<=cast(regexp_extract(lte_drop_rate,‘(E)([0-9]+)‘,2) as int)
then rpad(regexp_replace(regexp_extract(lte_drop_rate,‘([^E]+)‘,1),‘\\.‘,‘‘),cast(regexp_extract(lte_drop_rate,‘(E)([0-9]+)‘,2) as int)+1,‘0‘)
--处理小数
when length(regexp_extract(lte_drop_rate,‘([0-9]+\\.)([0-9]+)(E[0-9]+)‘,2))>cast(regexp_extract(lte_drop_rate,‘(E)([0-9]+)‘,2) as int)
then concat(substr(regexp_replace(regexp_extract(lte_drop_rate,‘([^E]+)‘,1),‘\\.‘,‘‘),1,cast(regexp_extract(lte_drop_rate,‘(E)([0-9]+)‘,2) as int)+1),‘\.‘,
substr(regexp_replace(regexp_extract(lte_drop_rate,‘([^E]+)‘,1),‘\\.‘,‘‘),cast(regexp_extract(lte_drop_rate,‘(E)([0-9]+)‘,2) as int)+2))
--处理类似“3.4E-6”这种字符串
when lte_drop_rate regexp ‘E-‘
then concat(‘0.‘,repeat(‘0‘,cast(regexp_extract(lte_drop_rate,‘(E)(-)([0-9]+)‘,3) as int)-1),regexp_replace(regexp_extract(lte_drop_rate,‘(.+)(E)‘,1),‘\\.‘,‘‘))
else lte_drop_rate
end)
from t_table limit 10

输出结果：
0.00050
0.00051
0.0010
。。。

hive 如何处理科学计数法

标签：hive 科学计数法 hadoop

原文地址：http://blog.csdn.net/sunlei1980/article/details/46652445

踩

(0)

评论一句话评论（0）

分享档案

更多>

2021年07月29日 (22)
2021年07月28日 (40)
2021年07月27日 (32)
2021年07月26日 (79)
2021年07月23日 (29)
2021年07月22日 (30)
2021年07月21日 (42)
2021年07月20日 (16)
2021年07月19日 (90)
2021年07月16日 (35)

周排行