Hive query issue - 走看看

zoukankan html css js c++ java

Hive query issue

One time, I have written a query with two tables join,

One table is big table with partitions , another table is filter this big table.

Then join the two tables.

The big table is about some millions after filter by partition, and the small table is 170 thousands rows.

The query running a lot of time.

And the big data environment even go to safe mode for this.

I kill this job .

How to monitor long running hive job for this?

Why the name node come to safe mode for the query?

the parent process was killed for java outofmemory exception, SA found this root cause.

another issue is that, pay attention to the split(field,seperater),

if the seperater is |, you should use [|] or \|, because | stand for special meaning in regex expression.

查看全文

相关阅读:
【CSP2019模拟】题解
 【Codeforces 868 G】— El Toll Caves（类欧几里得）
【Codeforces 868 G】— El Toll Caves（类欧几里得）
如何写出规范的代码？做一名追求极致的软件工程师！
浏览器原理
 URL（待整合到HTTP书中哦）
FTP服务器
 background-image 和 img
XML的总结学习
 逻辑思维代码逻辑

原文地址：https://www.cnblogs.com/huaxiaoyao/p/4663392.html

Copyright © 2011-2022 走看看