Hive常用命令總結

longliqiang88發表於2015-08-12

原文網址 : http://www.codeceo.com/article/hive-command.html

本文只是總結一些在Hive中常用的命令，並且假設需要的目錄或者資料已經存在。

建立表，\t作為列的分隔符

create table trade_detail (id bigint,income double,expenses double,time string) row formate delimited fields terminated by '\t';

create table user_info(id bigint, account string, name string, age int) row format delimited fields terminated by '\t';

接下來是稍複雜的語句，建立表的的同時進行賦值

create table result row format delimited fields terminated by '\t' as select t1.account, t1.income, t1.expenses, t1.surplus, t2.name from user_info t2 join (select account, sum(income) as income, sum(expenses) as expenses, sum(income-expenses) as surplus from trade_detail group by account) t1 on(t1.account = t2.account);

載入本地檔案到資料表中

load data local inpath '/home/hadoop/data/student.txt' overwrite into table student;

load data local inpath '/home/hadoop/data/user_info.doc' overwrite into table user_info;

建立外部表，建立外部表的一般情況指的是：先有檔案存放著資料，之後我們再來建立表，也就是說建立一張表，然後指向這個有資料的目錄。以後只要是向這個目錄中上傳符合格式的資料會被自動裝在到資料庫表中，因為在metastore（後設資料）會記錄這些資訊

create external table t_detail(id bigint, account string, income double, expenses double, time string) ) row format delimited fields terminated by '\t' location '/hive/td_partition';

建立分割槽表，一般用於資料量比較大的情況下， partitioned by (logdate string)用來指定按照什麼進行分割槽

create external table t_detail(id bigint, account string, income double, expenses double, time string)  row format delimited fields terminated by '\t' location '/hive/td_partition' partitioned by (logdate string);

將mysql中的資料直接儲存到Hive中

sqoop export --connect jdbc:mysql://192.168.8.103:3306/hmbbs --username root --password hadoop --export-dir '/user/hive/warehouse/pv_2013_05_31/000000_0' --table pv

基本的插入語法

insert overwrite table tablename [partiton(partcol1=val1,partclo2=val2)]select_statement from t_statement
insert overwrite table test_insert select * from test_table;

更新表的名稱

hive> alter table source RENAME TO target;

新增新一列

alter table invites add columns (new_col2 INT COMMENT 'a comment');

刪除表：

DROP TABLE records;

刪除表中資料，但要保持表的結構定義

dfs -rmr /user/hive/warehouse/records;

顯示所有函式

show functions;

檢視函式用法

describe function substr;

內連線

SELECT sales.*, things.* FROM sales JOIN things ON (sales.id = things.id);

檢視hive為某個查詢使用多少個MapReduce作業

Explain SELECT sales.*, things.* FROM sales JOIN things ON (sales.id = things.id);

外連線

SELECT sales.*, things.* FROM sales LEFT OUTER JOIN things ON (sales.id = things.id);
SELECT sales.*, things.* FROM sales RIGHT OUTER JOIN things ON (sales.id = things.id);
SELECT sales.*, things.* FROM sales FULL OUTER JOIN things ON (sales.id = things.id);

建立檢視

hive> CREATE VIEW valid_records AS SELECT * FROM records2 WHERE temperature !=9999;

檢視檢視詳細資訊

hive> DESCRIBE EXTENDED valid_records;

node 常用命令總結
2019-03-26
Git常用命令總結
2018-09-07
Git
Docker 常用命令總結
2020-08-04
Docker
Kafka 常用命令總結
2019-04-23
Kafka
Linux常用命令總結
2024-07-25
Linux
docker常用命令總結
2023-03-01
Docker
hive基礎總結(面試常用)
2019-02-11
Hive面試
Hive所有的配置總結轉載
2020-11-23
Hive
linux總結及常用命令
2018-07-18
Linux
Spring boot常用命令總結
2024-03-08
Spring Boot
Linux 程式管理常用命令總結
2019-03-28
Linux
console常用命令總結筆記
2019-04-19
筆記
【Hadoop篇】--Hadoop常用命令總結
2018-03-07
Hadoop
Hive表小檔案合併方法總結
2020-10-17
Hive
Git常用命令總結（超實用）
2022-11-14
Git
開源容器 Podman 常用命令總結！
2022-04-24
Hive常用效能優化方法實踐全面總結
2021-01-25
Hive優化
Redis | Redis常用命令及示例總結（API）
2021-12-03
RedisAPI
MySQL基礎知識和常用命令總結
2020-04-28
MySql
Git 常用命令總結，將會持續更新
2021-05-06
Git
Hive常用命令,快鍵和關鍵詞
2018-03-14
Hive
Hive中靜態分割槽和動態分割槽總結
2021-03-31
Hive
Hive（總）看完這篇，別說你不會Hive！
2020-09-24
Hive
【Hadoop】pyhton連結hive
2018-06-27
HadoopHive
恆訊科技總結整理：mysql資料庫常用命令
2021-07-27
MySql資料庫
Git常用命令總結及一些問題思考
2021-09-09
Git
MySQL檢視錶和清空表的常用命令總結
2021-09-09
MySql
Linux常用命令總結，這些一定要知道!
2020-12-31
Linux
npm常用命令彙總
2018-06-11
NPM
linux 常用命令彙總
2019-04-22
Linux
ffmpeg常用命令彙總
2024-07-02
Mysql常用命令彙總
2021-09-09
MySql
MySQL 常用命令彙總
2024-12-17
MySql
windows下Hive搭建踩坑彙總
2023-05-17
WindowsHive
MongoDB常用命令彙總(一)
2018-11-27
MongoDB
【Git】git常用命令彙總
2024-10-06
Git
DB2常用命令彙總
2020-02-24
DB2
hive-3.0.0 版本中遇到的bug 彙總
2024-03-04
Hive
javaSE總結（轉+總結）
2020-08-16
Java

Hive常用命令總結

相關文章