EXPLAIN <HiveQL>;
hive> EXPLAIN SELECT * FROM userXX.fact_player_stats LIMIT 10;
OK
ABSTRACT SYNTAX TREE:
(TOK_QUERY (TOK_FROM (TOK_TABREF (TOK_TABNAME user00 fact_player_stats))) (TOK_INSERT (TOK_DESTINATION (TOK_DIR TOK_TMP_FILE)) (TOK_SELECT (TOK_SELEXPR TOK_ALLCOLREF)) (TOK_LIMIT 10)))
STAGE DEPENDENCIES:
Stage-0 is a root stage
STAGE PLANS:
Stage: Stage-0
Fetch Operator
limit: 10
Processor Tree:
TableScan
alias: fact_player_stats
Select Operator
expressions:
expr: playerid
type: string
expr: teamid
type: string
expr: ab
type: int
expr: r
type: int
expr: h
type: int
expr: 2b
type: int
expr: 3b
type: int
expr: hr
type: int
expr: rbi
type: int
expr: g
type: int
outputColumnNames: col0, col1, col2, col3, col4, col5, col6, col7, col8, col9
Limit
ListSink
Time taken: 1.166 seconds
hive> use userXX;
hive> EXPLAIN SELECT A.PlayerID, B.teamID, B.AB, B.R, B.H, B.2B, B.3B, B.HR, B.RBI FROM Master A JOIN BATTING B ON A.playerID = B.playerID;
OK
ABSTRACT SYNTAX TREE:
(TOK_QUERY (TOK_FROM (TOK_JOIN (TOK_TABREF (TOK_TABNAME Master) A) (TOK_TABREF (TOK_TABNAME batting) B) (= (. (TOK_TABLE_OR_COL A) playerID) (. (TOK_TABLE_OR_COL B) playerID)))) (TOK_INSERT (TOK_DESTINATION (TOK_DIR TOK_TMP_FILE)) (TOK_SELECT (TOK_SELEXPR (. (TOK_TABLE_OR_COL A) PlayerID)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) teamID)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) AB)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) R)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) H)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) 2B)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) 3B)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) HR)) (TOK_SELEXPR (. (TOK_TABLE_OR_COL B) RBI)))))
STAGE DEPENDENCIES:
Stage-1 is a root stage
Stage-0 is a root stage
STAGE PLANS:
Stage: Stage-1
Map Reduce
Alias -> Map Operator Tree:
a
TableScan
alias: a
Reduce Output Operator
key expressions:
expr: playerid
type: string
sort order: +
Map-reduce partition columns:
expr: playerid
type: string
tag: 0
value expressions:
expr: playerid
type: string
b
TableScan
alias: b
Reduce Output Operator
key expressions:
expr: playerid
type: string
sort order: +
Map-reduce partition columns:
expr: playerid
type: string
tag: 1
value expressions:
expr: teamid
type: string
expr: ab
type: int
expr: r
type: int
expr: h
type: int
expr: 2b
type: int
expr: 3b
type: int
expr: hr
type: int
expr: rbi
type: int
Reduce Operator Tree:
Join Operator
condition map:
Inner Join 0 to 1
condition expressions:
0 {VALUE._col1}
1 {VALUE._col3} {VALUE._col7} {VALUE._col8} {VALUE._col9} {VALUE._col10} {VALUE._col11} {VALUE._col12} {VALUE._col13}
handleSkewJoin: false
outputColumnNames: _col1, _col38, _col42, _col43, _col44, _col45, _col46, _col47, _col48
Select Operator
expressions:
expr: _col1
type: string
expr: _col38
type: string
expr: _col42
type: int
expr: _col43
type: int
expr: _col44
type: int
expr: _col45
type: int
expr: _col46
type: int
expr: _col47
type: int
expr: _col48
type: int
outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5, _col6, _col7, _col8
File Output Operator
compressed: false
GlobalTableId: 0
table:
input format: org.apache.hadoop.mapred.TextInputFormat
output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
Stage: Stage-0
Fetch Operator
limit: -1
Time taken: 0.679 seconds
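When the plain plan does not show enough detail, Hive also accepts the EXTENDED keyword after EXPLAIN. The following is a minimal sketch that reuses the Master and Batting tables from the join above, with a shortened column list chosen only for illustration. The exact plan text varies between Hive versions, so no output is shown here.

```sql
-- Assumes the userXX database and the Master / Batting tables used above.
USE userXX;

-- Plain EXPLAIN: prints the query plan without running the query.
EXPLAIN
SELECT A.playerID, B.teamID, B.HR
FROM Master A JOIN Batting B ON A.playerID = B.playerID;

-- EXPLAIN EXTENDED: the same plan with additional detail about each operator.
EXPLAIN EXTENDED
SELECT A.playerID, B.teamID, B.HR
FROM Master A JOIN Batting B ON A.playerID = B.playerID;
```

In the join plan above, Stage-1 is the MapReduce job: both table scans emit playerid as the key, and the Join Operator runs on the reduce side. Comparing the two plans side by side is a quick way to see what EXTENDED adds on a given installation.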
DESCRIBE <table name>;
hive> use userXX;
hive> DESCRIBE mysql_data;
OK
id int
name string
Time taken: 0.126 seconds
hive> DESCRIBE master;
OK
lahmanid int
playerid string
managerid int
hofid string
birthyear int
birthmonth int
birthday int
birthcountry string
birthstate string
birthcity string
deathyear int
deathmonth int
deathday int
deathcountry string
deathstate string
deathcity string
namefirst string
namelast string
namenote string
namegiven string
namenick string
weight int
height int
bats string
throws string
debut string
finalgame string
college string
lahman40id string
lahman45id string
retroid string
holtzid string
bbrefid string
Time taken: 0.122 seconds
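DESCRIBE lists only column names and types. As a sketch, assuming the same master table in the userXX database, the EXTENDED and FORMATTED variants also report table-level metadata such as the storage location and input/output formats. The layout of that output depends on the Hive version, so it is not reproduced here.

```sql
-- Assumes the userXX database and the master table described above.
USE userXX;

-- Column list followed by a detailed table information record.
DESCRIBE EXTENDED master;

-- Same metadata, laid out in a more readable sectioned form.
DESCRIBE FORMATTED master;
```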
DESCRIBE FUNCTION <function name>;
hive> DESCRIBE FUNCTION max;
OK
max(expr) - Returns the maximum value of expr
Time taken: 0.082 seconds
hive> DESCRIBE FUNCTION count;
OK
count(*) - Returns the total number of retrieved rows, including rows containing NULL values.
count(expr) - Returns the number of rows for which the supplied expression is non-NULL.
count(DISTINCT expr[, expr...]) - Returns the number of rows for which the supplied expression(s) are unique and non-NULL.
Time taken: 0.111 seconds
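To find out which function names can be passed to DESCRIBE FUNCTION, the built-in list can be printed first. This is a short sketch; the set of functions and the wording of their descriptions depend on the Hive version installed.

```sql
-- List every function name known to this Hive session.
SHOW FUNCTIONS;

-- EXTENDED prints a longer description of the function where one is available.
DESCRIBE FUNCTION EXTENDED max;
DESCRIBE FUNCTION EXTENDED count;
```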