When you manually modify the partitions directly on HDFS, you need to run MSCK REPAIR TABLE to update the Hive Metastore. Not doing so will result in inconsistent results.
SELECT doesn’t show the renamed partition
SHOW PARTITIONS still shows the older partition
DESCRIBE FORMATTED zipcodes PARTITION(state='PR');
SHOW TABLE EXTENDED LIKE zipcodes PARTITION(state='PR');
Static Partition in HIve
Features of Static Partitioning
Partitions are manually added so it is also known as manual partition
Data Loading in static partitioning is faster as compare to dynamic partitioning so static partitioning is preferred when we have massive files to load.
In static partitioning individual files are loaded as per the partition we want to set.
where clause is used in order to use limit in static partition
Altering the partition in the static partition is allowed whereas dynamic partition doesn’t support Alter statement.
Let's create a table 'student' with the following attributes (student_name, father_name, and percentage ) and partition it using 'section' in our default database.
CREATE TABLE student(student_name STRING ,father_name STRING ,percentage FLOAT)
partitioned by (section STRING)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ',';
Create seperate file for section A, section B,section C and section D.
Loading data from student_A partitioned with section 'A'
LOAD DATA LOCAL INPATH '/home/hdoop/Documents/student_A' INTO TABLE student
partition(section = "A");
Loading data from student_B partitioned with section 'B'
LOAD DATA LOCAL INPATH '/home/hdoop/Documents/student_B' INTO TABLE student
partition(section = 'B');
Loading data from student_C partitioned with section 'C'
LOAD DATA LOCAL INPATH '/home/hdoop/Documents/student_C' INTO TABLE student
partition(section = 'C');
Loading data from student_D partitioned with section 'D'
LOAD DATA LOCAL INPATH '/home/hdoop/Documents/student_D' INTO TABLE student partition(section = 'D');
Now go to your HDFS(/user/hive/warehouse/) and check the student table to see how the partitions are made.
Partitioning an external table
CREATE EXTERNAL TABLE customer_external(id STRING, name STRING, gender STRING, state STRING) PARTITIONED BY (country STRING);
Now a partition can be added to the EXTERNAL table, using the ALTER TABLE ADD PARTITION command:
ALTER TABLE customer_external ADD PARTITION(country='UK') LOCATION '/user/hive/warehouse/customer/country=UK'