Install VirtualBox Guest Additions on OEL | Kernel Headers Not Found For Target Kernel
Update all packages Install below Package Reboot Server Run the vbox edition Go to virtual box setting and change the display resolution
In this blog , we are going to see how to clone database using rman backup. SOURCE SERVER Take full db backup on source server Move source db backup from source to target server Target Server Create pfile from source and edit to target Create required directory structure on target server Start target database in […]
Connection from Spark to Oracle Database Download ojdbc jar connector Code to connect oracle to spark Spark submit with jar file Mysql Database with Insert statement Spark Code to connect to mysql database Write Spark DF to MYSQL database
–define or –hivevar Options — database Option -s, -e Options, Environment variables & Redirecting Output to File Connecting Remote Hive Server Running Queries from file Hive Batch Mode Commands
Spark repartition() vs coalesce() – repartition() is used to increase or decrease the RDD, DataFrame, Dataset partitions whereas the coalesce() is used to only decrease the number of partitions in an efficient way. Example 2 Rdd Coalesce Rdd Repartition
Install the Oracle virtual box Download the HDP setup from below link https://www.cloudera.com/downloads/hortonworks-sandbox.html URL to HDP sandbox http://localhost:4200/ HDP dashboard sandbox-hdp.hortonworks.com:8080/#/main/dashboard/metrics MySQL userid – root password – hortonworks1
Mapper Programe Reducer Programe Driver Programe Run Mapreduce Programe
Mapper Code Reducer Code Driver Code Add external Jar Files Input file Run Mapreduce programme Mapreduce Wordcount output
sample data Analytical data sample Load sample data to hive table RANK() Function DENSE_RANK() Function ROW_NUMBER()