Master in Data Analyst | Join Free Webinar on 15 Sep 2025 at 7 PM IST | Register for Free Demo

Managing Cluster Nodes

Breadcrumb Abstract Shape
Breadcrumb Abstract Shape
Breadcrumb Abstract Shape
Breadcrumb Abstract Shape
Breadcrumb Abstract Shape
Breadcrumb Abstract Shape
  • User AvatarPradip
  • 25 Jul, 2025
  • 0 Comments
  • 1 Min Read

Managing Cluster Nodes

1. Cluster Node Overview:

  1. Definition: A cluster node in Oracle RAC refers to an individual server or instance
    that is part of the larger Oracle RAC cluster. Each node runs its own Oracle instance,
    but all instances work together to serve a single database.
  2. Node Components: Nodes include Oracle Clusterware, Automatic Storage
    Management (ASM), and the Oracle Database instance.

2. Adding Nodes to the Cluster:

  1. Preparation: Ensure the new node meets hardware and software requirements, and
    has proper network configuration (public, private, and VIP).
  2. Installation: Use the addnode.sh script or Oracle Universal Installer (OUI) to add a
    new node to the existing cluster. This involves installing the Grid Infrastructure
    software on the new node.
  3. Post-Installation: After adding the node, configure the Oracle Database on the new
    node using the Database Configuration Assistant (DBCA) to extend the existing RAC
    database.

3. Removing Nodes from the Cluster:

  1. Preparation: Before removing a node, ensure that no active sessions or workloads
    are running on it.
  2. Removing: Use the delnode.sh script or Oracle Universal Installer (OUI) to remove
    the node from the cluster.
  3. Post-Removal: Reconfigure the cluster to ensure the remaining nodes handle the
    workload properly. Adjust services and load balancing configurations if necessary.

4. Node Eviction:

  1. Concept: Node eviction occurs when Oracle Clusterware forcibly removes a node
    from the cluster due to a failure or split-brain scenario. This is done to protect the
    integrity of the database.
  2. Reasons for Eviction: Common reasons include network issues, hardware failures,
    or resource starvation.
  3. Handling Evictions: Review logs (e.g., alert.log, crsd.log, cssd.log) to
    diagnose the cause of the eviction and take corrective actions to prevent future
    evictions.

5. Node Maintenance:

  1. Planned Maintenance: If you need to perform maintenance on a node, you can stop
    the database instance on that node using srvctl and then perform the necessary
    tasks.
  2. Unplanned Maintenance: In case of unexpected failures, use Clusterware tools
    (crsctl, srvctl) to relocate resources and ensure high availability.
  3. Rebooting Nodes: When rebooting a node, ensure proper shutdown and startup of
    Oracle services using crsctl or srvctl to avoid cluster disruptions.

6. Monitoring Cluster Nodes:

  1. Monitoring Tools: Use Oracle Enterprise Manager (OEM), Cluster Health Monitor
    (CHM), and tools like crsctl, srvctl, and oswatcher to monitor node performance,
    resource usage, and availability.
  2. Key Metrics: Monitor CPU, memory, disk I/O, network latency, and interconnect
    performance across all nodes to ensure optimal functioning.