Sun Microsystems

Viewing Queue Details

You access the Queue Details page by clicking the Queue button in the Summary Status table on the main Monitor page.

Figure 1-17 Queue Details Page

Note that this table provides information on all queue instances on the currently selected master host, including instances on hosts that were not added by the GEMM framework. The information appears in groups of ten rows at a time, and you can page back and forth between the groups.

Interpreting Data

For each queue instance, there are columns for the queue instance name, the status, the total number of slots, and the number of used slots. The status is indicated by a colored circle and icon similar to those for the Job Alerts previously described. The only additional feature is a green icon to indicate queue instances that have no alert conditions. Clicking the Back icon in the table header returns you to the Monitor Grid main page.

Sorting Data

By default, rows display alphabetically by queue instance name, but you can use any column whose header is written in white to change the ordering of the rows. Clicking a column header sorts the rows according to the values in that column; clicking the same column header again reverses the sort. The sort order is preserved across pages when you click a pagination button.
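The sorting and paging behavior described above can be sketched in a few lines of Python. This is only an illustration of the behavior, not the GEMM implementation: the `QueueRow` type, its field names, and the sample host names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class QueueRow:
    # Hypothetical row type; field names mirror the table columns.
    name: str
    status: str
    total_slots: int
    used_slots: int


PAGE_SIZE = 10  # the page shows rows in groups of ten


def sort_rows(rows, column="name", descending=False):
    """Return a new list sorted by one column.

    `descending=True` models the second click on the same header.
    """
    return sorted(rows, key=lambda r: getattr(r, column), reverse=descending)


def page(rows, index):
    """Return the rows for zero-based page `index`."""
    start = index * PAGE_SIZE
    return rows[start:start + PAGE_SIZE]


# 25 made-up queue instances, sorted once and then paged.
rows = [QueueRow(f"all.q@node{i:02d}", "ok", 4, i % 5) for i in range(25)]
by_used = sort_rows(rows, "used_slots", descending=True)
second_page = page(by_used, 1)
```

Because the list is sorted once and then sliced, moving between pages keeps the chosen sort order, which matches the behavior of the pagination buttons.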

Viewing Additional Details

The final column of each row has an Inspect icon. Clicking this icon displays a table with the full details for that queue instance. The final entry in this table shows the timestamp when the data was obtained. For information on the meaning of the other table entries, consult the N1 Grid Engine 6 Administration manual. Clicking the Back icon for this table returns you to the Queue Details page.

Viewing Host Details

You access the Host Details page by clicking the Host button in the Summary Status table on the main Monitor page.

Figure 1-18 Host Details View

This page displays a table with the state of all the compute hosts that are members of the grid. The title of the table also indicates which host is currently chosen as the Proxy Host.

Note that this table has information on all compute hosts reporting to the currently chosen master host, including those that were not added by the GEMM framework.

Interpreting Data

The information appears in groups of ten rows at a time, and you can page back and forth between the groups. For each host, there are columns for the Hostname, Architecture, Load per CPU, Memory in use, Total Memory, and Swap Space in use. The status is also indicated by a colored circle and icon, similar to those in the Host Alerts table, with an additional green icon to indicate hosts that have no alert conditions. Clicking the Back icon in the table header returns you to the Monitor Grid main page.

Sorting Data

By default, rows display alphabetically, but you can use any column whose header is white to change the ordering of the rows. Clicking a column header sorts the rows according to the values in that column; clicking the same column header again reverses the sort. The sort order is preserved across pages when you click a pagination button.

Viewing Additional Details

The final column of each row has an Inspect icon. Clicking this icon displays a table with the full details for that host. The final entry in this table shows the timestamp when the data was obtained. For information on the meaning of the other table entries, consult the N1 Grid Engine 6 Administration manual. Clicking the Back icon on this table returns you to the Host Details page.

Viewing Grid Engine Daemon Logs

You access the Grid Engine Daemon Logs page by clicking the Daemons button in the Summary Status table on the main Monitor page.

Figure 1-19 Grid Engine Daemons Log View

The Logs page contains a table that displays the names of all compute hosts that were deployed by GEMM, plus the name of the master host if it was deployed by GEMM.

Two additional columns are also shown. The first column, labeled Master, contains an Inspect icon for the master host. The second column, labeled execd, contains an Inspect icon for each compute host. Clicking these icons lets you retrieve the actual log message files.


Note - If the master host was not deployed by GEMM, no host in the table will have the Inspect icon for the Qmaster column. Similarly, if there are compute hosts that were not deployed by GEMM, these hosts will not appear in this table. Clicking the Back icon in the table header returns you to the Monitor Grid main page.


Retrieving Log Message Files

Figure 1-20 Example Log Message File

Clicking an Inspect icon retrieves and displays the qmaster or execd daemon messages file for the corresponding host. A progress bar shows the progress of the retrieval. When the Done button appears, clicking it displays the contents of the chosen messages file, with each line in its own row of a table. Rows display 25 at a time, and you can page through them.

The rows display in reverse chronological order, so that the most recent message appears at the top of the list. Clicking on the Back icon for this table returns you to the Grid Engine Daemon Logs page. For more information on daemon messages, see the N1 Grid Engine 6 Administration manual.

Interpreting Messages

The first column of this table shows a colored circle and icon to indicate the severity of the message. A green circle indicates a message of type Info. A yellow circle indicates a message of type Warning or Critical. A red circle indicates a message of type Error. The second column shows the timestamp of the message, and the third column shows the actual text of the message.
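As a rough illustration of how the log table is laid out, the following Python sketch maps each message type to its circle color and orders the rows with the most recent message first. The tuple layout, the timestamp format, and the sample message text are assumptions made for this example only; they are not taken from an actual Grid Engine messages file.

```python
# Message type -> circle color, as described for the log message table.
SEVERITY_COLOR = {
    "Info": "green",
    "Warning": "yellow",
    "Critical": "yellow",
    "Error": "red",
}


def to_rows(messages):
    """Build table rows (color, timestamp, text), newest message first.

    Each input message is assumed to be a (timestamp, type, text) tuple
    whose timestamp string sorts chronologically.
    """
    newest_first = sorted(messages, key=lambda m: m[0], reverse=True)
    return [(SEVERITY_COLOR[kind], stamp, text)
            for stamp, kind, text in newest_first]


# Made-up sample data.
messages = [
    ("2005-01-01 09:00:00", "Info", "example message A"),
    ("2005-01-01 09:05:00", "Error", "example message B"),
    ("2005-01-01 09:02:00", "Critical", "example message C"),
]
rows = to_rows(messages)
```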

Viewing Cluster Queues

Figure 1-21 Cluster Queues Page

This table shows a summary of the state of all the cluster queues configured on the grid, indicating the number of slots in each state. For information on cluster queues, see the N1 Grid Engine 6 Administration Guide.

Viewing Host Alerts

Figure 1-22 Host Alerts Page

This table shows all hosts where the threshold for either load or memory has been crossed. There are two types of alerts, each indicated by a different colored circle and icon.

A warning alert is indicated by a yellow icon. This alert displays if the load goes above the load warning threshold or the memory goes below the memory warning threshold.

A critical alert is indicated by a red icon. This alert displays if the load goes above the load critical threshold or the memory goes below the memory critical threshold.

The Host Alerts table is empty if no hosts have crossed any threshold. You configure the values for the load and memory warning and critical thresholds on the Settings page.
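The threshold rules above can be summarized in a short Python sketch. It is an illustration under stated assumptions, not the product's code: the `Thresholds` type and its field names are hypothetical, the memory value is treated as a single number in whatever unit the Settings page uses, and checking the critical thresholds before the warning thresholds is an assumption, since the text does not state which alert takes precedence.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Thresholds:
    # Hypothetical container for the four values set on the Settings page.
    load_warning: float
    load_critical: float
    memory_warning: float
    memory_critical: float


def host_alert(load: float, memory: float, t: Thresholds) -> Optional[str]:
    """Return "critical", "warning", or None if no threshold is crossed.

    Load alerts fire when the value rises above a threshold;
    memory alerts fire when the value falls below a threshold.
    """
    # Assumption: a critical condition wins over a warning condition.
    if load > t.load_critical or memory < t.memory_critical:
        return "critical"  # red icon
    if load > t.load_warning or memory < t.memory_warning:
        return "warning"   # yellow icon
    return None            # host does not appear in the Host Alerts table


# Made-up threshold values for illustration.
example = Thresholds(load_warning=1.0, load_critical=2.0,
                     memory_warning=512, memory_critical=128)
```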

Viewing Queue Alerts

Figure 1-23 Queue Alerts Page

This table shows queue instances that are not in the usual running state. There are three types of alerts, each indicated by a different colored circle and icon.

  • A red icon indicates the queue instance is in either the Unknown or Error state.

  • A yellow icon indicates the queue instance is in either an Alarm or Suspended state.

  • A gray icon indicates the queue instance is in a Disabled state.

    The exact state of the queue instance is also given in the Status column. For more information on queue instance states, see the N1 Grid Engine 6 Administration manual.

Viewing Job Alerts

Figure 1-24 Job Alerts Page

This table displays grid jobs that are not in the usual running state. There are two types of alerts, each indicated by a different colored circle and icon.

  • A red icon indicates the job is in an Error state.

  • A yellow icon indicates the job's pending time has exceeded the pending time threshold.

    You configure the value for the pending time threshold on the Settings page. For more information on job states, see the N1 Grid Engine 6 Administration manual.
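The two job alert rules can be sketched as a small Python function. The function name, the state string, and the use of seconds as the time unit are assumptions for illustration; the order of the checks is also an assumption, since the text does not say which icon appears when both conditions hold.

```python
from typing import Optional


def job_alert(state: str, pending_seconds: float,
              pending_threshold: float) -> Optional[str]:
    """Return the icon color for a job, or None if it raises no alert.

    `pending_seconds` is how long the job has been waiting to run;
    `pending_threshold` stands in for the value set on the Settings page.
    """
    if state == "Error":
        return "red"
    if pending_seconds > pending_threshold:
        return "yellow"
    return None  # job does not appear in the Job Alerts table
```

For example, with a hypothetical threshold of 600 seconds, `job_alert("Pending", 900, 600)` yields a yellow alert, while the same job at 300 seconds yields none.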

Using Grid Active Monitor

You can quickly see the status of the grid by using the SCS Active Monitor feature. Choose Station Settings > Active Monitor and scroll down the page to the Base Services table shown in the following figure.

Figure 1-25 Grid Active Monitor Table

When the status of the grid changes due to an event like a queue alert, the button next to the Grid Engine entry changes color in the following way:

  • Green: N1GE is up and running fine.

  • Yellow: the SCS cannot contact the proxy host or cannot obtain monitoring information from it, but it is still possible that the master is running.

  • Red: the proxy host indicates that the master is down.

  • Gray: N1GE is not installed anywhere.
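The color rules in the list above amount to a short decision sequence, sketched here in Python. The function and parameter names are hypothetical, and the order in which the conditions are tested is an assumption; the text only states what each color means.

```python
def grid_engine_button(installed: bool, proxy_reachable: bool,
                       master_up: bool) -> str:
    """Return the button color for the Grid Engine entry.

    `master_up` is only meaningful when the proxy host can be reached,
    because the proxy host is what reports the state of the master.
    """
    if not installed:
        return "gray"    # N1GE is not installed anywhere
    if not proxy_reachable:
        return "yellow"  # no monitoring data; the master may still be running
    if not master_up:
        return "red"     # the proxy host reports that the master is down
    return "green"       # N1GE is up and running
```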
