Professional Documents
Culture Documents
Copyright 1996, 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012, 2013 EMC Corporation. All Rights
Reserved. EMC believes the information in this publication is accurate as of its publication date. The information is subject to change
without notice.
THE INFORMATION IN THIS PUBLICATION IS PROVIDED AS IS. EMC CORPORATION MAKES NO REPRESENTATIONS OR WARRANTIES OF
ANY KIND WITH RESPECT TO THE INFORMATION IN THIS PUBLICATION, AND SPECIFICALLY DISCLAIMS IMPLIED WARRANTIES OF
MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.
EMC2, EMC, Data Domain, RSA, EMC Centera, EMC ControlCenter, EMC LifeLine, EMC OnCourse, EMC Proven, EMC Snap, EMC SourceOne,
EMC Storage Administrator, Acartus, Access Logix, AdvantEdge, AlphaStor, ApplicationXtender, ArchiveXtender, Atmos, Authentica,
Authentic Problems, Automated Resource Manager, AutoStart, AutoSwap, AVALONidm, Avamar, Captiva, Catalog Solution, C-Clip, Celerra,
Celerra Replicator, Centera, CenterStage, CentraStar, ClaimPack, ClaimsEditor, CLARiiON, ClientPak, Codebook Correlation Technology,
Common Information Model, Configuration Intelligence, Configuresoft, Connectrix, CopyCross, CopyPoint, Dantz, DatabaseXtender, Direct
Matrix Architecture, DiskXtender, DiskXtender 2000, Document Sciences, Documentum, elnput, E-Lab, EmailXaminer, EmailXtender,
Enginuity, eRoom, Event Explorer, FarPoint, FirstPass, FLARE, FormWare, Geosynchrony, Global File Virtualization, Graphic Visualization,
Greenplum, HighRoad, HomeBase, InfoMover, Infoscape, Infra, InputAccel, InputAccel Express, Invista, Ionix, ISIS, Max Retriever,
MediaStor, MirrorView, Navisphere, NetWorker, nLayers, OnAlert, OpenScale, PixTools, Powerlink, PowerPath, PowerSnap, QuickScan,
Rainfinity, RepliCare, RepliStor, ResourcePak, Retrospect, RSA, the RSA logo, SafeLine, SAN Advisor, SAN Copy, SAN Manager, Smarts,
SnapImage, SnapSure, SnapView, SRDF, StorageScope, SupportMate, SymmAPI, SymmEnabler, Symmetrix, Symmetrix DMX, Symmetrix
VMAX, TimeFinder, UltraFlex, UltraPoint, UltraScale, Unisphere, VMAX, Vblock, Viewlets, Virtual Matrix, Virtual Matrix Architecture, Virtual
Provisioning, VisualSAN, VisualSRM, Voyence, VPLEX, VSAM-Assist, WebXtender, xPression, xPresso, YottaYotta, the EMC logo, and where
information lives, are registered trademarks or trademarks of EMC Corporation in the United States and other countries.
All other trademarks used herein are the property of their respective owners.
Copyright 2013 EMC Corporation. All rights reserved. Published in the USA.
This course covers the purpose and use of the default installed ReportPacks for EMC Smarts.
We explore the ReportPacks for EMC Ionix ITOps, EMC Smarts Events, and EMC Smarts
Health along with a discussion of a majority of the child reports of each ReportPack.
This module focuses on a brief description of the architecture and navigation in the
Watch4net product. Emphasis is on menu navigation, accessing reports and ReportPacks
discussion of the general use of the report user interface.
Reports are a primary use of the information available through the use of Watch4net. They
are built using nodes on the Report Navigation Panel to define output which contains filters
and formulas that define the metrics to be displayed. The actual metrics employed will vary,
and are defined as part of the report definition. Reports may be designed to provide a great
deal of flexibility to the user.
Specifically, Watch4net allows the use of ReportPacks that are made of predefined reports
that are dedicated to specific technologies or vendors. The ReportPacks include built-in
metrics, formulas, and analytics which can be an enormous benefit and saves a great deal of
time and customization effort.
Tables are lists with columns. Lists display child node names and allow you to drill
down to more specific reports.
Graphs are reports that display the selected variables or values computed through
formulas over time.
By default, the ReportPacks for EMC Smarts are installed automatically in the ReportPacks
branch or the Report Navigation Panel. There are three ReportPacks that are specific to EMS
Smarts products. They are the ReportPack for Ionix ITOps (this name will change with the
rebranding of Ionix to EMC Smarts), ReportPack for EMC Smarts Events, and ReportPack for
EMC Smart Health. We will investigate the structure of each of these ReportPacks in this
course.
This module covered an overview of Watch4net reports, discussed the standard report types,
and introduced the three default installation ReportPacks for EMC Smarts products.
This module focuses on the use of the ReportPack for EMC Ionix ITOps to observe application
and IP statistics for devices managed by EMC Ionix Application Connectivity Manager and
Ionix IP Domain Managers. Ionix has been rebranded as EMC Smarts however, that change is
not yet fully implemented in version 6.1u3 of Watch4net.
Administrators may install or move ReportPacks to any desired node in the Report Navigation
Panel. In this case, our administrator has installed all of the element management related
reports to a node titled EMS/NMS. The default location is to place the default ReportPacks
into the ReportPacks node. In this module, we will discuss the reports, both top level and
child that comprise the first of our ReportPacks for EMC Smarts - the EMC ReportPack for
Ionix ITOps.
This ReportPack branches into two primary reports, Ionix ACM and Ionix IP. Ionix ACM
provides pre-defined reports specific to the Ionix Application Connectivity Manager while the
Ionix IP branch provides several reports relating to devices and interfaces that are managed
by EMC Smarts IP domain managers.
The top level report for the Ionix ACM selection is a table listing the devices (with device
type) within the managed ACM domain that have been discovered to have measurable and
reportable Key Performance Indicators (KPIs). These indicators include response times
including maximums and minimums, and availability percentage - both over time and
current.
When a device in the list is selected, two views specific to the device are opened.
On the top is a graph that represents the applications running on that host by response time.
Below that graph is another table listing all the applications with KPIs running on the
selected device and displays the same data as the top level table specific to the selected
device.
The Ionix IP selection opens a top level table that displays all the managed devices collected
from IP domain managers. Just as in the Ionix ACM table, each entry represents an individual
device. Selecting a device drills down into the device specific graphs and tables as we will see
momentarily.
The table navigation bar allows the user to move from page to page of the displayable data.
Additionally, the user can determine the number of entries per displayed page as indicated
on the right side of the bar.
The first four columns of this report allows filtering to refine the display and reduce the
number of displayed devices. To construct a display filter, select the funnel character to the
right of the desired column heading as shown in the Device Name column in this slide. For
additional information on filtering consult the Web Portal Guide.
The right side of the Ionix IP table displays data per device including:
The data field columns display data available for each displayed device. If you are uncertain
how the data is calculated, the blue question mark to the right of the column heading will
display an analysis of how that column is calculated, and if threshold values are defined, it
will show those values as well. This is illustrated in the two yellow help boxes displayed on
this slide.
When you click a device, either in the tree to the left or in the device table, Watch4net will
drill down and display a set of data and graphs specific to the selected device in a scrollable
panel on the right side of the display. The information displayed includes:
Device Type.
Device IP Address.
Device Location.
Device Description.
Additionally, several graphs and tables are presented representing valuable information
about the selected device sampled over the past week. We will look at each of these graphs
individually over the next few pages.
Though the data can be seen in the scrollable report associated with the device, it is often
desirable to view the graphs in a larger individual display so that changes are more clearly
presented. To access each specific child report for the device, you may click on the report in
the device display discussed on the previous page or expand the tree in the device list as
shown here. From here, you can select the desired report, or expand the expandable reports,
to drill down further for more specific information. Through the next few pages, we will
discuss each of the reports available from this point.
The first report is the Availability percentage report. It displays a graph of the percentage of
time the device was available in one hour samples over the past week (7 days). In this graph,
there are threshold values defined for a critical threshold at 95% and a major threshold at
99%.
This report represents the percentage of time the device was reachable. It displays a graph of
the percentage of time the device was reachable in one hour samples over the past week (7
days). In this graph, there are threshold values defined for a critical threshold at 95% and a
major threshold at 99%.
The CPU Utilization is presented as a line graph for each CPU. If multiple CPUs exist, the user
can drill down to each CPU by selecting the appropriate graph or expanding the report tree
and selecting the desired report from the expanded list to open it in a single report graph.
The defined thresholds for CPU utilization are critical at 80% and major at 60%.
The Memory Used (%) Physical line graph presents physical memory usage in hourly
samples over the past week. The defined thresholds are critical at 90% and major at 70%. As
seen here, usage has stayed above the major threshold most of the week and in more recent
days, it has increased above the critical threshold. The data bar in the upper left portion of
the graph, just above the 100%, gives the maximum, minimum, average, and 95th sample
values. Notice the warning that the lack of physical memory can affect the servers capability.
The Memory used (%) Swap line graph presents swap memory usage in hourly samples
during the past week. The defined thresholds are critical at 90% and major at 70%. As seen
here, usage has stayed above 40% for most of the week and in more recent days it has
increased above 50% and is trending toward 60%. Again, the warning that the lack of physical
memory can affect the servers capability is provided to help the user.
Each File System is presented in the Files Systems table report. The information presented is
the name of the server, the files system name, utilization %, current capacity in mega-bytes,
current remaining space in mega-bytes, and the current utilization percentage. Select any file
system in the table or in the tree to the left to drill down to a graph of the file systems usage
in the last day.
The Interfaces Traffic table displays the Interface/Port name, Description, Speed, Incoming
and Outgoing in mega-bits per second, utilization, and availability by percentage if the data is
collected. In some cases, the field will be empty indicating that the data is either not
reported or not collected. The use can select any interface in the table or from the tree list to
observe the data in greater detail.
When an interface or port is selected, information about the adapter, if available, is shown in
three graphs:
Bandwidth (Mbits/s)
Interface Utilization % (In and Out)
Availability
This module covered both the Ionix ACM and Ionix IP reports that are part of the ReportPack
for EMC Ionix ITOps.
This module focuses on the use of the ReportPack for EMC Smarts Events to observe event
related data for devices managed by EMC Smarts managers.
There are two top level entries in the ReportPack for EMC Smarts Events. The Overview, that
displays events by severity, and an All Events table. The Events by Category reports display
events in numerous different categories as collected from the EMC Smarts domain.
The top portion of the Overview report is a stacked bar graph of events for the past month
by severity. Each severity is added to provide the total count with the severities shown as the
stacked blocks in the daily column.
Under the Events by Severity graph, is the All Events table. The all events table displays each
event with the Event state, Device name, Part type, Part, Notified At time stamp, Cleared At
time stamp, and Duration of the event. The All Events table can be filtered by the user on all
fields, with the exception of the event duration. Filtering can be very useful in reducing the
number of events displayed to ease problem solving efforts.
Downtime Calculation.
Real-time View.
Chronic Problem Analysis.
Search for Component.
Each group is expandable. With the exception of the Search for a Component group, all are
further defined with various child reports.
The Events by Category / Downtime Calculation has four predefined child reports:
Availability by Component
Impacted Services
Impacted Business Processes
Impacted Customers
All of these reports have the same format except the base element, so we will use the
Availability by Component report as the example for this group of child reports.
The Events by Category / Real-time View report group has nine predefined child reports
covering events that have occurred in the last day. The reports include acknowledgements,
and currently active events (selectable from a table) on a per device type, severity, source,
event type and events per element. Additionally, the Unassigned Events report lists events
for which no one has taken ownership.
Real-time View / Acknowledged events is a list of Owners (users that have acknowledged
events) and a count of the events they have acknowledged. Clicking on the Owner will drill
down to create a list of the events that the user has acknowledged.
The Real-time View / Currently Active Events (table view) report includes data about the
event, the device the event is about, the part type the event is about, the part description,
and the time stamp for when this event was last modified. Filtering can be employed on each
column to refine the list of events the user is working with.
The Real-time View / Currently Active Events per Device Type is a TopN type chart broken
down by device type. Selecting a device type bar will drill down to another graph that shows
the devices and actual count of events that make up the bar. For example, the Switch device
type bar shows a 7.09 percent indicating that switch events comprise 7.09 percent of the
opened events.
If we then select the Switch bar, we will drill down into Currently Active Events per Device
Type /Switch report that shows there were four switches that contributed to that 7.09
percent and to what degree each switch contributed. In this case, switch W4N-DS-QA caused
4 events, or 44.44 percent of all the switch events.
The Currently Active Events per Severity graph shows the percentage of total active events
broken down by severity. For example, critical events comprise 1.57 percent of total active
events at the time this graph was captured. If you want to see what events are included in
this percentage, click on the bar and Watch4net will drill down to the data.
This report displays a list of Event types, the number of occurrences of each type, and when
the most recent event of each type occurred. If the list is large, it is possible that the user can
filter on the event type to refine the list making it easier to perform analysis. We can drill
down further by clicking on an event type. In this case, we will choose the Down event type.
The resulting display is a list of detailed Down events. In this case, there is only one.
However, the list could be large, so it is beneficial to note that this list can be filtered on any
field to assist in problem solving analysis.
The detailed event includes the fields Severity, Category, Event, Device, Part Type, Part
Description, Last Modified At, Impact, Owner, and Trouble Ticket ID.
The Currently Active Events per Severity table view lists all currently active events and ranks
them by severity. Each event is displayed including:
Rank numerical value associated with the severity.
Severity the severity value collected from the smarts manager.
Event identifies the type of event.
Device the name of the device that the event is about.
Part type classification of the object type that the event is reporting.
Part name of the part that this event is about.
Last Notified At time stamp for the most recent notification for this event.
With the exception of the Rank field, all fields may be filtered and sorted providing a great
deal of flexibility in the viewing of this list. The user can refine the list as necessary to aid in
the analysis of problems.
Clicking on the Currently Active Events per Source report will display a table in the report
panel for each source domain. You can scroll through the list of tables and browse in that
manner or you can choose to drill into any source domain by either selecting the domain in
the report tree or by clicking on the table title for the desired domain.
The resulting table for the selected domain contains the same fields that we saw earlier in
the detailed event view. Just as with that view, all fields can be filtered and sorted to allow
maximum flexibility in this report.
The Real-time View / Currently Active Events per Type presents a TopN Bar Graph that
identifies the percentage of total events that each event type represents. For example, in this
graph, the OutOfRange events are 3.15 percent of all currently active events. Placing the
mouse pointer over the bar momentarily will bring up a box identifying the number of events
of this type and the total currently active events as illustrated in this slide. We can drill down
by clicking the bar for the OutOfRange event to see the actual details of the events for the
selected type.
Again, the detailed event is presented and includes the fields Severity, Category, Event,
Device, Part Type, Part Description, Last Notified At, Impact, Owner, and Trouble Ticket ID.
The Events per Element view is a table that lists events in the last day by the class of the
element that the event is about. It identifies the Element Class, Instance Name, and the
number of Occurrences of that class in the last day. This report can be sorted and / or filtered
on the first two fields to aid in refining the view. Clicking on an entry will drill down to a list
that will reveal the actual events for the selected element.
The resulting report is a table of the actual events that have occurred in the last day for the
previously selected event. This report allows both sorting and filtering on all fields as
necessary.
The Unassigned Events report creates a table of currently active events for which there is no
ownership assigned. It is a detailed report and thus includes the same fields as all the other
detailed reports we have looked at in this ReportPack. Also, as previously stated, all fields
may be sorted and filtered to refine the report as necessary.
The next group of reports in the Events by Category group are the Chronic Problems Analysis
group of child reports. In this group we will look at reports that show:
Devices With Most Events by Source
Elements With Most Root Cause Events by Source
Events by Cities
Events by Devices
Events on EMC Smarts Domains
Mean Time to Resolve (by Operator)
Operator Workload
Post Mortem
Recurring Events
Root Cause Problems per Day
Root Cause Problems per Source per Day
Any of these reports may be selected from either the report tree on the left of the display or,
when the Chronic Problems Analysis table is displayed in the report panel, they can be
selected from the table of reports on display.
The first report, Devices With Most Events by Source, displays the source domains and the
percentage of events reflecting the percentage of total events that each domain owns. By
hovering the mouse pointer over the bar, the user will see a dialog box inside the red box in
this display. This specifies the number of events of the total number of events that are from
the selected source. In this example, the domain manager IP-Analysis is the source of 7.26
percent of events. This also equals 23 of the 317 total events that have occurred in the last
day.
Clicking on the desired source domain bar will allow Watch4net to drill down into that
domains report as we will see on the next page.
This table, that is a list of elements within the selected domain for which events have been
collected in the last day, shows the elements and the number of occurrences of events per
element. The user may filter on the Source column to filter or sort the elements. Choosing an
element will drill down the final degree to display the events that make up the occurrence
count.
The child report at the lowest point of this tree is again the Active Events report which
provides specific details of the event, the device, part type, part, and last modified at data for
the events. By this time, the user has identified the event, the device the event is about, and
the source domain of the event by drilling through this set of reports.
Similar to the first report in this group, this report displays a TopN Graph of the percentage of
Root Cause events by Source for the last month. The extra time frame is useful in identifying
the chronic events over the past month. Just as before, using the mouse pointer, the user is
able to expose the counts of events associated with the percentage displayed. In the
example here, the INCHARGE-SA domain is the source of .25 percent of the root cause
events which is 32 of 13,027 root events in the past month. The underlying tree of child
reports is exactly the same as the Devices With Most Events By Source which we have
already seen, so we will go on to the next report.
Next is the Events by Cities graph. In this case, all of the events for the day at the time of this
capture originated in Paris. To see the list of Cities, expand the report tree on the left side of
the display. Hovering over the bar in the graph show a count of events for the selected city as
a portion of the whole. As you can see in this case, the count is 430 of 430. As in the previous
reports, clicking the bar will drill down to a list of elements for which events were collected.
Clicking on an element will display the events for that element, just as in the previous report
tree examples.
The top level report displays both the graph of Occurrences per Device and a table and
duration of events. Since there may be more events in a day than can be realistically
graphed, and the currently defined graph does not permit user filtering, the table of events
will most likely be useful. The table may be filtered on all, but the Duration field and may also
be sorted as needed. An event may be selected from the table to drill down to a detailed
report of all iterations of the event.
The Events on EMC Smarts domains table is a detailed Active Events table of events that are
specific to EMC Smarts domains. The detailed list contains the normal detailed report fields.
The Mean Time to Resolve report lists the Operators base upon ownership along with the
number of events the operator owns and the average duration of the owned events.
The Operator Workload is a list of operators, including a row for unassigned events, along
with the count of Active Events owned and the Max Impact of those events. Clicking on an
operator drills down to an Active Events (detailed) report of the active events owned by the
selected operator.
The Post Mortem report is a list of events that are cleared and have a duration of greater
than 15 minutes.
The Recurring Events graph shows event types along with the percentage of total events in
the last day. In the example here, Down events are 5.35 percent of the events for the day.
Using the mouse-over technique, we can see that the actual number of Down events is 23 of
the 430 events for the day. Clicking the bar will allow drilling down to a detailed report of
those 23 events we have seen before.
The Root Cause Problems per Day report lists the dates in the last month and the count of
root cause events for each day. Clicking on a date will open a detailed report of the events
that comprise the count for that day.
The Root Cause Problems Per Source Per Day is an expansion of the previous report to
include the source domain of the events. This allows the user to first choose the domain
before looking at the events for that domain on a specific date. Once the domain and date
are selected, this report acts in the same was as the previous report.
This last report in the ReportPack for EMC Smarts Events is the Search for a Component
report. It presents a list with Device Name and Device Class fields. Filtering and or sorting
may be performed on either to find the desired Device.
Selecting a device in the Search for a Component list opens two reports, each of which we
have already seen. A report of Active Events (detailed) for the selected device and another
table that will allow the user to look at past events for the selected device on a per date
basis.
This module covered ReportPack for EMC Smarts Events. We explored the Overview reports
which include the Event Occurrence by Severity and the All Events reports. Then we walked
through the Events by Category report and discussed the drill down into those reports from
an operator perspective.
This module focuses on the use of the ReportPack for EMC Smarts Health to observe the
health of EMC Smarts domain managers. Information is available per domain type, per host,
and in a list of hosts and domains.
The ReportPack for EMC Smarts Health consists of three primary reports:
Per Domain Type lists Domain Types, number of each type, and availability.
Per Host lists the Hosts that support the domain managers and their availability.
List of Hosts and Domains list of hosts and their supported domains. Child reports are
defined to provide data on thread usage, CPU utilization, downtime, number of reboots
and availability over the last day.
The Per Domain Type report lists in a table the detected EMC Smarts domains and the
number of instances of each type detected as well as availability - both current and average
over the last day. Clicking a Domain Type allows the user to drill down into a table of
domains of the selected type along with availability and CPU utilization data for that domain.
Availability % Graph
CPU Utilization Graph
Selecting a Domain entry in the Domain Type Table allows the user to drill down even
further into the selected domain manager.
There are two branches of reporting under each of the listed domain types. Here, the ACM
Domain has been expanded to show the entire tree of reports available. Note that there are
two distinct groups of reports, the Domain Type and Domain Performance groups.
Within the Domain Type group there is an Overview report and a detailed view. The detailed
view is further broken down into four reports detailing the health of the Codebook,
Discovery, Object Repository, and the Hosting System.
The Overview table lists the elements of the domain manager and their current and average
health for the last day. By selecting the desired element, each element may be looked at
individually in a graph of the last day.
The Detailed View table lists the same elements of the domain manager. Each element may
be looked at in more detail by drilling down into the desired element as illustrated by the
report tree.
The Correlation Setting of the Codebook shows the current correlation interval for the
selected domain. In this case it is 30 seconds. The Process Duration report lists the processes
and their respective durations as an aggregated average over the last day.
The Discovery report of the Detailed View reports statistics on the domain's discovery
process. Each of the metrics are displayed along with their current values as an aggregate
average over the last day.
The right side of the Ionix IP table displays data per device including two tables:
The Object Repository table lists the following ICIM objects for the selected domain
manager.
o
ICIM_Instrumentation
ICIM_ManagedElement
ICIM_MetaObject
MR_Object
UnitaryComputerSystem
The System view consists of an overall CPU Usage and Uptime table which includes:
CPU Time time that the CPU is used.
Uptime amount of time the system has been up.
Ratio (%) the percentage of total uptime that the CPU has been used for this domain.
There are also four graphs. Two of the graphs show CPU Utilization and Treads. The other
two graphs detail memory usage - both physical and virtual.
The Overall CPU Usage % is a percentage of time that the CPU is used by the domain as
opposed to the total uptime averaged over the last day.
This two report graphs show the last days CPU Utilization averaged and the average number
of threads used, also over the last day.
The System Memory Usage graphs for the system hosting the selected domain show the use
of system memory during the last day for both Physical and Virtual memory in megabytes.
The Domain Performance reports reflect the Availability % of the domain selected and the
CPU utilization for both the host that the manager is running on and the percentage of use
that the domain manager used in the last day.
The Per Host table displays the name of every server that hosts one of the domains whose
health is monitored. The table includes the host name, the number of domains on the host,
and availability (both average percentage and current sample percentage). Each host can be
selected to drill down into reports associated with that host and the domains it supports.
As seen here, the Per Host tree of reports is nearly identical to that of the Per Domain Type
reports that we just explored. The one exception is the Event Log report that provides an
overview of the selected host and a detailed view.
The list of Hosts and Domains is a list of hosts where each contains:
Host
Domain Name
Version
Threads
CPU Utilization % (current and average for last day)
Downtime Duration in seconds
Number of Reboots
Availability (%) (over period)
Selecting a host in the list allows the user to drill down into more detailed graphs specific to
the Host.
When a host is selected from the List of Hosts and Domains, two graph reports are generated
- Availability % averaged in real time for the last day and CPU Utilization for the domain on
the host averaged over the last day.
This module covered the ReportPack for EMC Smarts Health and discusses the Per Domain
Type, Per Host, and List of Hosts and Domains reports.
This course covered the ReportPacks that support the EMC Smarts domains that are installed
and pre-configured as part of the default installation of Watch4net.
This concludes the training.