Skip to main content

NetApp Cloud Insights Preview, Part 4: Troubleshooting



Thank you for tuning in once again to my blog series on NetApp Cloud Insights. If you haven't seen the previous entries, here are some links:


In this post, I'm going to cover a really straightforward troubleshooting workflow that you can do within Cloud Insights to help you zero in on the root cause of some VM latency.  We'll start by searching for an asset (a VMware virtual machine in this case) from the search bar at the top of the Cloud Insights dashboard.



We can see a bit of increased latency beginning at approximately 12:00AM that we want to take a look at. On the right-hand side of the screen, Cloud Insights has identified other objects that it believes are correlated with the asset in question. The top correlated object, for example, is the VMware datastore where the VM is located. If we click the checkbox next to the datastore asset, it'll stack the chosen metrics (latency and IOPs for this example) on top of the existing values for the VM. 


You can see from the data above that there is a pretty good chance that increased I/O on the VMware datastore caused the VM latency to increase. 

In this particular case, the SnapMirror schedule for this volume starts at 12:00AM, so this behavior is expected, but it serves to illustrate how useful the data can be for troubleshooting. 

If you notice down in the lower-right, there is a section called "Greedy" - these are resources that are reporting higher-than-normal amounts of IOPs and may be impacting other workloads. It's not shown in the picture, but there is also a "Degraded" section if a troublesome workload is impacting other workloads negatively. 

It's pretty cool how you can layer all this data from multiple systems collected by Cloud Insights together and use it for root cause analysis and troubleshooting with just a few clicks.

This concludes the series on Cloud Insights, but please don't think that what I've covered here is the sum total of what Cloud Insights has to offer - it does SO MUCH more for public cloud monitoring, cost reclamation estimating, VM right-sizing, and more. As you may be aware, Insight 2018 US just wrapped up, where Cloud Insights was officially announced. So if these posts have piqued your interest, head out to cloud.netapp.com and sign up for a free trial of Cloud Insights and see for yourself. 

Thank you for reading!

Comments

Popular posts from this blog

How To: Unjoin NetApp Nodes from a Cluster

Let me paint you a word picture:

You've upgraded to a shiny new AFF - it's all racked, stacked, cabled and ready to rock. You've moved your volumes onto the new storage and your workloads are performing beautifully (of course) and it's time to put your old NetApp gear out to pasture.

We're going to learn how to unjoin nodes from an existing cluster. But wait! There are several prerequisites that must be met before the actual cluster unjoin can be done.


Ensure that you have either moved volumes to your new aggregates or offlined and deleted any unused volumes.Offline and delete aggregates from old nodes.Re-home data LIFs or disable/delete if they are not in use.Disable and delete intercluster LIFs for the old nodes (and remove them from any Cluster Peering relationships)Remove the old node's ports from any Broadcast Domains or Failover Groups that they may be a member of.Move epsilon to one of the new nodes (let's assume nodes 3 and 4 are the new nodes, in th…

NetApp ONTAP 9.3 Simulator Deployment - Part 1

I am going to be doing a few of these simulator/lab posts in an effort to set up an environment that will pave the way for future guides and blog posts. Hopefully it'll also be a good resource for folks that want to set up their own labs to test out new features and software versions. Today I'm going to show the steps required to deploy Netapp's ONTAP Simulator 9.3 on vSphere 6.5.  I'll also be doing a follow-up article that will detail the process of clustering a second node with this first one.

Note: My lab has vCenter 6.5 deployed along with a Distributed vSwitch, so the steps will be specific to that deployment. I will also assume that you already have basic networking and storage for your virtual machines in place.

Step 1: Deploying the Simulator

1. Browse out to https://mysupport.netapp.com, click on "Sign In" in the upper right-hand corner and log in using your NetApp account credentials.

2. Click on the Downloads drop-down at the top of the screen and c…

Cisco UCS Platform Emulator Installation

To continue my series of posts on building the framework for a functional lab environment, I'd like to talk about the Cisco UCS Platform Emulator (UCSPE). It is a software appliance packaged as a vSphere OVA that approximates a UCS deployment, including the networking components (a pair of switches called the Fabric Interconnects) and both blade and rackmount UCS servers (B- and C-Series, respectively). It can be a great tool for learning and becoming more familiar with the UCS platform. I will be deploying my UCSPE on vSphere 6.7 in my lab, but it should work similarly in other recent versions.

1. Start by downloading the UCS Platform Emulator OVA from https://communities.cisco.com/docs/DOC-71877 - you will need a Cisco Connection Online (CCO) login in order to begin the download. I am using version 3.1(2ePE1) of the emulator for this guide as that appeared to be the latest version available at the time of writing. Side note, I also noticed during the boot process that this versi…