Can’t connect to vmware data recovery virtual appliance

I couldn’t for the life of me connect to the data recovery virtual appliance. I tried all sorts of things, this forum has a lot of helpful hints…

http://communities.vmware.com/thread/208385?start=0&tstart=0

But this post gave me the answer,

I called into Tech Support and they have confirmed the following:

Vista & Windows 2008 (32bit/64bit) vSphere Client has issues with the Data Recovery Plugin. Sometimes the plugin doesn’t register, sometimes it does. When it does register, you will not be able to connect your Data Recovery appliance. It does however work on XP. Once you’ve registered the appliance from an XP client, and walked through the “Getting Started Wizard” you can access the appliance from the Vista/Windows 2008 vSphere Client without issues (you may need a restart of your vSphere client).

The tech confirmed with an internal VMware BUG ID # 412552.

So I installed the client and plugin on my XP box and lo and behold I could connect. After running through the getting started wizard I could then connect on my 2003 box that has vcenter installed…. so strange but I’m just glad its working. Oh yeah they have a newer version out 1.1

So you bought an EqualLogic SAN, now what… part one

This post might be extremely long. I like to document things just in case I can spare other people the pain that I went through.

Ok, so you just bought an iScsi SAN right. So you should learn a bunch about iScsi I would think. If you are using iScsi with vmware you NEED to read this.

http://www.vmware.com/pdf/vsphere4/r40/vsp_40_iscsi_san_cfg.pdf or my copy is here http://sites.google.com/site/mellerbeck/Home/vsp_40_iscsi_san_cfg.pdf?attredirects=0&d=1

If you are doing 3.5 then read this one http://www.vmware.com/pdf/vi3_35/esx_3/r35/vi3_35_25_iscsi_san_cfg.pdf

I would read this as well… http://virtualgeek.typepad.com/virtual_geek/2009/01/a-multivendor-post-to-help-our-mutual-iscsi-customers-using-vmware.html

So, did you read it? I mean go READ IT. Trust me! What I gleaned from those guides is this: (copied from http://www.yellow-bricks.com/2008/07/21/queuedepth-and-whats-next/

…. often overlooked is Disk.UseLunReset and/or Disk.UseDeviceReset. ESX defaults to Disk.UseLunReset=1 and Disk.UseDeviceReset=1. This means that when a SCSI bus is reset all SCSI reservations are cleared, not for a specific LUN but for the complete device. This is useful when one uses local storage, but within a VMware environment most companies utilize a SAN and you don’t want to disrupt the entire SAN when it’s not necesarry. You can set this via the commandline, powershell and via VirtualCenter:

  1. VirtualCenter -> Configuration Tab -> Advanced Settings -> Disk -> Disk.UseLunReset=1 , Disk.UseDeviceReset=0
  2. Get-VMHost | Set-VMHostAdvancedConfiguration -Name Disk.UseDeviceReset -Value 0
  3. Commandline -> esxcfg-advcfg -s 1 /Disk/UseLunReset
    Commandline -> esxcfg-advcfg -s 0 /Disk/UseDeviceReset

The next thing I learned from the guide is I think you want to change the Disk.MaxLUN parameter. It defaults to 255 but unless you are planning on having that many luns setting it to say 50 is more reasonable and will make your ESX boot quicker as well as scan LUNS quicker.

In the 3.5 guide it mentions removing the vmfs-2 module (not so in the 4 manual) but it is easy to do as laid out in this post. http://www.yellow-bricks.com/2009/03/13/disabling-the-vmfs-2-module-exploring-the-next-generation-of-esx/

Basically run this command. esxcfg-module -d vmfs2

It also mentions somewhere (I couldn’t find it again) that if you only made changes to an iSCSI LUN then you only need to rescan that one (just right click the vmhba33 and rescan)

OK, on with the show

Get access to the dell\equallogic support site. You will need it for firmware and software.

  • Create an EqualLogic Support Account

Our setup: 3 IBM 3850 M2 with 2 processors, 64 GB of RAM. 2 nics four ports (etherchannel bonded on incoming Cisco like this)  2 nics four ports for iscsi traffic. And a nic for the service console, and another gig nic for vmotion. Nics are Intel PRO/1000 dual port controllers

What we bought: two PS6000XV, two Dell PowerConnect 6224 Ethernet Switch’s ( We probably should have gone with the 6248 for more ports but this is sufficient)

Two Stacking Modules for our Ethernet Switches. Everywhere I read recommends stacking the switches as the way to go.

A lot of cat 6 network cable of two colors. We refer to one switch as our Red switch and another as our Orange switch. This way we have a visual check of whether every system has redundant paths.

Ok, so the hardware arrived. Rack and stack them along with the switches. Connect the stacking cables between the switches. How should you do that well look here on page 32

Manual for Dell Power Connect 6224

http://support.dell.com/support/edocs/network/pc62xx/en/UG/PDF/ug_en.pdf

ug_en.pdf_001

I don’t know who names things over at Dell, but they seriously need to revisit their taxonomy.

Ok, so now everything should be cabled up. You want redundant paths from end to end.

Ok, configure the switch. But how should we configure it? Yet again how Dell names things leaves me less than amused. Of course the guide that you need to configure your dell power connect 6224 for use with equallogic is named none other than…. wait for it…. Dell EqualLogic Configuration Guide…. yup thats it.

Ok where is it? You can find it here http://sites.google.com/site/mellerbeck/Home/Dell_EqualLogic_Configuration_Guide.pdf?attredirects=0&d=1

So on page 22 (26) you can get the step by step commands to configure that switch, can Dell make it any harder to find this stuff?

I got a hold of a couple of other manuals for this switch that are pretty useful.

http://sites.google.com/site/mellerbeck/Home/PowerConnectTuning-rev1.07.pdf?attredirects=0&d=1

I did everything but enable the Cut-Through Switch Forwarding.

And here is the manual if you need to configure LAG’s http://sites.google.com/site/mellerbeck/Home/Dell-PowerConnect-How_to_configure_LAG_LACP-1.0.pdf?attredirects=0&d=1

Here is a list of some manuals as well http://support.dell.com/support/edocs/network/pc62xx/en/index.htm

Double check your firmware on the  Switch 2.2.0.3 seems to work. Power connect 6224 Firmware is hiding here http://ftp.dell.com/firmware/

So from the Dell Equallogic Configuration Guide I gleaned that stacking seems very recommended. Also, some main requirements are enabling Flow Control. For Flow control on the 6224 it is a global setting. I have set it and not seen it go out to all ports. So I disabled an enabled it again. And it went out to most ports (most would read active a few inactive) after disabling and enabling it again it seems to have finally stuck. A next important point is no STP functionality on switch ports that connect end nodes (end nodes are your ESX boxes or the SAN) which in my case is pretty much it! So I disabled STP on everything. (Why disable STP? It can cause delays that can cause your fail-overs to not work!) If you must use STP they recommend Rapid STP which to turn that on is in the config guide. Then finally, enable Jumbo Frames.

This is what my switch looks like, if you know it should be different please let me know! Since its hard to get straight answers with these switches for some reason.

Ok, so your switches are configured.

Next, turn on one of your SAN’s (If you happen to have two, this way you can name it otherwise you have to locate the serial num which is impossible) and run the setup wizard. RAID 50 seems to be the defacto for most people. Add it to the group. Upgrade the firmware on your SAN (can download from the equallogic support site). Do it at the beginning and get it out of the way now :)

Read the release notes for your firmware, here it is for the 4.2.1 http://sites.google.com/site/mellerbeck/Home/110-6024-EN-R1_RNotes_V4.pdf?attredirects=0&d=1

The supported configuration Limits is something to pay attention to! page 4 (8)

What I gleaned from the Release notes are a few Reg edits that you want to make to anything attaching to the SAN.

Increase the value of the TimeOutValue parameter (HKEY_LOCAL_MACHINE/SYSTEM/
CurrentControlSet/Services/Disk/TimeOutValue) to at least 60 seconds (the default is
10 seconds). Make sure to set the type to DWORD and enter the value (60) in decimal.
This will increase the timeout period for all disk I/Os for the disk class driver (in contrast to driver-
specific Registry parameters that affect only your iSCSI initiator). You must reboot the server for
the change to take effect.

If you install the VMware tools it will automatically make this change (I believe, it doesn’t hurt to check)

***

For any environment (clustered or not) using multi-path I/O, make the following changes:
–    Add or modify the UseCustomPathRecoveryInterval key, and set its value to 1.
–    Add or modify the PDORemovePeriod key, and set its value to 120.
–    Add or modify the PathRecoveryInterval key to 60, or half the value of the
PDORemovePeriod key, if it exists.

HKLM\System\CurrentControlSet\Services\mpio\Parameters\UseCustomPathRecoveryInterval                     1
HKLM\System\CurrentControlSet\Services\mpio\Parameters\PDORemovePeriod                                                   120
HKLM\System\CurrentControlSet\Services\mpio\Parameters\PathRecoveryInterval                                            60

These also look they are added by a vmware tools install as well.

This is also an interesting tip for Vista or Server 2008.

Accessing PS Series Groups Using iSCSI Initiators on Microsoft Vista or Windows Server
2008. To support group access from initiators running on these operating systems, you must enable
ICMP echo requests for ICMPv4, or for ICMPv6, if using an IPv6 initiator.

Turn on the other SAN. Name it. Add it to the group. RAID 50,

Alrighty, now I’m really gonna make your eyes bleed. Configuring VMware vSphere Software iSCSI with Dell EqualLogic PS Series Storage.pdf

(You could also google that up) So following that guide we create a virtual switch with jumbo frames, and created 8 vnics (the maximum), then assigned network adapters, then associate the vmkernel ports to the physical adapters,  enable iscsi, then finally bind the VMkernel ports. At the end of the PDF there is a scripting section that will help automate this.

OK, on second thought don’t create 8 virtual nics. The equallogic has a limitation of 512 iSCSI connections. So every time I was creating a volume I was eating up 24 connections. So now I am reducing this to 4 virtual nics to match the four physical ports, probably makes more sense that way anyways. I did learn that each pool can have 512 connections so maybe what I should have done was create separate pools. But once we added all of our space into the default pool this wasn’t an option any more. To learn more about Storage Pools you could read this http://sites.google.com/site/mellerbeck/Home/DeployingPoolsandTieredStorageinaPSSeriesSAN..pdf?attredirects=0&d=1

So now lets put the SAN through its paces. First thing I think you should test is all of the redundant paths. So create a volume and attach to it. I put a VM on it and started it up. I started pulling connections from a least dangerous to most dangerous. So first I simulated a dead nic by pulling the ethernet to it. Next I tested pulling power to the slave switch. Next, I tested pulling power to the master switch and seeing how long failover took and whether the VM’s survived. Oh you might want to make sure you install these patches on your vsphere boxes beforehand https://www.equallogic.com/enewsletter/technnote_0072009.html or else your volume might not stay connected. Finally I pulled power to the master controller on the Equallogic. All of these test helped verified that the redundant paths were working as advertised.

Ok, now that that is verified lets start stress testing the infrastructure. But first off install the Dell Equallogic SAN HeadQuarters so we can get some visibility into the pounding we are giving the SAN. You download SAN HeadQuarters from the support site. Unfortunately its an EXE that is named Setup.exe, in fact just about everything you download is named Setup.exe grrr….. bad form, bad form. Ok, so to use SAN HeadQuarters you will need to add a read only SNMP community name. So through Group Configuration of your Equallogic, click the SNMP tab, and then click add to add one. Now when you install SAN headquarters you will use that public SNMP community name.

So now you can see your SAN activity, here is a snapshot of my SAN being really bored.

If that hasn’t scarred you enough you can always read part two of this post.

http://michaelellerbeck.com/2009/11/23/so-you-bought-an-equallogic-san-now-what-part-two/

AnywhereUSB drivers

Finding usbd.sys for server 2008 enterprise for an AnywhereUSB

This was a pain. Basically mount the .iso

Download imagex.exe from here http://www.tipandtrick.net/2008/imagex-600118000-x86-and-x64-for-windows-server-2008-and-vista-sp1-standalone-download/

This explains what to do with it http://4sysops.com/archives/how-to-mount-a-wim-image-with-imagex-in-windows-vista/

Right click on wimfltr.inf and then click on “install”

Reboot.

Create a new folder this will be your mount point mine was c:\test

Run the command imagex.exe /mount d:\sources\install.wim 5 c:\test

Now you can search for usbd.sys in the c:\test folder

Or you could just download it from here (this is the 64bit version for server2008  enterprise)

http://sites.google.com/site/mellerbeck/Home/usbd.zi?attredirects=0&d=1

rename to a .zip and unzip

Old Datastores still hanging on after storage motion on vsphere

I have been able to clear this by doing a vmotion off to another host.

vpxclient.exe runtime error vsphere vcenter import crashes

So I ran across the lovely error that vcenter would crash when I would go to import a machine. After much googling I finally ran across this post

http://www.vmware.com/support/vsphere4/doc/vsp_vcc_41_rel_notes.html

This is the jist of it

Microsoft Visual C++ Runtime Library error message is displayed during import or export of a machine
If a user environment has versions of VI Client 2.5 Update 1, Update 2, Update 3, or Update 4 that coexist with vSphere Client 4.0, and you install the vCenter Converter 4.1.0 client plug-in, when you start the vCenter Converter Import or Export wizard, the vSphere Client session is terminated abruptly. An OpenSSL DLL conflict between the VI Client versions and the vCenter Converter 4.1.0 client plug-in causes this problem.
Workaround: Go to the Launcher folder in the VI Client install directory, for example, C:\Program Files\VMware\Infrastructure\Virtual Infrastructure Client\Launcher, and delete the following DLL files from that location:

  • libeay32.dll
  • ssleay32.dll

I deleted these file from my windows\system32 folder (after verifying that the timestamp was old) and this did the trick.

Making ESX angry

So I always find ways to make the ESX servers angry. They are always stupid in hindsight. So the first easy way is to attempt to snapshot a VM that has very little space. It will start the snapshot and then run out of space, pausing your VM. And then joy, your vm is not working anymore. It tends to stall out the virtual center and won’t let you shut down the VM either. If you are patient, you can restart the virtual center. Then eventually get the VM shutdown, and then you can remove the snapshot. If you are lucky it will let you start it up again.

Another really fun way to make ESX angry is to delete an iscsi volume that it is actively using (deleting it on the SAN side) if you are stupid enough to do this (like me) you will see BAD THINGS like this http://communities.vmware.com/thread/228586

Gah, what a week so far….

 

IBM 3850 M2 not detecting Intel PRO/1000 PT – 10/100/1000Mbps Dual Port Server Gigabit Ethernet Adapter

For what ever reason if I plug in the adapter into slot 6 or 7 (the hot swap ports) they never are detected. So instead I moved one of our Qlogic HBA’s to the hot swap port, and then put the NIC in the regular port and it works just fine. Go figure.

This post is hilarious!

Interesting Blogger, Seth Godin

I have recently stumbled onto Seth Godins Blog here http://sethgodin.typepad.com/seths_blog/ so far I have been delighted to read his little daily snippets and insights into marketing and general life philosophy. He sounds like he would be a really interesting person to talk to.

His post of today is on asking what do you need me to do

For an entrepreneur\consultant\freelancer etc… he states that this is a dangerous question. So he recommends the proactive approach of letting your clients know what you can do for them. I do think that there is a fine balance between finding out what they think they need (and helping them decide if they really need it, that’s tricky!) and offering solutions that you think they might be interested in.

And interesting side note is that most of the time you can find a consultant who will implement anything you want, I think a really good consultant will question whether you really need what you think you want. (I read somewhere that you can find a consultant to implement a peanut butter and jelly sandwhich if that’s what you want, but I can’t find that quote again… it really cracked me up!)

For example we had a real honest tree removal guy come out and tell us that our tree was just fine (just sun scorched) and didn’t think we needed to remove it. The previous guy had left his car running and given us a removal quote in about five minutes.

Another example could be the case of whether you really need a 100K website if you get maybe a couple of hundred hits on your website and don’t really sell to consumers. It would be insane for the web company to leave that money on the table, but in all honesty shouldn’t they at least mention that the ROI might not warrant the expense?

http://www.typepad.com/services/trackback/6a00d83451b31569e20120a5aa72b7970c