Using Ai to Perform Boring Vmware Vsphere Tasks

Using AI to perform boring VMware vSphere tasks

GrahamJanuary 27, 2025April 23, 2025no commentAutomate with AI Browser Use Web UI LLM vSphere

Contents

Recently I have been thinking more and more about AI and I wondered if it was possible to use AI to automate boring administrative tasks in vSphere.
First I though to use some Generative AI to provide instruction on how to perform the task, but I had no way to actually get something to perform them for me.

Then today I discovered than you can use a tool called browser-use, combined with a model to perform tasks within a browser.

They way this works is you setup the browser-use system with your model of choice and API key. Then you simply provide a prompt such as “login to this URL with user X and password Y” It then launches a browser session and scans the page then injects your form details.

This is all basic stuff though, so I tried something more complex. I used these tools to login to the vSphere client and interact with my virtual machines but I think if you get the prompts right, you could even get vMotion to work and far more complex things including advanced configuration.

Before I run through how to set this up, take a look at the quick demo which logs into the vSphere client, deals with an SSL error, navigated around a bunch of errors in the client and then powers on a VM. The prompt for this is in the instructions below.

Demo

Setting up the environment

Install Python

Download and Install Python: Download Python | Python.org

Install Browser Use Web UI

Download or clone from: GitHub – browser-use/web-ui: Run AI Agent in your browser
Extract and change into the folder directory
Follow the instructions for Option 1: Local Installation
If you’re using PowerShell, when instructed to run:
source .venv/bin/activate
you will need to actually run:
.venv/bin/activate

Launch

From the same directory you have been working in, run:
python webui.py --ip 127.0.0.1 --port 7788
Open a web browser to:
http://127.0.0.1:7788

Configure Browser Use Web UI

Open the LLM Configuration and select your model. I am using the DeepSeek Reasoner Model.
Locate your model’s API keys
This will vary depending on the model you are using.
- For DeepSeek, Navigate to DeepSeek and then API platform.
- Create a new account
- Top up up the account with $2 to get started
- Next go to API keys and generate a new API key
Add your API key to Web UI (Under LLM Configuration)
Enter the Base URL for the model. For DeepSeek, this is:
https://api.deepseek.com/v1
Under Agent Settings, it is recommended to disable “Use Vision” due to compatibility issues with DeepSeek

Performing vSphere Tasks

Now onto the good bit!

Go to Run Agent and enter your prompt.
Be sure to provide the URL, user and password plus what you want it to do.

Task Description
1. Open the vSphere client
2. Wait for the licensing banner to load
3. Close the licensing error banner with the x on the top right of the browser
4. Power on the VM using the power on icon

Additional Information
The vSphere client URL is: https://vcf-m01-vc01.lab.local/ui/
The vSphere client username is: administrator@vsphere.local
The vSphere client password is: Password12345!
The VM name is: vcf-m01-nsx01a

What’s interesting here is in the console window you can see what the model is trying to do in plain English, then it tries that.
I also though it was interesting how it tried a few ways to bypass the SSL error until it finally found a good way to do it. Also sometimes it does the wrong thing (opens the wrong menu for example) but it will then close that menu and try again, a little like how a human would make a mistake and correct it.

Share your prompts!

If you manage to get something interesting working such as live vMotion, advanced configurations or even a host upgrade using this method, please let me know.

I’d be happy to put together a library of prompts and credit you for your work.

A note on using LLMs

I highly recommend doing your own research before using DeepSeek’s LLMs, especially for work purposes outside of China.

I would also avoid this use of LLMs to orchestrate tasks in your production environment due to the obvious risks associated with it.

One more thing

If you have found this post useful, consider joining my newsletter for updates and other useful downloads and guides >> Subscribe to the newsletter

Graham

Graham is a seasoned expert in VMware and Microsoft solutions, bringing deep expertise to his role at Dell Technologies. As a Senior Principal Engineer he focuses on cutting-edge virtualization and storage projects. A VMware Certified Implementation Expert, 9x VMware vExpert, and VMware User Moderator on the official VMTN forums, Graham is a trusted community resource. Connect with him on Twitter for insights! @VirtualG.uk

See Full Bio

Tags :Automate with AI Browser Use Web UI LLM vSphere

add a comment

Leave a Response Cancel reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Cookie	Duration	Description
yt-player-headers-readable	never	The yt-player-headers-readable cookie is used by YouTube to store user preferences related to video playback and interface, enhancing the user's viewing experience.
yt-remote-cast-installed	session	The yt-remote-cast-installed cookie is used to store the user's video player preferences using embedded YouTube video.
yt-remote-connected-devices	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-device-id	never	YouTube sets this cookie to store the user's video preferences using embedded YouTube videos.
yt-remote-fast-check-period	session	The yt-remote-fast-check-period cookie is used by YouTube to store the user's video player preferences for embedded YouTube videos.
yt-remote-session-app	session	The yt-remote-session-app cookie is used by YouTube to store user preferences and information about the interface of the embedded YouTube video player.
yt-remote-session-name	session	The yt-remote-session-name cookie is used by YouTube to store the user's video player preferences using embedded YouTube video.
ytidb::LAST_RESULT_ENTRY_KEY	never	The cookie ytidb::LAST_RESULT_ENTRY_KEY is used by YouTube to store the last search result entry that was clicked by the user. This information is used to improve the user experience by providing more relevant search results in the future.

Cookie	Duration	Description
_ga	1 year 1 month 4 days	Google Analytics sets this cookie to calculate visitor, session and campaign data and track site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*	1 year 1 month 4 days	Google Analytics sets this cookie to store and count page views.

Cookie	Duration	Description
VISITOR_INFO1_LIVE	6 months	YouTube sets this cookie to measure bandwidth, determining whether the user gets the new or old player interface.
VISITOR_PRIVACY_METADATA	6 months	YouTube sets this cookie to store the user's cookie consent state for the current domain.
YSC	session	Youtube sets this cookie to track the views of embedded videos on Youtube pages.
yt.innertube::nextId	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.
yt.innertube::requests	never	YouTube sets this cookie to register a unique ID to store data on what videos from YouTube the user has seen.

Cookie	Duration	Description
__eoi	6 months	Description is currently not available.
_pk_id.2207.d26e	1 year 1 month	Description is currently not available.
_pk_ses.2207.d26e	1 hour	Description is currently not available.
fm_cookie_d906c532a46cc05ae740fbf60cad7d07	1 month	Description is currently not available.