Building Virtual Machine automated vertical scalability with VMSS in Azure

Are you planning to lift and shift VMs into Cloud? Or have you done migration and now looking for a way how to scale them automatically?

Well, this article can be right for you!

When it comes to lifting and shifting the application and/or services hosted on the Virtual Machines (VM) from an on-premise environment to the Cloud, building a strategy and design of how to achieve some sort of automatization in VM scaling can be a challenging task.

In general, some type of scalability can be achieved with Virtual Machines as they are but it’s going to be very inopportune.

That’s why it is very important to work on cloud-based infrastructure design prior to the lifting-and-shifting process itself. And remember, VM scaling needs to happen automatically.

The approach I like to teach others is to automate everything that follows repetitive cycles.

But hang on, what if I scale up horizontally in Cloud by adding extra HW resources (RAM, CPU) to logical machine computing power hosting my VM? .. Yes, this may work but with some scripting to be done first and that is less likely going to be repetitive with the same set of input properties…

But what if full scaling automatization can be accomplished with a higher running cost efficiency and as little configuration work as possible?

Yes, that is all possible these days and I am going to share how to use one of the options from the market.

The options I liked to pursue one project of mine is coming from Microsoft Azure Resources stash.

Why is that?

It’s not a secret that I’ve worked with Azure since the early saga beginning. Therefore, I have built a long experience with the Azure platform. On the other hand, I have to admit that Azure Software Engineers have done a great job on building platform APIs and Web Wide UX/UI interface (Azure Portal) to make this process seamless and as easy to use as possible. More on my driving decision factors later …

Let’s get started

The Azure resources I have been mentioning here in the prologue are:

VMSS in Azure portal
Azure Compute Galleries in Azure portal
Azure Load Balancers in Azure portal

My reasons of choosing Azure

Every project has different needs and challenges coming from the business domain requirements. More importantly, rational justification on the economical side of the project complexity is mostly the driver of the project technological path in the design stage.

For this project, I was lucky because the customer I designed this solution for had part of the business applications and services in Azure already. Also, customer big ambitious plans to migrate everything else from an on-premise data centre to the Cloud in the near time horizon just made my decision more sealed and therefore Cloud in Azure was the way to go.

Infrastructure diagram

Let’s get a better understanding of the designed system infrastructure from the simplified infrastructure diagram below.

Take it with a grain of salt as the main purpose of it is to highlight the main components used in the project and discuss these in this post.

VMSS simplified infrastructure diagram

What I like most about the selected Azure stack

  • VM redundacy accross multiple data centers globally
  • has an ability to multiply VM instances as needed with an option to resize the instance computing power when needed (RAM, CPU, etc. => vertical scaling)
  • high service availability and resilience (subject to infrastructure desing – in my case, I provissioned total of two VMSSs, one geographically different data center each)
  • I like the flexibility of building my rules in VMSS on which system decides whether VM instances go up or down on quantity
  • Azure traffic ballancer can be linked to VMSS easily
  • the VMSS service can provision up to 600 VM instances (and that is a lot!)
  • the Azure Compute Gallery (ACG) service is able repliate image globaly, supports image versioning and auto deployment of the latest model to VM running instance (and that was a hot feature for me)

Steps to provision services in Azure

In nutshell, follow these steps to provision Azure services and build the cloud infrastructure from the ground up:

  1. Lift and shift the VM into Azure (I can recommend to use Azure Migrate service to start this process)
  2. Create a new Azure resource: Azure Compute Gallery
  3. Go to running instance of VM and capture and generalize the image of the migrated VM
Capturing VM state into the image, Azure portal
Selecting an option to Generalized VM captured state into the image
  1. Create two replicated images (for one datacentre each)
Two replicated images setting
  1. Save the image into Azure Compute Gallery created in step 2
  2. Create two new Azure resources: Virtual Machine Scale Set (in geographically different data centers as per settings in ‘Target regions’ in step 4 for Scaleset redundacy capabilities)
  3. Create scale out/in rules in VMSS

Scale out/in VMSS rules example

The screenshot image below shows the example of setting up the Scaling rules for one of the VMSS instances.

VMSS scaling rules example

As you can see in my default profile in the picture above, this VMSS instance is not running any VM instances by default (Minimum = 0). But rather, spins up some (scaling out) based on these criteria:

  1. The main VMSS instance hosted in datacenter A increases on average CPU (or)
  2. The load balancer availability drops below 70% in given timeframe

Very similar rules are used in the reverse process, aka scaling in.

If you’re planning to use a similar concept in your solution, count with a VM operation system booting time in your metrics if high VM hosted service availability and responsiveness is important to meet.

Microsoft Azure recently introduces a new feature called Predictive autoscale with Pre-launch setup (at the time of writing this article in preview only) which should solve a VM boot time issue for most of the use case scenarios. It works based on cyclical workload patterns determined by machine learning and predict scaling out need action in advance.

I like to say, using Machine learning capabilities in this sort of behaviour analysis is a very smart move from Microsoft forward.

I think VMSS has a lot to offer to businesses starting their journey to the Cloud.

The process of setting the infrastructure up is not complicated and can be done over UI/UX design in the Azure portal in no time. The VMSS scaling rules offer a lot of options to choose from and the level of integration with other types of Azure resources is on a very mature level, too.

Hope you enjoyed this reading and leave me some comments below 🙂

/cheers

How to save over 50% on Azure resources running cost

The running cost on some of the Azure resources (and licenses) can be massive and can cause a lot of frustration to a new starting business.

Therefore, I do always talk to my clients and try to find the best solution fitting their current needs. Then following the strategy of organic growth is the best way how to keep costs down while following the business transformation with technology into the mature and profitable one in future.

In this article, I am going to explain how to cut Azure resource costs down from 50 – 70% depending on the resource type and length of commitment.

Where to start?

Azure has a very smart way of how to keep the customer’s engagement for years. I admire this strategy because creates a really good value for both parties (customer and provider).

If you haven’t heard about Azure Reservations it’s a good time to start your home research with this link.

In nutshell, you pay less for the Azure resource based on pre-purchased Reservations in years. That means that a longer commitment with the resource you make less expensive it is going to get.

How to order Reservation in Azure portal

1. Log in to the Azure portal and search for ‘Reservations’. Select the Reservations option from the list and you should be able to see this page like in the picture below:

This is the current list of all reservations I have for one of my clients

2. Click on the ‘plus’ icon on the top left corner. You will be redirected on this page as shown in the picture below:

List of resources to choose from

3. Select resource you like to reserve from the list (I chose Virtual Machine)

4. If you currently hosting some VM (as in my case) in Azure without Reservation, this tool does the filtering of the size of VM automatically on the next page for you based on real-time utilization of that VM – that is smart!

5. Refine your selection in the next window by selecting the exact instance you like to reserve like in the picture below. This step is brilliant. It gives you an exact quote of how much it is going to cost you and what saving you are getting with a selected time commitment!

Available VM sizes with a price quote and estimate saving

6. Review the order and click ‘Review + Buy’ as is shown in the picture below:

Review order and purchase the reservation

7. … and we are done! You can monitor the overall Reservation utilization on the resource on the same page later on.

The process flow thanks to MS UX and UI is very intuitive, fast and clear. Tell me your thoughts in the comments below!

Changing mind after purchase?

Unfortunately, there is some cost associated if want to cancel after purchasing the Reservation.

But, what I would recommend doing instead is trying to do a Reservation exchange!

Yep, you hear me right, you can exchange the Reservation for some other one as long as the purchasing price is not lower than the original one.

I think it’s brilliant and saves a lot of fiddling around on cost management when business strategy changes!

/cheers


Why is CDN very important for your static HTTP content

Almost everyone heard about CDN but what that actually is?

Explained: CDN (Content delivery network) is the set of geographically distributed servers (proxy servers if you like) that cache the static content on running physical hardware which speeds up the actual download to its destination.

The global CDN network
The global CDN network in Example

Now, let me explain why the CDN network is such a big player in the Solution infrastructure and why no Solution developer/architect should overlook this.

But before we go any further let me mention another term which is: response latency.

Explained: In other words, the time needed for to download the Website content entirely to the consumer (End-user) device.

And as you can imagine, this is another very important factor to have your eye on if want to keep your audience engaged to service provided content as long as possible.

Low latency means a better User responsiveness/experience with the Website (Web service).

The question is, how to achieve the lowest latency possible? … there are two ways how to do it:

  • to use a very fast network for content delivery, or
  • to cache the content as closely as possible geographically to your audience

… the combination of both of these is the ultimate state towards which the global network is going (near real-time response).

And as all of you probably understand by now, to get the best ROI in the time you put into the content it is very important to have your infrastructure in the best shape possible. Keeping your visitors happy by serving them content as fast as possible helps to build better Website awareness and audience growth.

What CDN service provider I use?

Among all of the CDN providers, I have came across, Cloudflare is the one I was attracted most.

.. for many reasons:

The main one is that the service is offering a reasonably good DDoS protection shielding and well distributed and fast CDN server nodes.

Cloudflare account dashboard

To me, it is almost unbelievable that all of that for as much as $0! Yes, all of that can be yours for FREE! Very sweet deal, don’t you think? (btw, I am not participating on any affiliate program!)

Setting all of that up is a really straightforward and well-documented process.

If you want to know more visit this guide of how to set it all up.

The entire configuration process becomes even easier if having a domain name address purchased separately from the Web hosting (easier to maintain the DNS servers configuration over Domain name provider portal – which every solid domain name provider has).

Some another feature Cloudflare provides is the fast route finder across the Cloudflare network called Argo, which helps to decrease loading time and reduces bandwidth costs.

I have been using this service for one of my clients who is providing Address lookup and Address validation services over REST API web services hosted in Cloud in multiple geographically different data centres and I must say that the customer experience was very positive since.

In numbers, I was able to reduce an HTTP response latency time down from 1.4s to 0.5s! And these are very good performance improvements for a business where time is the essence.

I am leaving this link here if interested to know more about this.

Anyway, thank you again for visiting this post, I hope you have enjoyed reading and let me know what CDN provider you’re using!

/cheers