
Virtual machine performance comparison of 6 cloud platforms: Selectel, MCS, Ya.Oblako, Google Cloud, AWS and Azure

Recently I came across two articles from a corporate blog about clouds: one about Kubernetes, and the second an attempt to measure performance with a methodology that seemed dubious to me (spoiler: not without reason).

I have something to say about K8s as well, but let's talk about performance.

My distrust of the results came from several factors, the main ones being: no test run parameters were given, the number of iterations was not stated, the machine types were not specified, and there was no detailed configuration either. Dubious overall.
I mainly use Google Cloud and AWS (about ten years of combined working experience with them) and don't work with domestic cloud providers, but, as it happens, I have active accounts with Selectel, MCS, Yandex.Cloud and, after this test, Azure as well.

Fortunately, all these platforms are public, so whatever my intentions, anyone who wishes can go, repeat, and verify.

All this led to a thought: why not spend a couple of hundred rubles and an entire weekend to really, thoughtfully measure all six platforms and find out which of them gives the best performance relative to cost and in absolute terms with identical configurations, and along the way compare the global providers with the Russian ones.
And also, as it turned out, to clarify some "features" of resource allocation, and to remind myself and others that you cannot always, and not on every platform, get predictable performance for the same money.

The results cannot be called phenomenal, but in my opinion they are extremely curious.

Details below the cut for those interested.

Methodology


Virtual machines


For each cloud provider, three virtual machines with 4 CPUs, 8 GB of RAM, and a 50 GB system disk were launched one after another in different availability zones (if there are only two zones, then one machine goes to the first zone and two to the second).

The processor/instance type is the newest available, where there is a choice.

VM type: shared, with full core allocation.

Disk type: network SSD that can be re-attached to another VM.

Options for guaranteed IOPS allocation, or machines optimized for it, were not used, unless they are part of the standard terms of use and cannot be opted out of.

The default file system is ext4.

No manual system settings were made.

A series of tests was run on each of the machines, the totals for each machine were averaged.

The final performance of a platform is expressed as the arithmetic mean of the average test values of each of its virtual machines; the standard deviation is also given in the tables for those interested.
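To make the aggregation unambiguous, here is a small sketch of how the per-platform figures are derived, using the Stress-NG 1 CPU runs from the three Yandex.Cloud zone tables below as input: each VM's three iterations are averaged, and the platform score is the mean of those per-VM averages, with the sample standard deviation reported alongside.

```python
from statistics import mean, stdev

# Three iterations per VM (Stress-NG 1 CPU bogo-ops from the three
# Yandex.Cloud zones, as listed in the tables below)
vm_runs = [
    [10236, 10045, 10161],  # ru-central1-a
    [10634, 10848, 11870],  # ru-central1-b
    [10424, 10192, 10325],  # ru-central1-c
]

# Step 1: average the iterations of each VM
vm_avgs = [mean(runs) for runs in vm_runs]

# Step 2: the platform score is the mean of the per-VM averages;
# the sample standard deviation shows the spread between VMs
platform_avg = mean(vm_avgs)
platform_stdev = stdev(vm_avgs)

print(f"per-VM averages: {[round(a, 2) for a in vm_avgs]}")
print(f"platform: {platform_avg:.2f} ± {platform_stdev:.2f}")
```

Run on these inputs, it reproduces the Yandex.Cloud summary row (10526.11 ± 518.72).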

The operating system is Ubuntu 16.04 of the latest available patch level.

Cost calculation


The cost was calculated without any provider bonuses and without traffic costs, assuming the virtual machine runs non-stop for a full calendar month.
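As a sketch of this cost model (the hourly rate below is hypothetical, purely for illustration, not a quoted price): the monthly figure is simply the hourly rate times the hours in the month, with no traffic or bonuses.

```python
# Hypothetical hourly rate for a 4 CPU / 8 GB VM, just to illustrate the model
hourly_rate_rub = 5.27

# A VM running non-stop for a 30-day calendar month
hours_per_month = 30 * 24  # 720 hours

monthly_cost_rub = hourly_rate_rub * hours_per_month
print(f"{monthly_cost_rub:.2f} rub/month")
```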

Also, some platforms let you significantly reduce the cost of resources in exchange for certain restrictions (which you can skillfully live with).

For AWS these are Spot instances; for GCE, Preemptible instances. With a suitable application architecture they can be used successfully with no harm to it and real benefit to the wallet; this has been verified by me personally and by dozens of companies using both.
Selectel's disk types can be put in the same category. Although the main measurements used "Fast" disks, there is a significantly cheaper "Universal" type that is no speed champion but is fine for a huge number of tasks. Options using it were also taken into account in the final calculations.

Tests


To run the tests, the following script was written; all launch parameters can be seen in it:

Test script
```bash
#!/usr/bin/env bash

TIME=60

# Workload 70% read 30% write
cat > fio-rand.fio << EOL
[global]
name=fio-rand-RW
filename=fio-rand-RW
rw=randrw
rwmixread=70
rwmixwrite=30
bs=4K
direct=1
numjobs=1
time_based=1
runtime=${TIME}

[file1]
size=2G
iodepth=16
EOL

echo "Run FIO"
for i in {1..3}; do
    echo "$i iter:"
    fio fio-rand.fio | grep -E "(read|write|bw|iops|READ|WRITE)" | grep -v "Disk"
done

echo "Run stress-ng."
for i in {1,2,4}; do
    for z in {1..3}; do
        echo -n "$z iter. Stress-NG for $i CPU: "
        stress-ng --cpu $i --cpu-method matrixprod --metrics-brief -t $TIME 2>&1 | sed -n '6p' | awk '{print $5}'
    done
done

for i in {1,2,4}; do
    for z in {1..3}; do
        echo -n "$z iter. Sysbench CPU for $i thread(s): "
        sysbench --num-threads=$i --max-time=$TIME --test=cpu run 2>&1 | grep "total time:" | awk '{print $3}'
    done
done

for i in {1,2,4}; do
    for z in {1..3}; do
        echo -n "$z iter. Sysbench Memory for $i thread(s): "
        sysbench --num-threads=$i --max-time=$TIME --test=memory run 2>&1 | grep "Operations performed:"
    done
done
```


For all tests except Sysbench CPU, higher is better (Sysbench CPU reports elapsed time, so there lower is better).

The results of all launches were collected in Excel spreadsheets for further calculations.
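Before the spreadsheets, the raw logs have to be turned into numbers. A minimal sketch of pulling the fio IOPS figures out of the log format the test script produces (the sample lines are taken from the run log below):

```python
import re

# Two lines in the format the test script's fio output produces
log = """\
read : io=891652KB, bw=14861KB/s, iops=3715, runt= 60001msec
write: io=381908KB, bw=6365.3KB/s, iops=1591, runt= 60001msec
"""

# Capture the operation (read/write) and its IOPS value from each line
pattern = re.compile(r"^(read|write)\s*:.*?iops=(\d+)", re.MULTILINE)
iops = {op: int(val) for op, val in pattern.findall(log)}
print(iops)
```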
Well, it seems I've covered what I did; now to tell what came of it.

Testing


A reference machine, outside the competition.


Clouds are usually compared with ordinary bare-metal servers. I do not see much sense in this, since a cloud is not only, and not so much, raw computing power as, first of all, an ecosystem; nevertheless, I think many will still find such a comparison interesting. And in general, you have to compare with something close, familiar, and understandable.
I didn't have a bare-metal machine at hand, but there is a fairly new non-Dell workstation, also serving as a home server, with a well-known processor (E5-4650L @ 2.60GHz), a suitable amount of not-the-fastest DDR3 ECC memory (frankly, the slowest that was compatible at all) and a SmartBuy SSD bought 4 years ago and recently moved into this build.

Since all of this runs under FreeBSD 11.2, a suitable virtual machine was created with bhyve and the test was run there.

Startup log
```
Run FIO
1 iter:
read : io=891652KB, bw=14861KB/s, iops=3715, runt= 60001msec
bw (KB /s): min= 116, max=17520, per=100.00%, avg=15449.34, stdev=2990.83
write: io=381908KB, bw=6365.3KB/s, iops=1591, runt= 60001msec
bw (KB /s): min= 49, max= 7752, per=100.00%, avg=6620.06, stdev=1290.46
READ: io=891652KB, aggrb=14860KB/s, minb=14860KB/s, maxb=14860KB/s, mint=60001msec, maxt=60001msec
WRITE: io=381908KB, aggrb=6365KB/s, minb=6365KB/s, maxb=6365KB/s, mint=60001msec, maxt=60001msec
2 iter:
read : io=930228KB, bw=15504KB/s, iops=3875, runt= 60001msec
bw (KB /s): min= 5088, max=17144, per=99.98%, avg=15500.61, stdev=2175.23
write: io=398256KB, bw=6637.6KB/s, iops=1659, runt= 60001msec
bw (KB /s): min= 2064, max= 7504, per=100.00%, avg=6639.82, stdev=979.69
READ: io=930228KB, aggrb=15503KB/s, minb=15503KB/s, maxb=15503KB/s, mint=60001msec, maxt=60001msec
WRITE: io=398256KB, aggrb=6637KB/s, minb=6637KB/s, maxb=6637KB/s, mint=60001msec, maxt=60001msec
3 iter:
read : io=886780KB, bw=14779KB/s, iops=3694, runt= 60001msec
bw (KB /s): min= 1823, max=17248, per=100.00%, avg=15520.09, stdev=2453.59
write: io=379988KB, bw=6333.3KB/s, iops=1583, runt= 60001msec
bw (KB /s): min= 731, max= 7488, per=100.00%, avg=6647.33, stdev=1054.67
READ: io=886780KB, aggrb=14779KB/s, minb=14779KB/s, maxb=14779KB/s, mint=60001msec, maxt=60001msec
WRITE: io=379988KB, aggrb=6333KB/s, minb=6333KB/s, maxb=6333KB/s, mint=60001msec, maxt=60001msec
Run stress-ng.
1 iter. Stress-NG for 1 CPU: 12227
2 iter. Stress-NG for 1 CPU: 12399
3 iter. Stress-NG for 1 CPU: 12134
1 iter. Stress-NG for 2 CPU: 23812
2 iter. Stress-NG for 2 CPU: 23558
3 iter. Stress-NG for 2 CPU: 21254
1 iter. Stress-NG for 4 CPU: 39495
2 iter. Stress-NG for 4 CPU: 39876
3 iter. Stress-NG for 4 CPU: 42370
1 iter. Sysbench CPU for 1 thread(s): 11.0566s
2 iter. Sysbench CPU for 1 thread(s): 11.0479s
3 iter. Sysbench CPU for 1 thread(s): 11.0451s
1 iter. Sysbench CPU for 2 thread(s): 5.6159s
2 iter. Sysbench CPU for 2 thread(s): 5.5664s
3 iter. Sysbench CPU for 2 thread(s): 5.5407s
1 iter. Sysbench CPU for 4 thread(s): 2.8368s
2 iter. Sysbench CPU for 4 thread(s): 2.8801s
3 iter. Sysbench CPU for 4 thread(s): 2.8244s
1 iter. Sysbench Memory for 1 thread(s): Operations performed: 104857600 (2537704.01 ops/sec)
2 iter. Sysbench Memory for 1 thread(s): Operations performed: 104857600 (2536025.17 ops/sec)
3 iter. Sysbench Memory for 1 thread(s): Operations performed: 104857600 (2472121.34 ops/sec)
1 iter. Sysbench Memory for 2 thread(s): Operations performed: 104857600 (3182800.43 ops/sec)
2 iter. Sysbench Memory for 2 thread(s): Operations performed: 104857600 (3379413.65 ops/sec)
3 iter. Sysbench Memory for 2 thread(s): Operations performed: 104857600 (3306495.59 ops/sec)
1 iter. Sysbench Memory for 4 thread(s): Operations performed: 104857600 (4300089.71 ops/sec)
2 iter. Sysbench Memory for 4 thread(s): Operations performed: 104857600 (4163689.93 ops/sec)
3 iter. Sysbench Memory for 4 thread(s): Operations performed: 104857600 (4163996.47 ops/sec)
```


If you translate the results into a table view, you get the following:
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 3715.00 | 3875.00 | 3694.00 | 3761.33 | 99.00 |
| FIO WRITE IOPS | 1591.00 | 1659.00 | 1583.00 | 1611.00 | 41.76 |
| STRESS-NG 1 CPU | 12227.00 | 12399.00 | 12134.00 | 12253.33 | 134.45 |
| STRESS-NG 2 CPU | 23812.00 | 23558.00 | 21254.00 | 22874.67 | 1409.27 |
| STRESS-NG 4 CPU | 39495.00 | 39876.00 | 42370.00 | 40580.33 | 1561.56 |
| Sysbench CPU for 1 | 11.06 | 11.05 | 11.05 | 11.05 | 0.01 |
| Sysbench CPU for 2 | 5.62 | 5.57 | 5.54 | 5.57 | 0.04 |
| Sysbench CPU for 4 | 2.84 | 2.88 | 2.82 | 2.85 | 0.03 |
| Sysbench Mem t 1 | 2537704.01 | 2536025.17 | 2472121.34 | 2515283.51 | 37388.96 |
| Sysbench Mem t 2 | 3182800.43 | 3379413.65 | 3306495.59 | 3289569.89 | 99393.41 |
| Sysbench Mem t 4 | 4300089.71 | 4163689.93 | 4163996.47 | 4209258.70 | 78662.11 |

So much for the reference data; now for the actual provider test results.

Further on, I will not include the complete logs so as not to bloat the article, but I keep them; if you want a link, ask and I will share it, though all the data from them has been transferred to the tables.

Yandex.Oblako


Results for ru-central1-a zone:

Result table
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 554.00 | 543.00 | 545.00 | 547.33 | 5.86 |
| FIO WRITE IOPS | 237.00 | 232.00 | 233.00 | 234.00 | 2.65 |
| STRESS-NG 1 CPU | 10236.00 | 10045.00 | 10161.00 | 10147.33 | 96.23 |
| STRESS-NG 2 CPU | 19756.00 | 19479.00 | 20291.00 | 19842.00 | 412.77 |
| STRESS-NG 4 CPU | 18743.00 | 17906.00 | 18192.00 | 18280.33 | 425.43 |
| Sysbench CPU for 1 | 11.94 | 11.95 | 11.98 | 11.96 | 0.02 |
| Sysbench CPU for 2 | 7.19 | 7.23 | 6.16 | 6.86 | 0.61 |
| Sysbench CPU for 4 | 3.72 | 3.72 | 3.70 | 3.71 | 0.01 |
| Sysbench Mem t 1 | 2080442.66 | 2085059.55 | 2079872.00 | 2081791.40 | 2844.64 |
| Sysbench Mem t 2 | 2460594.62 | 2715142.01 | 2536824.57 | 2570853.73 | 130641.04 |
| Sysbench Mem t 4 | 2978385.59 | 2928369.70 | 3020014.59 | 2975589.96 | 45886.36 |


Results for ru-central1-b zone:

Result table
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 543.00 | 537.00 | 523.00 | 534.33 | 10.26 |
| FIO WRITE IOPS | 232.00 | 230.00 | 224.00 | 228.67 | 4.16 |
| STRESS-NG 1 CPU | 10634.00 | 10848.00 | 11870.00 | 11117.33 | 660.55 |
| STRESS-NG 2 CPU | 22109.00 | 20861.00 | 21020.00 | 21330.00 | 679.30 |
| STRESS-NG 4 CPU | 18964.00 | 19449.00 | 18992.00 | 19135.00 | 272.29 |
| Sysbench CPU for 1 | 11.30 | 11.35 | 11.34 | 11.33 | 0.03 |
| Sysbench CPU for 2 | 5.87 | 5.88 | 5.89 | 5.88 | 0.01 |
| Sysbench CPU for 4 | 3.56 | 3.55 | 3.54 | 3.55 | 0.01 |
| Sysbench Mem t 1 | 2190808.15 | 2197111.57 | 2197600.12 | 2195173.28 | 3788.20 |
| Sysbench Mem t 2 | 2442631.19 | 2433028.20 | 2415710.66 | 2430456.68 | 13643.25 |
| Sysbench Mem t 4 | 3010239.12 | 3168720.68 | 3088677.50 | 3089212.43 | 79242.13 |


Results for ru-central1-c zone:

Result table
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 541.00 | 551.00 | 558.00 | 550.00 | 8.54 |
| FIO WRITE IOPS | 232.00 | 236.00 | 239.00 | 235.67 | 3.51 |
| STRESS-NG 1 CPU | 10424.00 | 10192.00 | 10325.00 | 10313.67 | 116.41 |
| STRESS-NG 2 CPU | 19637.00 | 20330.00 | 19585.00 | 19850.67 | 415.93 |
| STRESS-NG 4 CPU | 28884.00 | 28477.00 | 28750.00 | 28703.67 | 207.42 |
| Sysbench CPU for 1 | 11.67 | 11.64 | 11.68 | 11.67 | 0.02 |
| Sysbench CPU for 2 | 6.02 | 6.05 | 7.06 | 6.38 | 0.59 |
| Sysbench CPU for 4 | 3.40 | 3.40 | 3.40 | 3.40 | 0.00 |
| Sysbench Mem t 1 | 2131168.41 | 2130201.75 | 2142809.68 | 2134726.61 | 7016.81 |
| Sysbench Mem t 2 | 2777100.50 | 2592860.27 | 2226863.89 | 2532274.89 | 280076.82 |
| Sysbench Mem t 4 | 2834838.09 | 2935298.85 | 2753443.73 | 2841193.56 | 91093.99 |


Summary results:
| Test | Average | Avg min | Avg max | Stdev | StDev% |
|---|---|---|---|---|---|
| FIO READ IOPS | 543.89 | 534.33 | 550.00 | 8.38 | 1.5% |
| FIO WRITE IOPS | 232.78 | 228.67 | 235.67 | 3.66 | 1.6% |
| STRESS-NG 1 CPU | 10526.11 | 10147.33 | 11117.33 | 518.72 | 4.9% |
| STRESS-NG 2 CPU | 20340.89 | 19842.00 | 21330.00 | 856.61 | 4.2% |
| STRESS-NG 4 CPU | 22039.67 | 18280.33 | 28703.67 | 5786.99 | 26.3% |
| Sysbench CPU for 1 | 11.65 | 11.33 | 11.96 | 0.31 | 2.7% |
| Sysbench CPU for 2 | 6.37 | 5.88 | 6.86 | 0.49 | 7.7% |
| Sysbench CPU for 4 | 3.55 | 3.40 | 3.71 | 0.16 | 4.5% |
| Sysbench Mem t 1 | 2137230.43 | 2081791.40 | 2195173.28 | 56732.39 | 2.7% |
| Sysbench Mem t 2 | 2511195.10 | 2430456.68 | 2570853.73 | 72533.45 | 2.9% |
| Sysbench Mem t 4 | 2968665.32 | 2841193.56 | 3089212.43 | 124154.35 | 4.2% |

I want to draw special attention to one remarkable fact.

With all cores of the virtual machines fully loaded in zones A and B, total performance is LOWER than with only two of the four cores loaded.

Moreover, I took additional machines in one of the zones and ran the test on them: the problem did not go away.

I assume this is a technical problem related to the hardware characteristics of the machines used as hypervisors and how those are accounted for when allocating resources (a similar case comes to mind). Or perhaps something else; I cannot look inside and do not want to guess too much.

Hopefully, the folks from Yandex.Cloud will read this article and do something about it, and if we are really lucky, they will explain what it is. As it stands, it is somewhat insulting and occasionally harmful: many applications use the core count to decide how many threads to run.

Mail.RU Cloud (MCS)


Mail.ru has only two availability zones, so two of the tests were run on different machines in the same zone.

Results for the Moscow-East zone (the first VM):

Results table
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 487.00 | 538.00 | 534.00 | 519.67 | 28.36 |
| FIO WRITE IOPS | 209.00 | 231.00 | 229.00 | 223.00 | 12.17 |
| STRESS-NG 1 CPU | 7359.00 | 6567.00 | 7022.00 | 6982.67 | 397.46 |
| STRESS-NG 2 CPU | 14144.00 | 14916.00 | 13137.00 | 14065.67 | 892.08 |
| STRESS-NG 4 CPU | 21381.00 | 21199.00 | 21032.00 | 21204.00 | 174.55 |
| Sysbench CPU for 1 | 15.54 | 16.20 | 14.98 | 15.57 | 0.61 |
| Sysbench CPU for 2 | 7.30 | 7.70 | 7.53 | 7.51 | 0.20 |
| Sysbench CPU for 4 | 4.02 | 4.09 | 3.79 | 3.96 | 0.16 |
| Sysbench Mem t 1 | 1117493.99 | 1161261.85 | 1423941.92 | 1234232.59 | 165744.17 |
| Sysbench Mem t 2 | 1819474.62 | 1692128.17 | 1668347.81 | 1726650.20 | 81262.88 |
| Sysbench Mem t 4 | 2357943.97 | 2379492.56 | 2312976.14 | 2350137.56 | 33938.38 |


Results for the Moscow-East zone (the second VM):

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 475.00 | 509.00 | 472.00 | 485.33 | 20.55 |
| FIO WRITE IOPS | 205.00 | 218.00 | 204.00 | 209.00 | 7.81 |
| STRESS-NG 1 CPU | 6953.00 | 7030.00 | 7127.00 | 7036.67 | 87.19 |
| STRESS-NG 2 CPU | 14623.00 | 13945.00 | 13523.00 | 14030.33 | 554.94 |
| STRESS-NG 4 CPU | 27022.00 | 27184.00 | 27670.00 | 27292.00 | 337.23 |
| Sysbench CPU for 1 | 14.88 | 13.44 | 14.45 | 14.26 | 0.74 |
| Sysbench CPU for 2 | 6.89 | 7.13 | 6.69 | 6.90 | 0.22 |
| Sysbench CPU for 4 | 3.52 | 3.49 | 3.68 | 3.57 | 0.10 |
| Sysbench Mem t 1 | 1129165.42 | 1238462.80 | 1344025.16 | 1237217.79 | 107435.28 |
| Sysbench Mem t 2 | 1904396.37 | 1740914.98 | 1733216.87 | 1792842.74 | 96684.92 |
| Sysbench Mem t 4 | 2416702.17 | 2437844.98 | 2384159.80 | 2412902.32 | 27043.55 |


Results for the zone "Moscow-North":

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 510.00 | 647.00 | 613.00 | 590.00 | 71.34 |
| FIO WRITE IOPS | 218.00 | 277.00 | 262.00 | 252.33 | 30.66 |
| STRESS-NG 1 CPU | 9657.00 | 9742.00 | 9867.00 | 9755.33 | 105.63 |
| STRESS-NG 2 CPU | 19251.00 | 20069.00 | 19677.00 | 19665.67 | 409.12 |
| STRESS-NG 4 CPU | 39020.00 | 38665.00 | 38461.00 | 38715.33 | 282.88 |
| Sysbench CPU for 1 | 12.45 | 12.53 | 12.66 | 12.55 | 0.11 |
| Sysbench CPU for 2 | 6.25 | 6.20 | 6.22 | 6.22 | 0.02 |
| Sysbench CPU for 4 | 3.18 | 3.16 | 3.16 | 3.17 | 0.01 |
| Sysbench Mem t 1 | 2003899.51 | 1990350.38 | 1974380.86 | 1989543.58 | 14775.85 |
| Sysbench Mem t 2 | 1990419.20 | 2022621.53 | 1934822.52 | 1982621.08 | 44415.93 |
| Sysbench Mem t 4 | 2337084.52 | 2227633.06 | 2021779.21 | 2195498.93 | 160090.01 |


Summary results:

| Test | Average | Avg min | Avg max | Stdev | StDev% |
|---|---|---|---|---|---|
| FIO READ IOPS | 531.67 | 485.33 | 590.00 | 53.36 | 10.0% |
| FIO WRITE IOPS | 228.11 | 209.00 | 252.33 | 22.11 | 9.7% |
| STRESS-NG 1 CPU | 7924.89 | 6982.67 | 9755.33 | 1585.44 | 20.0% |
| STRESS-NG 2 CPU | 15920.56 | 14030.33 | 19665.67 | 3243.41 | 20.4% |
| STRESS-NG 4 CPU | 29070.44 | 21204.00 | 38715.33 | 8890.10 | 30.6% |
| Sysbench CPU for 1 | 14.13 | 12.55 | 15.57 | 1.52 | 10.7% |
| Sysbench CPU for 2 | 6.88 | 6.22 | 7.51 | 0.64 | 9.3% |
| Sysbench CPU for 4 | 3.57 | 3.17 | 3.96 | 0.40 | 11.2% |
| Sysbench Mem t 1 | 1486997.99 | 1234232.59 | 1989543.58 | 435219.81 | 29.3% |
| Sysbench Mem t 2 | 1834038.01 | 1726650.20 | 1982621.08 | 132864.82 | 7.2% |
| Sysbench Mem t 4 | 2319512.93 | 2195498.93 | 2412902.32 | 111890.39 | 4.8% |

Interestingly, the four-thread performance degradation problem is absent here, and it seems that honest (though rather weak) cores are provided.

Also, the "North" zone uses much more powerful processors than the "East" zone; the performance difference at full load reaches a factor of two. For the same money. Draw your own conclusions.

Selectel


Testing it produced very interesting results. In absolute terms, it provides the most powerful 4-core machines of all the providers tested.

Results for the zone "Moscow - Berzarina-1":

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 2319.00 | 2294.00 | 2312.00 | 2308.33 | 12.90 |
| FIO WRITE IOPS | 998.00 | 986.00 | 995.00 | 993.00 | 6.24 |
| STRESS-NG 1 CPU | 11320.00 | 11038.00 | 10936.00 | 11098.00 | 198.91 |
| STRESS-NG 2 CPU | 23164.00 | 22093.00 | 22558.00 | 22605.00 | 537.04 |
| STRESS-NG 4 CPU | 43879.00 | 44118.00 | 44086.00 | 44027.67 | 129.74 |
| Sysbench CPU for 1 | 12.01 | 11.96 | 11.97 | 11.98 | 0.02 |
| Sysbench CPU for 2 | 6.01 | 5.99 | 5.99 | 6.00 | 0.02 |
| Sysbench CPU for 4 | 3.01 | 3.00 | 3.00 | 3.00 | 0.01 |
| Sysbench Mem t 1 | 2158876.40 | 2162098.22 | 2158738.03 | 2159904.22 | 1901.32 |
| Sysbench Mem t 2 | 2413547.34 | 2340801.67 | 2569554.40 | 2441301.14 | 116874.54 |
| Sysbench Mem t 4 | 2858920.38 | 2935705.54 | 2714476.62 | 2836367.51 | 112325.57 |


Results for the zone "Moscow - Berzarina-2":

Results table
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 1735.00 | 1729.00 | 1724.00 | 1729.33 | 5.51 |
| FIO WRITE IOPS | 745.00 | 742.00 | 740.00 | 742.33 | 2.52 |
| STRESS-NG 1 CPU | 18231.00 | 18462.00 | 18518.00 | 18403.67 | 152.13 |
| STRESS-NG 2 CPU | 36965.00 | 36495.00 | 37006.00 | 36822.00 | 283.93 |
| STRESS-NG 4 CPU | 74272.00 | 74428.00 | 74218.00 | 74306.00 | 109.05 |
| Sysbench CPU for 1 | 11.22 | 11.17 | 11.15 | 11.18 | 0.03 |
| Sysbench CPU for 2 | 5.60 | 5.60 | 5.60 | 5.60 | 0.00 |
| Sysbench CPU for 4 | 2.83 | 2.81 | 2.81 | 2.82 | 0.01 |
| Sysbench Mem t 1 | 2396762.92 | 2405750.19 | 2394240.05 | 2398917.72 | 6050.06 |
| Sysbench Mem t 2 | 1980511.45 | 2079328.96 | 1968664.26 | 2009501.56 | 60761.74 |
| Sysbench Mem t 4 | 2283159.05 | 2271698.71 | 2299665.98 | 2284841.25 | 14059.32 |


Results for the SPB - Dubrovka-1 zone:

Results table
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 2550.00 | 2618.00 | 2666.00 | 2611.33 | 58.29 |
| FIO WRITE IOPS | 1096.00 | 1126.00 | 1147.00 | 1123.00 | 25.63 |
| STRESS-NG 1 CPU | 10801.00 | 10512.00 | 11175.00 | 10829.33 | 332.41 |
| STRESS-NG 2 CPU | 21418.00 | 21642.00 | 23179.00 | 22079.67 | 958.62 |
| STRESS-NG 4 CPU | 44183.00 | 44557.00 | 43012.00 | 43917.33 | 806.03 |
| Sysbench CPU for 1 | 11.97 | 11.99 | 11.99 | 11.99 | 0.01 |
| Sysbench CPU for 2 | 5.99 | 5.99 | 6.00 | 5.99 | 0.01 |
| Sysbench CPU for 4 | 3.02 | 3.00 | 3.00 | 3.01 | 0.01 |
| Sysbench Mem t 1 | 2159958.70 | 2162062.66 | 2158540.58 | 2160187.31 | 1772.13 |
| Sysbench Mem t 2 | 2430650.73 | 2512678.85 | 2417945.57 | 2453758.38 | 51420.53 |
| Sysbench Mem t 4 | 3171660.68 | 3018827.14 | 3343661.47 | 3178049.76 | 162511.39 |


Summary table with the results:
| Test | Average | Avg min | Avg max | Stdev | StDev% |
|---|---|---|---|---|---|
| FIO READ IOPS | 2216.33 | 1729.33 | 2611.33 | 448.14 | 20.2% |
| FIO WRITE IOPS | 952.78 | 742.33 | 1123.00 | 193.49 | 20.3% |
| STRESS-NG 1 CPU | 13443.67 | 10829.33 | 18403.67 | 4297.59 | 32.0% |
| STRESS-NG 2 CPU | 27168.89 | 22079.67 | 36822.00 | 8363.96 | 30.8% |
| STRESS-NG 4 CPU | 54083.67 | 43917.33 | 74306.00 | 17513.14 | 32.4% |
| Sysbench CPU for 1 | 11.72 | 11.18 | 11.99 | 0.46 | 4.0% |
| Sysbench CPU for 2 | 5.86 | 5.60 | 6.00 | 0.23 | 3.9% |
| Sysbench CPU for 4 | 2.94 | 2.82 | 3.01 | 0.11 | 3.7% |
| Sysbench Mem t 1 | 2239669.75 | 2159904.22 | 2398917.72 | 137912.86 | 6.2% |
| Sysbench Mem t 2 | 2301520.36 | 2009501.56 | 2453758.38 | 252972.39 | 11.0% |
| Sysbench Mem t 4 | 2766419.51 | 2284841.25 | 3178049.76 | 450693.81 | 16.3% |

As I have already said, of all those tested, this provider offers the highest-performing machines across 4 threads. But here, too, there is a quirk: again, for the same money we get performance that differs by almost a factor of two; compare the Berzarina-2 results with the rest.

I would also like to note the very fast disks at a reasonable price, the best among the three domestic providers tested. At the same time, the machine with the fastest processor had the slowest disk of the three.

It turns out to be a kind of lottery, though even if you are unlucky, everything is still very, very decent.

Google cloud


The GCE test results brought no particular surprises.

Everything is completely predictable, homogeneous and in general corresponds to the declared.

Results for europe-west1-b zone:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 924.00 | 910.00 | 888.00 | 907.33 | 18.15 |
| FIO WRITE IOPS | 396.00 | 391.00 | 380.00 | 389.00 | 8.19 |
| STRESS-NG 1 CPU | 14237.00 | 14137.00 | 14094.00 | 14156.00 | 73.37 |
| STRESS-NG 2 CPU | 28576.00 | 28419.00 | 28544.00 | 28513.00 | 82.96 |
| STRESS-NG 4 CPU | 29996.00 | 29880.00 | 29449.00 | 29775.00 | 288.22 |
| Sysbench CPU for 1 | 12.63 | 12.66 | 12.67 | 12.65 | 0.02 |
| Sysbench CPU for 2 | 6.52 | 6.41 | 6.38 | 6.44 | 0.08 |
| Sysbench CPU for 4 | 3.35 | 3.56 | 3.56 | 3.49 | 0.12 |
| Sysbench Mem t 1 | 2055240.49 | 2056617.63 | 2054720.94 | 2055526.35 | 980.13 |
| Sysbench Mem t 2 | 1377683.73 | 1346931.63 | 1397680.79 | 1374098.72 | 25563.81 |
| Sysbench Mem t 4 | 2279937.89 | 2275427.56 | 2278615.94 | 2277993.80 | 2318.63 |


Results for europe-west1-c zone:

Test results
FIO READ IOPS946.00995.00984.00975.0025.71
FIO WRITE IOPS406.00428.00422.00418.6711.37
STRESS-NG 1 CPU14256.0014250.0014423.0014309.6798.20
STRESS-NG 2 CPU28875.0029057.0029256.0029062.67190.56
STRESS-NG 4 CPU30317.0030462.0029478.0030085.67531.23
Sysbench CPU for 112.5212.4912.6112.540.06
Sysbench CPU for 26.286.306.316.290.02
Sysbench CPU for 43.383.573.523.490.10
Sysbench Mem t 12085832.842066794.242086303.392079643.4911130.26
Sysbench mem t 21368168.111535725.511710618.591538170.74171238.33
Sysbench mem t 42375534.542307610.222386046.892356397.2242576.47


Results for europe-west1-d zone:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 885.00 | 910.00 | 943.00 | 912.67 | 29.09 |
| FIO WRITE IOPS | 379.00 | 390.00 | 405.00 | 391.33 | 13.05 |
| STRESS-NG 1 CPU | 14254.00 | 14230.00 | 14008.00 | 14164.00 | 135.63 |
| STRESS-NG 2 CPU | 28262.00 | 28321.00 | 28473.00 | 28352.00 | 108.86 |
| STRESS-NG 4 CPU | 29615.00 | 29312.00 | 29138.00 | 29355.00 | 241.39 |
| Sysbench CPU for 1 | 12.61 | 12.65 | 12.66 | 12.64 | 0.03 |
| Sysbench CPU for 2 | 6.37 | 6.35 | 6.35 | 6.36 | 0.01 |
| Sysbench CPU for 4 | 3.43 | 3.56 | 3.55 | 3.52 | 0.07 |
| Sysbench Mem t 1 | 2050031.60 | 2068677.64 | 2052707.70 | 2057138.98 | 10081.96 |
| Sysbench Mem t 2 | 1228313.90 | 1530374.73 | 1345581.79 | 1368090.14 | 152283.14 |
| Sysbench Mem t 4 | 2335035.15 | 2420871.72 | 2361505.39 | 2372470.75 | 43956.33 |


Summary table with the results:
| Test | Average | Avg min | Avg max | Stdev | StDev% |
|---|---|---|---|---|---|
| FIO READ IOPS | 931.67 | 907.33 | 975.00 | 37.62 | 4.0% |
| FIO WRITE IOPS | 399.67 | 389.00 | 418.67 | 16.50 | 4.1% |
| STRESS-NG 1 CPU | 14209.89 | 14156.00 | 14309.67 | 86.50 | 0.6% |
| STRESS-NG 2 CPU | 28642.56 | 28352.00 | 29062.67 | 372.63 | 1.3% |
| STRESS-NG 4 CPU | 29738.56 | 29355.00 | 30085.67 | 366.69 | 1.2% |
| Sysbench CPU for 1 | 12.61 | 12.54 | 12.65 | 0.06 | 0.5% |
| Sysbench CPU for 2 | 6.36 | 6.29 | 6.44 | 0.07 | 1.1% |
| Sysbench CPU for 4 | 3.50 | 3.49 | 3.52 | 0.01 | 0.4% |
| Sysbench Mem t 1 | 2064102.94 | 2055526.35 | 2079643.49 | 13482.64 | 0.7% |
| Sysbench Mem t 2 | 1426786.53 | 1368090.14 | 1538170.74 | 96508.32 | 6.8% |
| Sysbench Mem t 4 | 2335620.59 | 2277993.80 | 2372470.75 | 50549.23 | 2.2% |

There is nothing to even comment on.

Performance with 4 threads hardly differs from 2, but it does not degrade either.

In general, each core is quite productive and stacks up well against a core of the out-of-competition reference machine, which itself can hardly be called weak.

The disks are no stars, but for most tasks they will be more than enough.

The only thing worth mentioning is the excellent homogeneity. Each of the machines differs in performance by no more than the measurement error, which gives excellent predictability and ease of planning.

AWS


The market leader. Its test somewhat surprised me, since it exhibits the same problem that showed up with Yandex.Cloud.

Although I have worked with it for quite a long time, I somehow never got around to investigating the performance difference between full and partial load, so the results were to some extent a surprise to me.

For testing, the c5.xlarge type was used, as the cheapest one matching the requirements.

Results for the eu-central-1a zone:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 1839.00 | 1976.00 | 2083.00 | 1966.00 | 122.31 |
| FIO WRITE IOPS | 789.00 | 850.00 | 895.00 | 844.67 | 53.20 |
| STRESS-NG 1 CPU | 21422.00 | 21722.00 | 21736.00 | 21626.67 | 177.38 |
| STRESS-NG 2 CPU | 43305.00 | 43331.00 | 43197.00 | 43277.67 | 71.06 |
| STRESS-NG 4 CPU | 40876.00 | 40884.00 | 40888.00 | 40882.67 | 6.11 |
| Sysbench CPU for 1 | 8.77 | 8.77 | 8.77 | 8.77 | 0.00 |
| Sysbench CPU for 2 | 4.40 | 4.40 | 4.40 | 4.40 | 0.00 |
| Sysbench CPU for 4 | 2.52 | 2.52 | 2.52 | 2.52 | 0.00 |
| Sysbench Mem t 1 | 3063495.18 | 3064238.67 | 3063452.11 | 3063728.65 | 442.21 |
| Sysbench Mem t 2 | 1848705.16 | 1841708.24 | 1751938.22 | 1814117.21 | 53962.11 |
| Sysbench Mem t 4 | 2413033.89 | 2249609.19 | 2299986.20 | 2320876.43 | 83691.15 |


Results for the zone eu-central-1b:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 1723.00 | 1988.00 | 2101.00 | 1937.33 | 194.03 |
| FIO WRITE IOPS | 739.00 | 855.00 | 903.00 | 832.33 | 84.32 |
| STRESS-NG 1 CPU | 21785.00 | 21733.00 | 21741.00 | 21753.00 | 28.00 |
| STRESS-NG 2 CPU | 43370.00 | 43323.00 | 40351.00 | 42348.00 | 1729.61 |
| STRESS-NG 4 CPU | 40857.00 | 40864.00 | 40916.00 | 40879.00 | 32.23 |
| Sysbench CPU for 1 | 8.77 | 8.77 | 8.77 | 8.77 | 0.00 |
| Sysbench CPU for 2 | 4.39 | 4.40 | 4.39 | 4.39 | 0.00 |
| Sysbench CPU for 4 | 2.52 | 2.52 | 2.52 | 2.52 | 0.00 |
| Sysbench Mem t 1 | 3065227.23 | 3065688.95 | 3063830.23 | 3064915.47 | 967.78 |
| Sysbench Mem t 2 | 2032840.35 | 1987864.46 | 1968489.39 | 1996398.07 | 33013.31 |
| Sysbench Mem t 4 | 2684716.32 | 2654257.87 | 2618592.53 | 2652522.24 | 33096.05 |


Results for the eu-central-1c zone:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 1761.00 | 2003.00 | 2108.00 | 1957.33 | 177.95 |
| FIO WRITE IOPS | 756.00 | 861.00 | 906.00 | 841.00 | 76.97 |
| STRESS-NG 1 CPU | 21632.00 | 21708.00 | 21615.00 | 21651.67 | 49.52 |
| STRESS-NG 2 CPU | 43247.00 | 43236.00 | 43283.00 | 43255.33 | 24.58 |
| STRESS-NG 4 CPU | 39931.00 | 39359.00 | 40835.00 | 40041.67 | 744.20 |
| Sysbench CPU for 1 | 8.77 | 8.77 | 8.77 | 8.77 | 0.00 |
| Sysbench CPU for 2 | 4.40 | 4.40 | 4.40 | 4.40 | 0.00 |
| Sysbench CPU for 4 | 2.52 | 2.52 | 2.52 | 2.52 | 0.00 |
| Sysbench Mem t 1 | 3064343.66 | 3064434.20 | 2998820.16 | 3042532.67 | 37856.17 |
| Sysbench Mem t 2 | 2235882.60 | 2088501.51 | 2166875.91 | 2163753.34 | 73740.15 |
| Sysbench Mem t 4 | 2870035.79 | 2813221.50 | 2771999.66 | 2818418.98 | 49224.29 |


Summary table of results:
| Test | Average | Avg min | Avg max | Stdev | StDev% |
|---|---|---|---|---|---|
| FIO READ IOPS | 1953.56 | 1937.33 | 1966.00 | 14.70 | 0.8% |
| FIO WRITE IOPS | 839.33 | 832.33 | 844.67 | 6.33 | 0.8% |
| STRESS-NG 1 CPU | 21677.11 | 21626.67 | 21753.00 | 66.90 | 0.3% |
| STRESS-NG 2 CPU | 42960.33 | 42348.00 | 43277.67 | 530.41 | 1.2% |
| STRESS-NG 4 CPU | 40601.11 | 40041.67 | 40882.67 | 484.50 | 1.2% |
| Sysbench CPU for 1 | 8.77 | 8.77 | 8.77 | 0.00 | 0.0% |
| Sysbench CPU for 2 | 4.40 | 4.39 | 4.40 | 0.00 | 0.1% |
| Sysbench CPU for 4 | 2.52 | 2.52 | 2.52 | 0.00 | 0.1% |
| Sysbench Mem t 1 | 3057058.93 | 3042532.67 | 3064915.47 | 12594.10 | 0.4% |
| Sysbench Mem t 2 | 1991422.87 | 1814117.21 | 2163753.34 | 174871.16 | 8.8% |
| Sysbench Mem t 4 | 2597272.55 | 2320876.43 | 2818418.98 | 253330.90 | 9.8% |

As I said above - the results surprised me.

Yes, I understand that the problem manifests itself only under certain types of load (it is not visible in Sysbench), but given the results of the other platforms, this is clearly not a problem with the test but a performance limitation.

In AWS's defense, I can say that when creating a machine it lets you disable Hyper-Threading, which at least helps avoid the performance drop in some applications.

Otherwise, the disks do not guarantee this level of performance, but they support Burst to smooth out load spikes, so if you need to read/write relatively large amounts quickly but not very often (say, once every few minutes), everything will be fine.

Also, the homogeneity of the results is simply excellent, everything is predictable and without surprises.

Azure


Initially I did not want to include it in the test, because I had never really worked with it and did not even have an account there. But on reflection I decided to test it anyway, even at the cost of a bill, which I paid.

Right away I want to explain that the region was chosen on the principle of "somewhere in Europe," and the machine type fully matches the conditions (4 processors, 8 GB of memory).
In the first iteration of the test it was A4 v2, marked "General purpose," with which this article was first published. Experts in the comments explained what I did wrong: Azure has machine types where a slower one can cost more than a faster one, and you will not find this out without reading the documentation or googling. After that, the results were updated for the F4s type.

Results for France-Central-1:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 1066.00 | 1102.00 | 1038.00 | 1068.67 | 32.08 |
| FIO WRITE IOPS | 457.00 | 473.00 | 445.00 | 458.33 | 14.05 |
| STRESS-NG 1 CPU | 9470.00 | 10059.00 | 10759.00 | 10096.00 | 645.30 |
| STRESS-NG 2 CPU | 20424.00 | 20502.00 | 20940.00 | 20622.00 | 278.14 |
| STRESS-NG 4 CPU | 39039.00 | 39294.00 | 39141.00 | 39158.00 | 128.35 |
| Sysbench CPU for 1 | 10.32 | 10.42 | 10.50 | 10.42 | 0.09 |
| Sysbench CPU for 2 | 5.35 | 5.35 | 5.33 | 5.35 | 0.01 |
| Sysbench CPU for 4 | 2.77 | 2.78 | 2.76 | 2.77 | 0.01 |
| Sysbench Mem t 1 | 2449793.14 | 2467589.35 | 2456056.19 | 2457812.89 | 9027.22 |
| Sysbench Mem t 2 | 2370286.78 | 2388077.81 | 2299377.92 | 2352580.84 | 46925.93 |
| Sysbench Mem t 4 | 2697042.08 | 2625447.20 | 2707918.64 | 2676802.64 | 44806.37 |


Results for France-Central-2:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 1037.00 | 1104.00 | 1102.00 | 1081.00 | 38.12 |
| FIO WRITE IOPS | 445.00 | 473.00 | 473.00 | 463.67 | 16.17 |
| STRESS-NG 1 CPU | 10159.00 | 10360.00 | 10452.00 | 10323.67 | 149.84 |
| STRESS-NG 2 CPU | 21027.00 | 20025.00 | 20415.00 | 20489.00 | 505.08 |
| STRESS-NG 4 CPU | 39530.00 | 40927.00 | 40170.00 | 40209.00 | 699.32 |
| Sysbench CPU for 1 | 10.39 | 9.95 | 9.91 | 10.08 | 0.27 |
| Sysbench CPU for 2 | 5.09 | 5.13 | 5.19 | 5.14 | 0.05 |
| Sysbench CPU for 4 | 2.69 | 2.75 | 2.66 | 2.70 | 0.04 |
| Sysbench Mem t 1 | 2568336.75 | 2450640.64 | 2567906.16 | 2528961.18 | 67827.92 |
| Sysbench Mem t 2 | 2401273.88 | 2362027.64 | 2372950.76 | 2378750.76 | 20255.79 |
| Sysbench Mem t 4 | 2740927.62 | 2787787.19 | 2770497.39 | 2766404.07 | 23696.44 |


Results for France-Central-3:

Test results
| Test | Iter 1 | Iter 2 | Iter 3 | Average | Stdev |
|---|---|---|---|---|---|
| FIO READ IOPS | 1436.00 | 830.00 | 1136.00 | 1134.00 | 303.00 |
| FIO WRITE IOPS | 614.00 | 355.00 | 487.00 | 485.33 | 129.51 |
| STRESS-NG 1 CPU | 10834.00 | 10326.00 | 10763.00 | 10641.00 | 275.10 |
| STRESS-NG 2 CPU | 21505.00 | 21108.00 | 21428.00 | 21347.00 | 210.53 |
| STRESS-NG 4 CPU | 42194.00 | 41540.00 | 41427.00 | 41720.33 | 414.08 |
| Sysbench CPU for 1 | 9.87 | 9.75 | 9.79 | 9.80 | 0.06 |
| Sysbench CPU for 2 | 5.04 | 5.05 | 5.13 | 5.08 | 0.05 |
| Sysbench CPU for 4 | 2.67 | 2.65 | 2.67 | 2.66 | 0.01 |
| Sysbench Mem t 1 | 2622263.24 | 2616326.80 | 2632668.25 | 2623752.76 | 8271.93 |
| Sysbench Mem t 2 | 2495841.62 | 2438685.04 | 2556294.51 | 2496940.39 | 58812.43 |
| Sysbench Mem t 4 | 2814306.59 | 2783117.34 | 2846909.91 | 2814777.95 | 31898.90 |


Summary table of results:
| Test | Average | Avg min | Avg max | Stdev | StDev% |
|---|---|---|---|---|---|
| FIO READ IOPS | 1094.56 | 1068.67 | 1134.00 | 34.71 | 3.2% |
| FIO WRITE IOPS | 469.11 | 458.33 | 485.33 | 14.30 | 3.0% |
| STRESS-NG 1 CPU | 10353.56 | 10096.00 | 10641.00 | 273.73 | 2.6% |
| STRESS-NG 2 CPU | 20819.33 | 20489.00 | 21347.00 | 461.79 | 2.2% |
| STRESS-NG 4 CPU | 40362.44 | 39158.00 | 41720.33 | 1288.04 | 3.2% |
| Sysbench CPU for 1 | 10.10 | 9.80 | 10.42 | 0.31 | 3.0% |
| Sysbench CPU for 2 | 5.19 | 5.08 | 5.35 | 0.14 | 2.7% |
| Sysbench CPU for 4 | 2.71 | 2.66 | 2.77 | 0.05 | 2.0% |
| Sysbench Mem t 1 | 2536842.28 | 2457812.89 | 2623752.76 | 83250.19 | 3.3% |
| Sysbench Mem t 2 | 2409424.00 | 2352580.84 | 2496940.39 | 76912.65 | 3.2% |
| Sysbench Mem t 4 | 2752661.55 | 2676802.64 | 2814777.95 | 70006.71 | 2.5% |

Good performance, among the best of the platforms presented. Sadly, the price spoils everything.

Results


Performance


Let's start with a pivot table of results.

It is inserted as an image because I wanted to use colors, but the data in it comes from the tables presented above.


The smaller the better

Let's take a closer look at CPU performance:



Overall, AWS holds the lead in average measured performance for single- and dual-core loads. Google Cloud comes second.

Of the Russian providers, Selectel showed the best results. Besides third place under partial load, it takes a clear first place when all cores are loaded, even accounting for the uneven results between zones (which is unpleasant, but in this case does not affect the outcome).

Now memory:



In memory speed, AWS holds the lead for single-threaded mode, Azure for two threads, and Yandex.Cloud for four.

Disks:



In disk speed we have an unambiguous winner: Selectel. None of the other participants in the comparison offers anything comparable for similar money.

In second place is AWS, thanks to its allowed Burst and generally decent speed.
It is followed by GCE and Azure, with Yandex.Cloud and MCS closing the list, offering solutions of roughly equal performance.

Cost relative to performance


Now let's talk about one more interesting factor: cost.

This comparison in no way covers the total cost of solutions on the different platforms; its goal is simple: to compare the cost of a unit of performance across providers.

We will base the calculation on the stress-ng test.
Estimated prices for one month of use of each instance (excluding VAT):
| Provider | Yandex.Cloud | MCS | Selectel | GCE | AWS | Azure |
|---|---|---|---|---|---|---|
| Price (cur) | 3799.12 | 3608 | 4050.62 | 103.08 | 147.57 | 147.46 |
| Price (rub) | 3799.12 | 3608 | 4050.62 | 6747.6168 | 9659.9322 | 9652.7316 |
| Alt price (cur) | 3799.12 | 3608 | 3454.94 | 35.60 | 56.07 | 147.46 |
| Alt price (rub) | 3799.12 | 3608 | 3454.94 | 2330.376 | 3670.3422 | 9652.7316 |

The cost table requires some explanation.

For providers that offer the cost-reduction options described at the beginning of the article, there are two prices: the base one and an alternative one calculated with those options taken into account.

Since this is not a discount and depends on usage scenarios, which vary, I thought it a good idea to calculate the cost with these options as well.

Also, because of the currency difference, the cost of AWS, Azure (yes, I know it can somehow show prices in rubles, but its calculator showed me dollars) and GCE has been converted to a ruble equivalent at a rate of 65.46 rubles per US dollar.
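The conversion is a straight multiplication by that rate; as a check, here it is applied to the monthly USD prices from the table above, reproducing the ruble row:

```python
USD_TO_RUB = 65.46  # exchange rate used throughout the comparison

# Monthly USD prices from the providers' calculators (from the table above)
usd_prices = {"GCE": 103.08, "AWS": 147.57, "Azure": 147.46}

# Ruble equivalents, as listed in the "Price (rub)" row
rub_prices = {name: round(usd * USD_TO_RUB, 4) for name, usd in usd_prices.items()}
print(rub_prices)
```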

Also, for Azure I could not isolate the cost of the disk: the standard instance disk there is 16 GB, and it is not very clear from the calculator what a disk would really cost (the number of requests is counted too), so the price is given for the instance itself only. This does not change the overall picture anyway: Azure remains the most expensive.

So, the cost of each solution, expressed in rubles per "parrot" in the stress-ng test, for the minimum amount of resources obtained in the test:
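The "rubles per parrot" metric is just the monthly price divided by the stress-ng bogo-ops figure. A sketch using the Selectel numbers from the tables above (monthly price without VAT, minimum Stress-NG 4 CPU result):

```python
# Monthly price without VAT and the minimum Stress-NG 4 CPU result
# for Selectel, both taken from the tables above
price_rub = 4050.62
min_bogo_ops_4cpu = 43917.33

# Cost of one stress-ng "parrot" per month: lower is better
rub_per_parrot = price_rub / min_bogo_ops_4cpu
print(f"{rub_per_parrot:.4f} rub per parrot")
```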


Lower is better

If we calculate based on the average test results instead, the picture does not fundamentally change, but some things do shift:


Lower is better

It turns out that, counted without VAT, Selectel wins in every category, and in the heavy-load category by nearly a factor of two.

Now let's see what happens if we recalculate the cost taking into account the possible savings depending on the usage scenario.

The alternative cost, accounting for scenario-based savings, in rubles per parrot in the stress-ng test for the minimum amount of resources obtained in the test:


Lower is better

The same, but for the average amount of resources:



Here the picture changes.

In all scenarios except constant heavy full load, AWS and GCE pull ahead by a decent margin, with practically identical cost per unit of resources.

Under heavy load, Selectel competes with them, offering resources for practically the same money but with fewer concessions (after all, its nodes are permanent and do not shut down at an arbitrary moment, unlike AWS Spot and Google Preemptible instances).

That is how, with a careful and competent approach to architecture, you can save a great deal seemingly out of thin air.

In lieu of conclusions


The test turned out long, but to me it was interesting.

I drew some conclusions from the results for myself; I hope it also helps you look at cloud platform performance from a slightly different angle, perhaps eases the agony of choice a bit, and aids in diagnosing performance problems on some platforms caused by the "features" uncovered.
**UPDATE** Selectel's conclusions and prices have been updated, since VAT was accounted for incorrectly in them.
**UPDATE2** Azure results have been updated for the new node type and the conclusions revised, but fundamentally nothing changed.

Source: https://habr.com/ru/post/439690/