<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.fnord.greeley.co.us/mediawiki/index.php?action=history&amp;feed=atom&amp;title=Ceph_performance_metrics</id>
	<title>Ceph performance metrics - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.fnord.greeley.co.us/mediawiki/index.php?action=history&amp;feed=atom&amp;title=Ceph_performance_metrics"/>
	<link rel="alternate" type="text/html" href="https://wiki.fnord.greeley.co.us/mediawiki/index.php?title=Ceph_performance_metrics&amp;action=history"/>
	<updated>2026-05-06T14:25:13Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.17</generator>
	<entry>
		<id>https://wiki.fnord.greeley.co.us/mediawiki/index.php?title=Ceph_performance_metrics&amp;diff=1490&amp;oldid=prev</id>
		<title>Adj at 14:28, 24 July 2025</title>
		<link rel="alternate" type="text/html" href="https://wiki.fnord.greeley.co.us/mediawiki/index.php?title=Ceph_performance_metrics&amp;diff=1490&amp;oldid=prev"/>
		<updated>2025-07-24T14:28:02Z</updated>

		<summary type="html">&lt;p&gt;&lt;/p&gt;
&lt;table style=&quot;background-color: #fff; color: #202122;&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;col class=&quot;diff-marker&quot; /&gt;
				&lt;col class=&quot;diff-content&quot; /&gt;
				&lt;tr class=&quot;diff-title&quot; lang=&quot;en&quot;&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan=&quot;2&quot; style=&quot;background-color: #fff; color: #202122; text-align: center;&quot;&gt;Revision as of 14:28, 24 July 2025&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 11:&lt;/td&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 11:&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* After the S1 agent is installed, repeat the benchmarks.  Investigate any decrease in benchmark values of more than 5% and determine the cause.&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* After the S1 agent is installed, repeat the benchmarks.  Investigate any decrease in benchmark values of more than 5% and determine the cause.&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* When installation and benchmarking are completed on the low sensitivity servers, proceed to the next group and repeat.&lt;/div&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;background-color: #f8f9fa; color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #eaecf0; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* When installation and benchmarking are completed on the low sensitivity servers, proceed to the next group and repeat.&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;br /&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;References:&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [https://www.thomas-krenn.com/en/wiki/Ceph_Perfomance_Guide_-_Sizing_%26_Testing Ceph Performance Guide]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [https://www.cloudseedrive.com/benchmarking-amazon-s3-performance/ Benchmarking Amazon S3]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
  &lt;td colspan=&quot;2&quot; class=&quot;diff-empty diff-side-deleted&quot;&gt;&lt;/td&gt;
  &lt;td class=&quot;diff-marker&quot; data-marker=&quot;+&quot;&gt;&lt;/td&gt;
  &lt;td style=&quot;color: #202122; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;* [https://documentation.alluxio.io/ee-ai-en/benchmark/cosbench COSBench (S3) Benchmark]&lt;/div&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/table&gt;</summary>
		<author><name>Adj</name></author>
	</entry>
	<entry>
		<id>https://wiki.fnord.greeley.co.us/mediawiki/index.php?title=Ceph_performance_metrics&amp;diff=1489&amp;oldid=prev</id>
		<title>Adj: Created page with &quot;This has come up because the Sentinel 1 endpoint detection and response (EDR) agent is being installed across all our servers.  In order to minimize potential customer impact we will: * Divide servers into three groups based on client IO sensitivity.  Purely development environments being low sensitivity, and certain database workload being highly sensitive.  S3 workloads will probably fall in the middle. * In each group, before S1 agent is installed and running, gather...&quot;</title>
		<link rel="alternate" type="text/html" href="https://wiki.fnord.greeley.co.us/mediawiki/index.php?title=Ceph_performance_metrics&amp;diff=1489&amp;oldid=prev"/>
		<updated>2025-07-24T14:25:07Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;This has come up because the Sentinel 1 endpoint detection and response (EDR) agent is being installed across all our servers.  In order to minimize potential customer impact we will: * Divide servers into three groups based on client IO sensitivity.  Purely development environments being low sensitivity, and certain database workload being highly sensitive.  S3 workloads will probably fall in the middle. * In each group, before S1 agent is installed and running, gather...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;This plan has come up because the SentinelOne (S1) endpoint detection and response (EDR) agent is being installed across all our servers.  To minimize potential customer impact we will:&lt;br /&gt;
* Divide servers into three groups based on client IO sensitivity.  Purely development environments are low sensitivity, and certain database workloads are highly sensitive.  S3 workloads will probably fall in the middle.&lt;br /&gt;
* In each group, before the S1 agent is installed and running, gather baseline metrics from three randomly chosen cluster member servers (OSD and other services), including the following:&lt;br /&gt;
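** &amp;lt;code&amp;gt;/usr/bin/sar&amp;lt;/code&amp;gt;, specifically looking at CPU (&amp;lt;code&amp;gt;sar -u&amp;lt;/code&amp;gt;; %system and %idle especially) and memory usage (&amp;lt;code&amp;gt;sar -r&amp;lt;/code&amp;gt;; %memused and active memory, reported as kbactive)&lt;br /&gt;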
* In each cluster, before the S1 agent is deployed, measure the cluster&amp;#039;s overall performance:&lt;br /&gt;
** &amp;lt;code&amp;gt;rados bench&amp;lt;/code&amp;gt; is the Ceph tool used for this: &amp;lt;code&amp;gt;rados bench -p rados_bench 300 write -t 8 --object_size=4MB --no-cleanup&amp;lt;/code&amp;gt;.  It exercises the RADOS layer, not client access.  It will decrease cluster client IO while it is running, so it is important to be mindful of customer impact.  By way of explanation, this command runs 8 concurrent write operations, each writing 4 MiB RADOS objects into the &amp;lt;code&amp;gt;rados_bench&amp;lt;/code&amp;gt; pool for five minutes (300 seconds).  The &amp;lt;code&amp;gt;--no-cleanup&amp;lt;/code&amp;gt; option leaves the written objects in place so that a read benchmark can use them.  When the run is complete, record the bandwidth, IOPS, and latency numbers.&lt;br /&gt;
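** Do a sequential read benchmark against the objects left by the write run: &amp;lt;code&amp;gt;rados bench -p rados_bench 300 seq -t 8&amp;lt;/code&amp;gt;.  Read benchmarks use the &amp;lt;code&amp;gt;seq&amp;lt;/code&amp;gt; (sequential) or &amp;lt;code&amp;gt;rand&amp;lt;/code&amp;gt; (random) mode and reuse the object size recorded by the write run.  When finished, remove the benchmark objects with &amp;lt;code&amp;gt;rados -p rados_bench cleanup&amp;lt;/code&amp;gt;.&lt;br /&gt;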
* Client benchmarks to be run at the same stage, before the S1 agent is deployed:&lt;br /&gt;
** &amp;lt;code&amp;gt;fio&amp;lt;/code&amp;gt; can be run on iSCSI client systems to measure their perceived performance.  Again, this will have an impact on other customers&amp;#039; use of the clusters.&lt;br /&gt;
** S3 performance can be measured by timing uploads and downloads of large objects against a cluster&amp;#039;s S3 endpoints.  Use the AWS CLI, &amp;lt;code&amp;gt;s3cmd&amp;lt;/code&amp;gt;, or &amp;lt;code&amp;gt;mc&amp;lt;/code&amp;gt; (MinIO client) for this.&lt;br /&gt;
* After the S1 agent is installed, repeat the benchmarks.  Investigate any decrease in benchmark values of more than 5% and determine the cause.&lt;br /&gt;
* When installation and benchmarking are completed on the low sensitivity servers, proceed to the next group and repeat.&lt;/div&gt;</summary>
		<author><name>Adj</name></author>
	</entry>
</feed>