
48 posts tagged with "operation"


· One min read
Joachim Kraftmayer
  • osd max backfills: The maximum number of backfill operations allowed to or from a single OSD. The higher the number, the quicker the recovery, which might impact overall cluster performance until recovery finishes.
  • osd recovery max active: The maximum number of active recovery requests per OSD. The higher the number, the quicker the recovery, which might impact overall cluster performance until recovery finishes.
  • osd recovery op priority: The priority assigned to recovery operations relative to client I/O (a higher value means a higher recovery priority). A higher recovery priority might cause performance degradation until recovery completes.
ceph tell 'osd.*' injectargs --osd-max-backfills=2 --osd-recovery-max-active=2

Recommendation

Start with small steps, observe the Ceph status, client IOPS and throughput, and then continue to increase in small steps.

In production, with regard to the applications and the hardware infrastructure, we recommend setting these options back to their defaults as soon as possible.
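Reverting can be done the same way the values were injected. A minimal sketch, assuming the pre-Quincy defaults (osd_max_backfills=1, osd_recovery_max_active=3); verify the defaults of your own release first, for example with ceph config help on recent versions:

ceph config help osd_max_backfills

ceph config help osd_recovery_max_active

ceph tell 'osd.*' injectargs --osd-max-backfills=1 --osd-recovery-max-active=3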

Sources

https://www.suse.com/support/kb/doc/?id=000019693

· One min read
Joachim Kraftmayer

supported by kernel version 4.13

ceph features - wrong display

Ceph tries to determine the Ceph client version based on the feature flags. However, the kernel Ceph client does not follow the same code stream.

So the output is not always correct.
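To cross-check, the monitor session list shows the connected clients together with the features they report, and the kernel version itself is only reliably available on the client host. A sketch, assuming a monitor with the id mon.a; on older releases, ceph daemon mon.a sessions on the monitor host gives the same list as ceph tell, and uname -r must be run on the client host:

ceph features

ceph tell mon.a sessions

uname -r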

· One min read
Joachim Kraftmayer

When configuring OSDs in a mixed setup with DB and WAL colocated on a flash device (SSD or NVMe), there has always been confusion about where the DB and the WAL are really located. This can be checked with a simple test: the location of the DB for the respective OSD can be verified via ceph osd metadata osd.<id> and the variable "bluefs_dedicated_db": "1".

The WAL was created separately in earlier Ceph versions and automatically on the same device as the DB in later Ceph versions. The WAL can be easily tested by using the ceph tell osd.<id> bench command.

First, check larger write operations with the command:

ceph tell osd.0 bench 65536 409600

Second, check with smaller writes that are below bluestore_prefer_deferred_size_hdd (64K):

ceph tell osd.0 bench 65536 4096

If you compare the IOPS of the two tests, one result should correspond to the IOPS of an SSD, while the other should be quite low, matching the HDD. From this you can tell whether the WAL is on the HDD or on the flash device.
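The metadata check mentioned above can be combined into one command; a small sketch, assuming osd.0 and the bluefs_dedicated_db / bluefs_dedicated_wal keys reported by ceph osd metadata:

ceph osd metadata osd.0 | grep -E '"bluefs_dedicated_(db|wal)"'

A value of "1" for bluefs_dedicated_db confirms a separate DB device; bluefs_dedicated_wal typically stays "0" when the WAL simply shares the DB device, which is exactly the case the bench test above is meant to distinguish.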

· One min read
Joachim Kraftmayer

BlueStore/RocksDB will only put the next level of the DB on flash if the whole level fits. These sizes are roughly 3 GB, 30 GB and 300 GB. Anything in between those sizes is pointless: only ~3 GB of SSD will ever be used out of a 28 GB partition. Likewise, a 240 GB partition is also pointless, as only ~30 GB will be used.

How do I find the right SSD/NVMe partition size for the hot DB?

https://github.com/facebook/rocksdb/wiki/Leveled-Compaction
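To check how much of the DB device a given OSD actually uses, and whether RocksDB has spilled over to the slow device, the bluefs perf counters can be inspected on the OSD host; a sketch, assuming osd.0, jq being installed and the counter names of recent releases:

ceph daemon osd.0 perf dump bluefs | jq '.bluefs | {db_total_bytes, db_used_bytes, slow_used_bytes}'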

· One min read
Joachim Kraftmayer

When commissioning a cluster, it is always advisable to log and evaluate the ceph osd bench results.

The values can also be helpful for performance analysis in a productive Ceph cluster.

ceph tell osd.<int|*> bench {<int>} {<int>} {<int>}

OSD benchmark: write <count> bytes in <size>-byte objects (defaults: count = 1 GB, size = 4 MB)

osd_bench_max_block_size=65536 kB

Example:

1G size 4MB (default)

ceph tell osd.* bench

1G size 64MB

ceph tell osd.* bench 1073741824 67108864
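To log the results in a form that can be evaluated later, the bench output can be requested as JSON; a small sketch, assuming jq is installed and that the field names in the bench output (such as bytes_per_sec and iops) match your release:

for osd in $(ceph osd ls); do
  ceph tell osd.$osd bench --format json \
    | jq -c --arg osd "$osd" '. + {osd: $osd, date: (now | todate)}' \
    >> osd-bench-$(date +%F).jsonl
done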

· One min read
Joachim Kraftmayer
for date in $(ceph pg dump | grep active | awk '{print $20}'); do date +%A -d $date; done | sort | uniq -c

19088 Monday
1752 Saturday
54296 Sunday
for date in $(ceph pg dump | grep active | awk '{print $21}'); do date +%H -d $date; done | sort | uniq -c

dumped all
3399 00
3607 01
2449 02
2602 03
6145 04
4907 05
4986 06
3777 07
2421 08
2429 09
2478 10
2546 11
2523 12
2614 13
2661 14
2722 15
2669 16
2649 17
2656 18
2751 19
2780 20
2893 21
3157 22
3315 23
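The awk column numbers ($20, $21) shift between Ceph releases. A more robust variant reads the JSON dump instead; a sketch, assuming the timestamps analysed here are the deep-scrub stamps (last_deep_scrub_stamp) and that pg_stats lives under pg_map in your release:

ceph pg dump -f json 2>/dev/null \
  | jq -r '(.pg_map.pg_stats // .pg_stats)[].last_deep_scrub_stamp' \
  | cut -d' ' -f1 \
  | xargs -I{} date +%A -d {} \
  | sort | uniq -c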

· One min read
Joachim Kraftmayer

List of users:

radosgw-admin metadata list user

List of buckets:

radosgw-admin metadata list bucket

List of bucket instances:

radosgw-admin metadata list bucket.instance

All necessary information:

  • user-id = Output from the list of users
  • bucket-id = Output from the list of bucket instances
  • bucket-name = Output from the list of buckets or bucket instances
  • Change of user for this bucket instance:
radosgw-admin bucket link --bucket <bucket-name> --bucket-id <default-uuid>.267207.1 --uid=<user-uid>

Example:

radosgw-admin bucket link --bucket test-clyso-test --bucket-id aa81cf7e-38c5-4200-b26b-86e900207813.267207.1 --uid=c19f62adbc7149ad9d19-8acda2dcf3c0

If you compare the buckets before and after the change (see the metadata dump sketch after this list), the following values are changed:

  • ver: is increased
  • mtime: will be updated
  • owner: is set to the new uid
  • user.rgw.acl: the permissions stored under the user.rgw.acl key are reset for the new owner
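One way to do that before/after comparison is to dump the bucket and bucket instance metadata and diff the output; a sketch, reusing the bucket name and instance id from the example above:

radosgw-admin metadata get bucket:test-clyso-test

radosgw-admin metadata get bucket.instance:test-clyso-test:aa81cf7e-38c5-4200-b26b-86e900207813.267207.1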