Clyso Blog | Clyso GmbH

Post Mortem: Ceph v18.2.7 OSD High CPU Usage Caused by the Messenger

July 28, 2026 · 11 min read

Software Engineer at Clyso

A single object was being requested by >100 clients triggered the messenger to throttle on one OSD, causing CPU to spike above 300% and cause slow operations across unrelated PGs. We trace the root cause to the osd_client_message_cap throttle and its interaction with a known upstream bug Ceph Tracker #62512.

Post Mortem: Tentacle v20.2.0 OSD crashing due to EC Bug

January 13, 2026 · 6 min read

Joshua Blanch

Software Engineer at Clyso

Zac Dover

Technical Writer at Clyso

On January 11, 2026, at 2:48 PST, an emergency support request was opened for OSD crashes in v20.2.0 that rendered CephFS inaccessible.

The incident was resolved, restoring cluster availability.

A secondary post-recovery issue related to scrubbing errors was subsequently identified and fixed.

The fix involved deploying a new build of Ceph that contained the patches for the bugs. The engineering team made use of Clyso's new build system, delivering the fix to the client as fast as possible.

Critical Known Bugs in Ceph Quincy, Reef and Squid Versions

April 7, 2025 · 4 min read

Joshua Blanch

Software Engineer at Clyso

Below are a list of known bugs in ceph versions that we want to highlight. As of time of writing this, the latest versions for each release are the following:

reef: 18.2.7
squid: 19.2.3

There are more bugs in Ceph that were not included here, we've highlighted a few that we wanted to share.

Cross-version Issues

These critical bugs affect multiple major versions of Ceph.

RadosGW --bypass-gc Data Loss Bug

Severity: Critical
Affected Versions: Quincy (17.2.x), Reef (18.2.x), Squid (19.2.x) Bug Tracker: https://tracker.ceph.com/issues/73348

S3 migration with chorus

March 26, 2025 · 10 min read

Artem Torubarov

Software Engineer at Clyso

An S3 system holds data—call it source. It keeps applications running, but migration is needed, maybe due to scale limits or costs creeping up. A new S3 setup, target, is set to replace it. The challenge is to move all data from source to target with no downtime, no data lost, and no breaks for the apps using source. What can get this done?

Kubernetes upgrade 1.31

March 12, 2025 · 2 min read

Dominik Rieder

Head of Kubernetes at Clyso

We see on some Kubernetes cluster upgrading from 1.30 -> 1.31 following errors on cilium, coredns, kube-proxy, ... pods on Control Planes:

CLYSO: Kubernetes Analyzer

February 24, 2025 · 3 min read

Dominik Rieder

Head of Kubernetes at Clyso

In 2023, Clyso released the Ceph Analyzer, giving your operations teams a great tool for inspecting the health of your Ceph clusters, offering in-depth reporting and recommendations to fix many non-trivial issues. Two years later, we are pleased to announce the release of Clyso Kubernetes Analyzer!

Adding Capacity to Ceph -- the CLYSO Way!

January 8, 2025 · 2 min read

Dan van der Ster

CTO at Clyso

One of my favourite things to assist users with is simplifying their workflows for making major changes to their Ceph clusters, such as adding or removing multiple hosts at once. Ceph is inherently excellent at handling these tasks – one of its greatest strengths is the ability to transparently add or remove capacity, replace servers, and perform maintenance, all without downtime.

Introducing MCB: Simplifying Cloud Resource Management

October 31, 2024 · 7 min read

Talita Amaral

Product Owner at Clyso

Managing cloud resources can be complex, especially when you work across multiple cloud providers. That’s where MCB (Multi-Cloud Broker) comes in—a powerful platform designed to simplify and unify cloud management, enabling users to interact with various cloud services through a single interface.

ceph-volume - ceph osd migrate DB to larger ssd/flash device

September 9, 2024 · One min read

Joachim Kraftmayer

Managing Director at Clyso

Building container images with Gardener

August 21, 2024 · 2 min read

Róbert Vašek

Software Engineer at Clyso

Cross-version Issues​

RadosGW --bypass-gc Data Loss Bug​

Cross-version Issues

RadosGW --bypass-gc Data Loss Bug