Splitting Light: Season 2 - Episode 11


Splitting light

Season 2 Episode 11

The bunker

If you are no longer interested in the newsletter, please unsubscribe

When we had launched Carbon14 a year and half back, Arnaud had decided to buy a specific building in Paris to build a new datacenter inside. This building was very particular as it used to be a ministry building and most importantly exactly 30 meters underneath the building was a nuclear strike proof bunker that had been built in the 1960. Its purpose was to safeguard government officials in case of a nuclear attack on Paris.

This bunker was also famous because it used to be connected to underground tunnels in Paris. The tunnels were named catacombs. It was where you would go crawling underground for fun.

It was a good place to host a long term storage product. The transformation process to transform the building and the bunker to a datacenter was almost finished. A 30 meter elevator shaft had been dug and the alcoves had been rehabilitated. Interesting fact, the tiles used were the same as the one used for the Paris metro. Around this time, in February 2018, we could start finally plugging computer hardware there.

The team huddled up. We needed to deploy Carbon14 there. The storage elements were being assembled. Multiple 10 petabyte racks, 600 watts each. But we needed a storage buffer as well. Florian had experience with ZFS and we decided to go fancy. Using two caching mechanisms on SSD. A detail that would be very significant later.

Loic took in hands to add deployment automation using salt stack to deploy the software. We deployed, did a few tests and were happy with the system. We plugged the zone into the old Scaleway console, we were a click away from launch.

During that time I had to go several times down in the bunker. Under ground, you had no phone signal. Nothing. Wifi had not yet been installed. You could get network connectivity if you physically plugged into an ethernet cable. Even then, it had to be the right socket. Similarly to any datacenter, it was very cold. You would sit on the floor but had to remember not lean against the walls where water infiltration would flow in specifically made gutters. It was not a comfortable place. But unlike most data centers, there were no screaming fans around me. It was near silent.

Fun facts about the bunker, aka DC4. Each carbon14 server was a bit more than 50 kg. So a rack full of them, with 22 of them, was about 1 metric ton. The ground had to be engineered in a specific way to support 1 metric ton per cm2. C14 racks were very heavy. The heat generated from the machines underground was reused in a nearby building for district heating. This would reduce the carbon footprint of the datacenter.

After we pushed to production, as soon as we had opened the zone to customers, we started to have a strange issue.

Sometimes, the container where we exported the customer data would not shut down. A restart made it work again, but by itself it would hang. Searching the web, I eventually found a github issue that described exactly what was happening. The issue was on ZFS. We didn’t really have the time to dig into the ZFS codebase, so we waited for a fix. For several months, we connected by hand to the machines and performed the restarts.

A few months later, someone posted a comment that by disabling a specific cache mechanism that we also used, the bug disappeared. I promptly did that with the help of Florian. Lo and behold, it fixed the issue. More importantly, it had no visible performance impact. We found out that we had over-engineered that specific part.

The few minutes that the person took to report a single comment in the bug report saved us many hours and enabled a final fix to be implemented.

But before that happened, we continued on Object Storage!

(1) https://x.com/Scaleway_fr/status/969634754235682816

(2) https://x.com/Scaleway_fr/status/859728642724835328/photo/1

(3) https://github.com/openzfs/zfs/issues/7038

If you have missed it, you can read the previous episode here

To pair with :

  • Suicide mission - Groundislava, Baths
  • The Three-Body Problem by Liu Cixin

Vincent Auclair

Connect with me on your favorite network!

Oud metha, Dubai, Dubai 00000
Unsubscribe · Preferences

Symbol Sled

Business, tech, and life by a nerd. New every Tuesday: Splitting Light: The Prism of Growth and Discovery.

Read more from Symbol Sled

Splitting light Season 2 Episode 10 Finding someone If you are no longer interested in the newsletter, please unsubscribe It’s hard to explain everything that led to meeting this person. From my side you could sum it up to: I let go. I let myself feel the tide instead of trying to control it. As it rose and fell back, I would meet people. I was more comfortable with myself. I enjoyed myself more. This was the change that, for me, made this happen and work. I met Djazia in February 2018....

Splitting light Season 2 Episode 09 Redundancy is key If you are no longer interested in the newsletter, please unsubscribe Around February 2018, as we moved forward and validated individual hardware pieces, we now had to bring everything together. This is where my experience in the lab had a lot of impact, coupled with the experience we had gathered maintaining the existing storage products. What most software engineers fail to realize when working with hardware is that the time flows...

Splitting light Season 2 Episode 08 Compiling knowledge If you are no longer interested in the newsletter, please unsubscribe To be able to use OpenIO and offer it as a public facing product we had to amass quite a large amount of knowledge. We had to understand how it worked in detail. We had to understand the hardware requirements as well as how we wanted to make it filled and cabled. We had to understand how Scaleway’s information system worked and how we would connect to it. Skunk Works:...