diff --git a/blog/index.html b/blog/index.html
index 353f26f..deb2350 100644
--- a/blog/index.html
+++ b/blog/index.html
@@ -200,6 +200,14 @@ <h1>2023</h1>
       
         
       
+      <p class="post">
+        <a href="/whos-watching-the-watchdog">Who's watching the watchdog?</a>
+        <span class="date">11/22</span>
+      </p>
+    
+      
+        
+      
       <p class="post">
         <a href="/hacker-gifts">A side project story: Hacker Gifts (2018-2024)</a>
         <span class="date">10/29</span>
diff --git a/blogpost-contexts/index.html b/blogpost-contexts/index.html
index 304d028..f12d83e 100644
--- a/blogpost-contexts/index.html
+++ b/blogpost-contexts/index.html
@@ -230,6 +230,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/cdtmp/index.html b/cdtmp/index.html
index 4790854..7a2340b 100644
--- a/cdtmp/index.html
+++ b/cdtmp/index.html
@@ -213,6 +213,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/copy-with-syntax/index.html b/copy-with-syntax/index.html
index e198bd8..99ec63c 100644
--- a/copy-with-syntax/index.html
+++ b/copy-with-syntax/index.html
@@ -217,6 +217,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
           
diff --git a/ctrl-r/index.html b/ctrl-r/index.html
index 9823d74..1f937b0 100644
--- a/ctrl-r/index.html
+++ b/ctrl-r/index.html
@@ -212,6 +212,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/e2e-tests/index.html b/e2e-tests/index.html
index 88c592c..ea49a7e 100644
--- a/e2e-tests/index.html
+++ b/e2e-tests/index.html
@@ -269,6 +269,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/feed.xml b/feed.xml
index 227fd3d..ca68ea7 100644
--- a/feed.xml
+++ b/feed.xml
@@ -5,12 +5,40 @@
   <link rel="self" href="https://frantic.im/feed.xml" />
   <icon>https://frantic.im/favicon.png</icon>
   <subtitle>Occasional posts on technology and stuff</subtitle>
-  <updated>2023-10-24T19:13:00.962Z</updated>
+  <updated>2023-11-22T18:31:03.008Z</updated>
   <author>
     <name>Alex Kotliarskyi</name>
   </author>
 
   
+    <entry>
+      <id>https://frantic.im/whos-watching-the-watchdog</id>
+      <title>Who&#39;s watching the watchdog?</title>
+      <updated>2023-11-22T12:00:00+00:00</updated>
+
+      <link rel="alternate" href="https://frantic.im/whos-watching-the-watchdog" />
+      <summary>Making reliable systems that expect things to go wrong</summary>
+      <content type="html"><![CDATA[
+        
+        
+        
+        <p>At my current company we have an automated pipeline for processing customer’s orders. It’s pretty complex — talking to multiple different services, training models, storing large files, updating the database, sending emails and push notifications.</p>
+<p>Sometimes things get stuck because of a temporary 3rd party outage or a bug in our code.</p>
+<p>So we built a watchdog service: it monitors the stream of orders and makes sure the orders get processed within reasonable timeframe (3 hours). The watchdog only looks at the final invariant — was the order fulfilled and delivered to the customer? It doesn’t care about any intermediary steps.</p>
+<p>This system has saved us many times. When the watchdog finds a stuck order, it posts in our special channel in Slack. We investigate the problem and address the root cause, so hopefully we won’t see new orders stuck for the same reason.</p>
+<p>But who’s watching the watchdog? What if it fails to run?</p>
+<p>It actually happened to us once. The watchdog is running on the job scheduling system, and that system went down. That meant no orders were getting processed and watchdog also wasn’t running. The alerts channel in Slack was blissfully silent.</p>
+<p>To address this case, we need a system that can watch the watchdog. We are using these two:</p>
+<ul>
+<li><a href="https://docs.sentry.io/product/crons/">Sentry Cron</a></li>
+<li><a href="https://www.checklyhq.com/blog/heartbeat-monitoring-with-checkly/">Checkly Heartbeat</a></li>
+</ul>
+<p>The idea behind both systems is the same: they expect a regular cron job to “check in” on a pre-defined schedule. If it misses a check-in, there’s likely a problem and we get an alert in Slack.</p>
+<p>Complex systems always find surprising ways to fail. When adding an end-to-end quality watchdog (and ways to watch the watchdog) you can create a positive loop of detecting issues and hardening the system.</p>
+
+      ]]></content>
+    </entry>
+  
     <entry>
       <id>https://frantic.im/hacker-gifts</id>
       <title>A side project story: Hacker Gifts (2018-2024)</title>
@@ -652,46 +680,4 @@ Things you can do:
       ]]></content>
     </entry>
   
-    <entry>
-      <id>https://frantic.im/octave</id>
-      <title>A side project story: octave.im (2013-2016)</title>
-      <updated>2021-02-23T12:00:00+00:00</updated>
-
-      <link rel="alternate" href="https://frantic.im/octave" />
-      <summary>A story about my attempt at SaaS</summary>
-      <content type="html"><![CDATA[
-        
-        
-        
-        <p>It all started around 2013: I was going through a course on <a href="https://www.coursera.org/learn/machine-learning">Machine Learning by Andrew Ng</a>.</p>
-<p>The practical part of the course depended on GNU Octave (open source math toolkit), but installing it on a Mac was a huge pain. I did manage to do it, but noticed that many people on forums complanied about the same thing.</p>
-<p>So I had a brilliant idea — wouldn’t it be great if Octave was available via SaaS model? With fancy features like built in code editor, command line and plots?</p>
-<h1>Node, React &amp; Docker</h1>
-<p>I built the first prototype in one night on June 8, 2013. I used NodeJS 0.10-ish with socket.io on the server side and CodeMirror with some plugins on the frontend.</p>
-<p>In October that year I rewrote the frontend in React — the experience of doing so was amazing! React was young (<code>createClass</code>/<code>autobind</code>/<code>mixins</code>) but its programming model “clicked” with me. I remember hanging out in their IRC channel looking for help with autoscrolling. I was really impressed at how quick and friendly the response was (thanks <a href="https://twitter.com/sophiebits">@sophiebits</a>!).</p>
-<p>The initial version of the backend would just run <code>octave</code> in a dedicated folder. My second iteration ued Docker, which at the time was very new and unproven. It all ran on a Digital Ocean 2GB RAM droplet.</p>
-<p>The killer feature was displaying plots inline in a REPL. You can see it on this gif:</p>
-<p><img src="https://frantic.im/assets/octave.im/octave-demo.gif" alt="" /></p>
-<p>It worked through a clever hack: I pre-configured Octave to use gnuplot with special arguments that made it save the graph to a file (instead of showing it on the screen). My NodeJS backend listened to filesystem changes and notified the frontend when it detected the update.</p>
-<h1>Product market fit</h1>
-<p>I tried to promote octave.im for the students of the ML course. I posted the link on forums couple of times and added it to the course wiki page (that was surprisingly very hidden). The reception among students has been really positive, but the course moderators weren’t happy: they wanted some kind of validation that it’s a serious thing (which it wasn’t).</p>
-<p>Overall I had more than 3500 people sign up over the course of several years. Unfortunately I didn’t keep any metrics screenshots. The twitter account, <a href="https://twitter.com/OctaveCloud">@OctaveCloud</a>, got 57 followers (organically).</p>
-<p>Speaking of which, I used Mixpanel and loved its simple API and dashboards. They even sent me a free T-shirt :)</p>
-<h1>Total profit: -$420</h1>
-<p>As every other hacker out there I also hoped to make it sustainable, so in October 2015 I added $4 monthly subscription with 2 weeks trial. To be honest I wasn’t very serious about it at that point. I just wanted to play with Stripe, see if people would actually pay. And they did! Overall I have collected about $300 in revenue.</p>
-<p>An interesting thing that I noticed was that people subscribe and then stop using the product, without unsubscribing (I did have the unsubscribe button on the profile, no questions asked). I ended up manually cancelling a bunch of subscriptions on Stripe without updating the app DB, so people could still use the service (which they didn’t anyways).</p>
-<h1>In numbers</h1>
-<ul>
-<li>308 commits</li>
-<li>3,500 accounts created</li>
-<li>450,000 commands executed</li>
-<li>$300 total revenue</li>
-<li>$720 spent on hosting</li>
-</ul>
-<p>Screenshot, for posterity:</p>
-<p><img src="https://frantic.im/assets/octave.im/screenshot.png" alt="" /></p>
-
-      ]]></content>
-    </entry>
-  
 </feed>
\ No newline at end of file
diff --git a/figma/og_watchdog.png b/figma/og_watchdog.png
new file mode 100644
index 0000000..371b051
Binary files /dev/null and b/figma/og_watchdog.png differ
diff --git a/good-errors-leave-trace/index.html b/good-errors-leave-trace/index.html
index 5e74d03..5ee3e24 100644
--- a/good-errors-leave-trace/index.html
+++ b/good-errors-leave-trace/index.html
@@ -266,6 +266,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/hacker-gifts/index.html b/hacker-gifts/index.html
index 7fc0ca8..cf9e9fb 100644
--- a/hacker-gifts/index.html
+++ b/hacker-gifts/index.html
@@ -257,6 +257,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
     
 
diff --git a/hello-world/index.html b/hello-world/index.html
index 343ef7b..40d81b5 100644
--- a/hello-world/index.html
+++ b/hello-world/index.html
@@ -206,6 +206,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/how-not-to-flux-loops/index.html b/how-not-to-flux-loops/index.html
index 7098177..676db3e 100644
--- a/how-not-to-flux-loops/index.html
+++ b/how-not-to-flux-loops/index.html
@@ -250,6 +250,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/how-not-to-flux-set-actions/index.html b/how-not-to-flux-set-actions/index.html
index f02659b..097cac2 100644
--- a/how-not-to-flux-set-actions/index.html
+++ b/how-not-to-flux-set-actions/index.html
@@ -283,6 +283,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/how-to-convince-your-boss-to-use-react-native/index.html b/how-to-convince-your-boss-to-use-react-native/index.html
index ac0df75..600d1db 100644
--- a/how-to-convince-your-boss-to-use-react-native/index.html
+++ b/how-to-convince-your-boss-to-use-react-native/index.html
@@ -253,6 +253,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/keynote/index.html b/keynote/index.html
index 857ceda..3ffe5f7 100644
--- a/keynote/index.html
+++ b/keynote/index.html
@@ -248,6 +248,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/macos-app-shortcuts/index.html b/macos-app-shortcuts/index.html
index a2d730a..108cf47 100644
--- a/macos-app-shortcuts/index.html
+++ b/macos-app-shortcuts/index.html
@@ -224,6 +224,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/no-constraints-no-fun/index.html b/no-constraints-no-fun/index.html
index 231ae60..8d90c55 100644
--- a/no-constraints-no-fun/index.html
+++ b/no-constraints-no-fun/index.html
@@ -208,6 +208,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
           
diff --git a/notify-on-completion/index.html b/notify-on-completion/index.html
index 9e4b07c..38c8451 100644
--- a/notify-on-completion/index.html
+++ b/notify-on-completion/index.html
@@ -244,6 +244,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/octave/index.html b/octave/index.html
index f143c04..b14b553 100644
--- a/octave/index.html
+++ b/octave/index.html
@@ -230,6 +230,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
           
diff --git a/onityper/index.html b/onityper/index.html
index a3aed3c..27d8a5f 100644
--- a/onityper/index.html
+++ b/onityper/index.html
@@ -292,6 +292,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
           
diff --git a/plotting-ideas/index.html b/plotting-ideas/index.html
index 2bd2edc..d941e64 100644
--- a/plotting-ideas/index.html
+++ b/plotting-ideas/index.html
@@ -228,6 +228,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/react-and-javascript-in-5-min/index.html b/react-and-javascript-in-5-min/index.html
index 6e131a8..776e97a 100644
--- a/react-and-javascript-in-5-min/index.html
+++ b/react-and-javascript-in-5-min/index.html
@@ -396,6 +396,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/react-api-evolution/index.html b/react-api-evolution/index.html
index ee358c4..18b2354 100644
--- a/react-api-evolution/index.html
+++ b/react-api-evolution/index.html
@@ -459,6 +459,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/react-conf-2018/index.html b/react-conf-2018/index.html
index 7b4d0c3..96db4cc 100644
--- a/react-conf-2018/index.html
+++ b/react-conf-2018/index.html
@@ -303,6 +303,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/replacing-jekyll/index.html b/replacing-jekyll/index.html
index 4e7332e..30f7e44 100644
--- a/replacing-jekyll/index.html
+++ b/replacing-jekyll/index.html
@@ -239,6 +239,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/side-projects-are-hard/index.html b/side-projects-are-hard/index.html
index 9d9ad11..7319eed 100644
--- a/side-projects-are-hard/index.html
+++ b/side-projects-are-hard/index.html
@@ -230,6 +230,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
           
diff --git a/test-plan/index.html b/test-plan/index.html
index f78597c..372074a 100644
--- a/test-plan/index.html
+++ b/test-plan/index.html
@@ -216,6 +216,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/the-first-react-native-app/index.html b/the-first-react-native-app/index.html
index 3262075..c1ef951 100644
--- a/the-first-react-native-app/index.html
+++ b/the-first-react-native-app/index.html
@@ -240,6 +240,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/using-redux-with-flow/index.html b/using-redux-with-flow/index.html
index 1b4831f..7d15bf2 100644
--- a/using-redux-with-flow/index.html
+++ b/using-redux-with-flow/index.html
@@ -300,6 +300,14 @@ <h4>Related posts:</h4>
     
 
     
+
+    
+
+  
+
+    
+
+    
       
         
       
diff --git a/whos-watching-the-watchdog/index.html b/whos-watching-the-watchdog/index.html
new file mode 100644
index 0000000..f0ac6f7
--- /dev/null
+++ b/whos-watching-the-watchdog/index.html
@@ -0,0 +1,239 @@
+<!DOCTYPE html>
+<html lang="en">
+  <head>
+    <meta charset="utf-8">
+    <meta name="viewport" content="width=640" />
+
+    <title>Who&#39;s watching the watchdog? / frantic.im</title>
+    <meta name="author" content="Alex Kotliarskyi">
+    <meta name="description" content="Making reliable systems that expect things to go wrong">
+
+    <link rel="canonical" href="https://frantic.im/whos-watching-the-watchdog">
+    <link rel="alternate" type="application/rss+xml" title="frantic.im" href="https://frantic.im/feed.xml">
+
+    <meta property="og:title" content="Who&#39;s watching the watchdog?">
+    <meta property="og:image" content="https://frantic.im/assets/og_watchdog.png">
+    <meta name="og:description" content="Making reliable systems that expect things to go wrong">
+
+    <meta name="twitter:card" content="summary_large_image">
+    <meta name="twitter:site" content="@alex_frantic">
+    <meta name="twitter:creator" content="@alex_frantic">
+    <meta name="twitter:title" content="Who&#39;s watching the watchdog?">
+    <meta name="twitter:description" content="Making reliable systems that expect things to go wrong">
+    <meta name="twitter:image" content="https://frantic.im/assets/og_watchdog.png">
+
+    <link id="favicon" rel="icon" type="image/png" href="/favicon.png">
+    <script>
+      if ('ethereum' in window) {
+        document.getElementById('favicon').href = '/assets/favicon-hex.png';
+      }
+    </script>
+
+    <style type="text/css" media="screen">
+      * {
+  -moz-box-sizing: border-box;
+  -webkit-box-sizing: border-box;
+  box-sizing: border-box;
+}
+
+@font-face { font-family: 'body';                     src: url('/fonts/IBMPlexSans-Text.woff') format('woff'); }
+@font-face { font-family: 'body'; font-style: italic; src: url('/fonts/IBMPlexSans-TextItalic.woff') format('woff'); }
+@font-face { font-family: 'body'; font-weight: 800;   src: url('/fonts/IBMPlexSans-Bold.woff') format('woff'); }
+
+@font-face { font-family: 'mono';                     src: url('/fonts/IBMPlexMono-Text-Latin1.woff') format('woff'); }
+
+body { font: 18px/28px body, sans-serif; }
+pre, code { font-family: mono, monospace; }
+
+body {
+  background-color: #FFF;
+  color: #000;
+  -webkit-font-feature-settings: "kern" 1,"liga" 1,"calt" 1;
+  -moz-font-feature-settings: "kern" 1,"liga" 1,"calt" 1;
+  font-feature-settings: "kern" 1,"liga" 1,"calt" 1;
+  -webkit-font-smoothing: antialiased;
+  -moz-osx-font-smoothing: grayscale;
+  text-rendering: optimizeLegibility;
+  margin: 50px auto;
+}
+.page { width: 600px; margin: 0 auto; padding: 0 0 0 2px; }
+.page_wide { width: 810px; }
+
+.menu { width: 544px; margin: 0 auto 0; padding: 0 0 50px 0; vertical-align: top; }
+.menu > li { list-style: none; display: inline-block; margin: 0 1.5em 0 0; vertical-align: top; }
+.menu__item, a.menu__item { color: #00000050; border-color: transparent; display: inline-block; }
+.menu__item_selected,
+a.menu__item_selected,
+a.menu__item:hover { color: #000; border-bottom: 2px solid #000; }
+.menu__item_inside, a.menu__item_inside { border-bottom: 2px solid #00000030; }
+
+article { width: 544px; margin: 15px auto; }
+
+a { color: inherit; text-decoration: none; border-bottom: 2px solid #00000030; }
+a:hover { border-color: currentColor; }
+p, blockquote { margin: 15px 0; }
+h1 + p + blockquote { margin-bottom: 30px; margin-right: 1em; }
+blockquote { padding-left: 1em; color: #00000090; }
+blockquote::before {content: "> "; float: left; margin: 0 0 0 -1em; }
+
+.quote-author { text-align: right; font-size: 15px; }
+strong { font-weight: 600; }
+h1, h2 { margin: 2.5em 0 0.5em; }
+h1 { font-size: 1.7em; }
+h2 { font-size: 1.4em; }
+.title { font-size: 2.5em; line-height: 50px; margin: 1.5em 0 0.75em 0; }
+
+p > img, .fig, figure { margin: 2em 0; }
+img { max-width: 100%; }
+
+.fig, figure { text-align: center; font-size: 12px; line-height: 20px; font-style: italic; width: 600px; margin-left: -28px; margin-right: -28px; }
+.fig img, figure > img, figure > video, figure > a > img { margin: 0 auto 1em; display: block; border-radius: 3px; }
+figure > video { max-width: 100%; }
+.label { text-align: center; font-size: 12px; font-style: italic; margin: -1em 0 1em 0; }
+
+code { font-style: normal; background: #00000010; padding: 2px 6px; border-radius: 4px; font-size: 17px; white-space: nowrap; }
+pre { font-size: 16px;
+      background: #00000010;
+      padding: 16px 30px 14px;
+      margin: 1em -30px;
+      border-radius: 8px;
+      white-space: pre;
+      word-wrap: break-word;
+      font-style: normal;
+      overflow-x: auto; }
+pre > code { background: none; padding: 0; font-size: inherit; white-space: unset; }
+
+ul { padding: 0 0 0 1em; list-style-type: square; }
+ul > li, ol > li { margin: 0.5em 0; }
+
+sup, sub, .note-ref, .note-number, .footnote { vertical-align: baseline; position: relative; font-size: .7em; line-height: 1; }
+sup, .note-ref, .note-number, .footnote { bottom: 1.4ex; }
+sub { top: .5ex; }
+
+.about { margin: 60px 0;}
+.about_photo { float: left; width: 100px; height: 160px; margin-left: -150px; margin-top: -10px; background: url("/photo.png"); background-size: 200px; }
+.about_photo:hover { background-position: 100%; }
+.about_inner { font-size: 16px; line-height: 24px; border: 2px solid #00000030; border-radius: 4px; padding: 10px 20px; margin: -12px -22px; }
+.about_inner > p { margin: 0; }
+.about_inner > p:not(:last-child) { margin-bottom: 8px; }
+.btn-subscribe { line-height: 20px; text-decoration: none; background: #00000015; border: none; font-size: 12px; padding: 0px 7px; display: inline-block; border-radius: 4px; position: relative; top: -1px; }
+.btn-subscribe:hover { background: #00000030; }
+.btn-subscribe > svg { width: 21px; height: 21px; vertical-align: bottom; margin: 0 -2px 0 -5px; }
+
+.footnote { margin: 0 5px;  }
+.footnotes-br { width: 100px; height: 2px; background: #000000; margin-top: 5em; }
+.footnotes, .footnotes_alt { padding-left: 1em;  }
+.footnotes_alt > li > .dagger { margin-left: -13px; }
+.footnotes_alt { list-style: none; } 
+
+.notes { font-size: 0.8em; }
+.note-number { margin-left: -1em; }
+
+.date { color: #00000090; font-size: 14px; margin-left: 4px; }
+
+footer { color: #00000090; }
+footer { font-size: 16px; margin-bottom: 5em; }
+footer > .separator { margin: 0 4px; }
+footer > a { margin-right: 5px; }
+footer > a:hover { color: #000; }
+
+/* syntax */
+.highlight .kd, .highlight .k {font-weight: bold; }
+.highlight .mi { color: blue; }
+.highlight .cm { color: grey; }
+
+.hljs-built_in, .hljs-keyword {font-weight: bold; }
+.hljs-string, hljs-number { color: blue; }
+.hljs-comment { color: grey; }
+      
+    </style>
+  </head>
+
+  <body>
+    <ul class="menu">
+  
+  <li>
+    <a class="menu__item " style="" href="/blog/">
+      Blog
+    </a>
+  </li>
+  
+  <li>
+    <a class="menu__item " style="" href="/talks/">
+      Talks
+    </a>
+  </li>
+  
+  <li>
+    <a class="menu__item " style="" href="/projects/">
+      Projects
+    </a>
+  </li>
+  
+  <li>
+    <a class="menu__item " style="" href="/about/">
+      About
+    </a>
+  </li>
+  
+</ul>
+
+
+<article>
+  <header>
+    
+    <h1 class="title">Who's watching the watchdog?</h1>
+  </header>
+  <p>At my current company we have an automated pipeline for processing customer’s orders. It’s pretty complex — talking to multiple different services, training models, storing large files, updating the database, sending emails and push notifications.</p>
+<p>Sometimes things get stuck because of a temporary 3rd party outage or a bug in our code.</p>
+<p>So we built a watchdog service: it monitors the stream of orders and makes sure the orders get processed within reasonable timeframe (3 hours). The watchdog only looks at the final invariant — was the order fulfilled and delivered to the customer? It doesn’t care about any intermediary steps.</p>
+<p>This system has saved us many times. When the watchdog finds a stuck order, it posts in our special channel in Slack. We investigate the problem and address the root cause, so hopefully we won’t see new orders stuck for the same reason.</p>
+<p>But who’s watching the watchdog? What if it fails to run?</p>
+<p>It actually happened to us once. The watchdog is running on the job scheduling system, and that system went down. That meant no orders were getting processed and watchdog also wasn’t running. The alerts channel in Slack was blissfully silent.</p>
+<p>To address this case, we need a system that can watch the watchdog. We are using these two:</p>
+<ul>
+<li><a href="https://docs.sentry.io/product/crons/">Sentry Cron</a></li>
+<li><a href="https://www.checklyhq.com/blog/heartbeat-monitoring-with-checkly/">Checkly Heartbeat</a></li>
+</ul>
+<p>The idea behind both systems is the same: they expect a regular cron job to “check in” on a pre-defined schedule. If it misses a check-in, there’s likely a problem and we get an alert in Slack.</p>
+<p>Complex systems always find surprising ways to fail. When adding an end-to-end quality watchdog (and ways to watch the watchdog) you can create a positive loop of detecting issues and hardening the system.</p>
+
+  <footer>
+    <time datetime="2023-11-22T12:00:00+00:00">Nov 22, 2023</time>
+  </footer>
+  
+
+
+
+
+
+
+  <div class="about">
+  <div class="about_inner">
+    <p>Hello! This text lives here to convince you to subscribe. If you are reading this, consider clicking that subscribe button for more details.</p>
+    <p>I write about programming, software design and side projects <a style="margin-left: 5px" class="btn-subscribe" href="/subscribe/" target="_blank"><svg viewBox="0 0 800 800"><path d="M493 652H392c0-134-111-244-244-244V307c189 0 345 156 345 345zm71 0c0-228-188-416-416-416V132c285 0 520 235 520 520z"/><circle cx="219" cy="581" r="71"/></svg> Subscribe</a></p>
+  </div>
+</div>
+
+</article>
+
+    <script>
+      window.GoogleAnalyticsObject = 'ga';
+      window.ga = window.ga || function() {
+        (window.ga.q = window.ga.q || []).push(arguments);
+      };
+      window.ga.l = Date.now();
+
+      ga('create', 'UA-96545608-1', 'auto');
+      ga(function(tracker) {
+        tracker.set('sendHitTask', function(model) {
+          var xhr = new XMLHttpRequest();
+          xhr.open('GET', 'https://curiosity-seven.vercel.app/api/ev?' + model.get('hitPayload'), true);
+          xhr.send();
+        });
+      });
+      ga('send', 'pageview');
+    </script>
+    <script async defer src="https://curiosity-seven.vercel.app/api/leet" async></script>
+  </body>
+</html>