1. 13
  1.  

  2. 4

    I’m not aware of any other company that goes to these lengths to make their service so reliable.

    I’d be really interested in seeing the lengths that Google goes to.

    1. 4

      There’s some insight into that in this talk https://www.youtube.com/watch?v=H4vMcD7zKM0

    2. 4

      “I’m not aware of any other company that goes to these lengths to make their service so reliable.”

      Probably any company whose services are high in usage but low in downtime. One might also automatically count most systems running on VMS Clusters, Tandem/HP NonStop, and Stratus. They tend to pick those for high availability with careful development practices that leverage their availability-enhancing properties. There is also a niche in the market that buys development tooling specifically for high-quality despite quick development. That’s the Ada community for sure. Add a subset of Common LISP, Smalltalk, and Eiffel companies. Add those using conservative languages w/ lots of static analysis and testing. In embedded, add companies using RTOS’s such as QNX or INTEGRITY. In FOSS, Minix 3 and Muen have similar aims with Linux and BSD modifications happening, too.

      So, there’s a lot of companies and projects doing this kind of stuff at every layer of the stack.