from Hacker News

Systems Correctness Practices at AWS: Leveraging Formal and Semi-Formal Methods

by yarapavan on 4/1/25, 2:59 PM with 52 comments

  • by jlcases on 4/1/25, 5:38 PM

    I've noticed that the formalization of methods described by AWS parallels what we need in technical documentation. Complex systems require not just formal verification but also structured documentation following MECE principles (Mutually Exclusive, Collectively Exhaustive).

    In my experience, the interfaces between components (where most errors occur) are exactly where fragmented documentation fails. I implemented a hierarchical documentation system for my team that organizes knowledge as a conceptual tree, and the accuracy of code generation with AI assistants improved notably.

    Formal verification tools and structured documentation are complementary: verification ensures algorithmic correctness while MECE documentation guarantees conceptual and contextual correctness. I wonder if AWS has experimented with structured documentation systems specifically for AI, or if this remains an area to explore.

  • by Cyphase on 4/2/25, 1:15 AM

    Leslie Lamport gave the closing keynote at SCaLE 22x this year, talking about formal methods and TLA+. He mentioned some previous work Amazon has done in that area.

    https://www.youtube.com/watch?v=tsSDvflzJbc

    > Coding isn't Programming - Closing Keynote with Leslie Lamport - SCaLE 22x

  • by pera on 4/1/25, 4:32 PM

    > we also sought a language that would allow us to model check (and later prove) key aspects of systems designs while being more approachable to programmers.

    I find it a bit surprising that TLA+ with PlusCal can be challenging to learn for your average software engineer, could anyone with experience in P show an example of something that can be difficult to express in TLA+ which is significantly easier in P?

  • by gqgs on 4/1/25, 7:17 PM

    A key concern I've consistently had regarding formal verification systems is: how does one confirm the accuracy of the verifier itself?

    This issue appears to present an intrinsically unsolvable problem, implying that a formally verified system could still contain bugs due to potential issues in the verification software.

    While this perspective doesn't necessarily render formal verification impractical, it does introduce certain caveats that, in my experience, are not frequently addressed in discussions about these systems.

  • by csbartus on 4/1/25, 6:08 PM

    I've recently created a likely-correct piece of software based on these principles.

    https://www.osequi.com/studies/list/list.html

    The structure (ontology, taxonomy) is created with ologs, a formal method from category theory. The behavior (choreography) is created with a semi-formal implementation (XState) of a formal method (Finite State Machines)

    The user-facing aspect of the software is designed with Concept Design, a semi-formal method from MIT CSAIL.

    Creating software with these methods is refreshing and fun. Maybe one day we can reach Tonsky's "Diagrams are code" vision.

    https://tonsky.me/blog/diagrams/

  • by OhMeadhbh on 4/1/25, 4:06 PM

    I find this highly unlikely. My first day at Amazon I encountered an engineer puzzling over a perfect sine wave in a graph. After looking at the scale I made the comment "oh. you're using 5 minute metrics." Their response was "how could you figure that out just by looking at the graph?" When I replied "Queuing theory and control theory," their response was "what's that?"

    Since then, Amazon's hiring practices have only gotten worse. Can you invert a tree? Can you respond "tree" or "hash map" when you're asked what is the best data structure for a specific situation? Can you solve a riddle or code an ill-explained l33tcode problem? Are you sure you can parse HTML with regexes? You're Amazon material.

    Did you pay attention to the lecture about formal proofs. TLA+ or Coq/Kami? That's great, but it won't help you get a job at Amazon.

    The idea that formal proofs are used anywhere but the most obscure corners of AWS is laughable.

    Although... it is a nice paper. Props to Amazon for supporting Ph.D.'s doing pure research that will never impact AWS' systems or processes.

  • by nullorempty on 4/1/25, 4:08 PM

    And what teams use these methods exactly?
  • by neuroelectron on 4/1/25, 11:23 PM

    Great April 1st gag. Seems to have gone over everyone's head.