Fancy Proofs

A natural deduction proof is a tree. In practice, reading and writing proof trees has some drawbacks. When proofs reach a certain size it can be difficult to lay out a tree-formatted proof on the page. Also, trees aren’t a great fit for the way we usually read text from beginning to end.

There’s an alternative syntax for natural deduction, which is basically a depth-first serialization of the proof tree. For instance, the tree

In other words, write the nodes of the tree as a numbered list, where each node is accompanied by a list of references to its children. There are many valid ways to serialize a tree, but a valid serialization represents a unique tree.
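The numbered-list idea can be sketched on a toy tree. This is a minimal illustration, not the module's code: `Tree`, `Line`, `serializeTree`, and `example` are hypothetical names, and the labels are plain strings standing in for judgements.

```haskell
-- A toy rose tree standing in for a proof tree (hypothetical type).
data Tree = Node String [Tree]
  deriving (Eq, Show)

-- One serialized line: its number, its label, and the numbers of its children.
type Line = (Int, String, [Int])

-- Post-order, depth-first serialization: all children are emitted before
-- their parent, so every reference points to an earlier line and the
-- root is the last line.
serializeTree :: Tree -> [Line]
serializeTree t = snd (go 0 t)
  where
    -- n is the last line number used so far. The result is the number
    -- assigned to the root of the subtree together with the lines emitted.
    go :: Int -> Tree -> (Int, [Line])
    go n (Node label kids) =
      let (n', refs, ls) = foldl step (n, [], []) kids
          k = n' + 1
      in (k, ls ++ [(k, label, refs)])
    -- Serialize one child, remembering the line number of its root.
    step (m, refs, ls) kid =
      let (r, ls') = go m kid
      in (r, refs ++ [r], ls ++ ls')

example :: Tree
example = Node "C" [Node "A" [], Node "B" [Node "A" []]]
```

Here `serializeTree example` evaluates to `[(1,"A",[]),(2,"A",[]),(3,"B",[2]),(4,"C",[1,3])]`. The leaf `A` is emitted twice; a hand-written serialization could instead let line 3 refer back to line 1.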

This is almost Fitch notation, with one exception – because a serialized proof represents a unique tree, it isn’t necessary to use indentation to signify what hypotheses have been introduced but not discharged. (It’s a good idea to do this anyway for human readers, but the proof checker doesn’t need it.)

The serialized syntax has some advantages. It doesn’t require arbitrarily deep indentation, which is a big problem for large proofs. It is more natural to read from top to bottom. And it allows shorter proofs when a subproof appears more than once, since the repeated subproof can be written once and referenced several times, although this is not very common.

In this module we’ll define an alternative abstract syntax for serialized proofs, which we’ll call fancy style.

In a concrete sense we can think of fancy style as being compiled to raw proof trees. In general a compiler translates expressions from one structured language to another, and the translation lets a simple form in the source language stand for a complex pattern in the target. Beyond Fitch-style notation, we will use this to augment fancy proofs with some additional features.

{-# LANGUAGE LambdaCase #-}
module Fancy where

import Data.List
import qualified Data.Map as M
import qualified Data.Set as S
import Test.QuickCheck

import Var
import Sub
import Expr
import Jud
import Proof

The Representation

Fancy proof statements will look suspiciously like the nodes in a nonfancy proof, except that subproofs will be replaced by references. The lines in a fancy proof are usually numbered, so the references can just be Ints.

There’s one special case. A very common pattern is the equational proof; one way to show that two expressions are equal is to give a list of equalities with the two expressions on either end. Doing this “raw”, in terms of eq-intro and eq-elim, is verbose and unenlightening. However, we can introduce a special syntactic form to condense these proofs quite a bit. This is the FancyChain line; we’ll explain it later.

data FancyProof
  = FancyProof (M.Map Int FancyProofLine)
  deriving (Eq, Show)

data FancyProofLine
  = FancyUse Loc RuleName Jud [Int]
  | FancyInvoke Loc RuleName Jud [Int] (Sub Jud)
  | FancyHyp Loc HypName Jud
  | FancyDis Loc HypName Jud Int
  | FancyIntroEq Loc Jud
  | FancyElimEq Loc Jud (Var Expr) Jud Int Int
  | FancyIntroU Loc Jud (Var Expr) (Var Expr) Int
  | FancyElimU Loc Jud (Var Expr) Expr Int
  | FancyIntroE Loc Jud (Var Expr) Expr Int
  | FancyElimE Loc Jud (Var Expr) (Var Expr) Int Int
  | FancyAssume Loc Int Jud
  | FancyChain Loc Expr [(Expr, Maybe (Var Expr, Expr), Bool, ChainRef)]
  deriving (Show)

data ChainRef
  = ChainUse Loc RuleName [Int]
  | ChainHyp Loc HypName
  | ChainAssume Loc Int
  deriving (Show)

For testing we’ll need an Eq instance for FancyProofLine that ignores the Loc parameters.

instance Eq FancyProofLine where
  l1 == l2 = case (l1,l2) of
    (FancyUse _ n1 q1 ps1, FancyUse _ n2 q2 ps2) ->
      (n1 == n2) && (q1 == q2) && (ps1 == ps2)
    (FancyInvoke _ n1 q1 ps1 t1, FancyInvoke _ n2 q2 ps2 t2) ->
      (n1 == n2) && (q1 == q2) && (ps1 == ps2) && (t1 == t2)
    (FancyHyp _ n1 q1, FancyHyp _ n2 q2) ->
      (n1 == n2) && (q1 == q2)
    (FancyDis _ n1 q1 p1, FancyDis _ n2 q2 p2) ->
      (n1 == n2) && (q1 == q2) && (p1 == p2)
    (FancyIntroEq _ e1, FancyIntroEq _ e2) ->
      (e1 == e2)
    (FancyElimEq _ q1 x1 w1 u1 v1, FancyElimEq _ q2 x2 w2 u2 v2) ->
      (q1 == q2) && (x1 == x2) && (w1 == w2) && (u1 == u2) && (v1 == v2)
    (FancyIntroU _ q1 x1 y1 p1, FancyIntroU _ q2 x2 y2 p2) ->
      (q1 == q2) && (x1 == x2) && (y1 == y2) && (p1 == p2)
    (FancyElimU _ q1 x1 e1 p1, FancyElimU _ q2 x2 e2 p2) ->
      (q1 == q2) && (x1 == x2) && (e1 == e2) && (p1 == p2)
    (FancyIntroE _ q1 x1 e1 p1, FancyIntroE _ q2 x2 e2 p2) ->
      (q1 == q2) && (x1 == x2) && (e1 == e2) && (p1 == p2)
    (FancyElimE _ q1 u1 x1 p1 h1, FancyElimE _ q2 u2 x2 p2 h2) ->
      (q1 == q2) && (u1 == u2) && (x1 == x2) && (p1 == p2) && (h1 == h2)
    (FancyAssume _ n1 q1, FancyAssume _ n2 q2) ->
      (n1 == n2) && (q1 == q2)
    (FancyChain _ e1 w1, FancyChain _ e2 w2) ->
      (e1 == e2) && (w1 == w2)
    _ -> False

instance Eq ChainRef where
  r1 == r2 = case (r1,r2) of
    (ChainUse _ n1 t1, ChainUse _ n2 t2) ->
      (n1 == n2) && (t1 == t2)
    (ChainHyp _ n1, ChainHyp _ n2) ->
      (n1 == n2)
    (ChainAssume _ a1, ChainAssume _ a2) ->
      (a1 == a2)
    _ -> False

instance Arbitrary FancyProofLine where
  arbitrary = oneof
    [ FancyUse <$> arbitrary <*> arbitrary <*> arbitrary
        <*> ((map abs) <$> arbitrary)
    , FancyInvoke <$> arbitrary <*> arbitrary <*> arbitrary
        <*> ((map abs) <$> arbitrary) <*> (return emptySub)
    , FancyHyp <$> arbitrary <*> arbitrary <*> arbitrary
    , FancyDis <$> arbitrary <*> arbitrary <*> arbitrary
        <*> (abs <$> arbitrary)
    , FancyIntroEq <$> arbitrary <*> arbitrary
    , FancyElimEq <$> arbitrary <*> arbitrary <*> arbitrary
        <*> arbitrary <*> (abs <$> arbitrary) <*> (abs <$> arbitrary)
    , FancyIntroU <$> arbitrary <*> arbitrary <*> arbitrary
        <*> arbitrary <*> (abs <$> arbitrary)
    , FancyElimU <$> arbitrary <*> arbitrary <*> arbitrary
        <*> arbitrary <*> (abs <$> arbitrary)
    , FancyIntroE <$> arbitrary <*> arbitrary <*> arbitrary
        <*> arbitrary <*> (abs <$> arbitrary)
    , FancyElimE <$> arbitrary <*> arbitrary <*> arbitrary
        <*> arbitrary <*> (abs <$> arbitrary) <*> (abs <$> arbitrary)
    , FancyAssume <$> arbitrary <*> (abs <$> arbitrary) <*> arbitrary
    ]

  shrink = \case
    FancyUse loc name q ps ->
      [ FancyUse loc name q' ps | q' <- shrink q ]
    FancyInvoke loc name q ps t ->
      [ FancyInvoke loc name q' ps t | q' <- shrink q ]
    FancyHyp loc name q ->
      [ FancyHyp loc name q' | q' <- shrink q ]
    FancyDis loc name q p ->
      [ FancyDis loc name q' p | q' <- shrink q ]
    FancyIntroEq loc w ->
      [ FancyIntroEq loc w' | w' <- shrink w ]
    FancyElimEq loc q x w pe pf ->
      [ FancyElimEq loc q' x w' pe pf |
          q' <- shrink q, w' <- shrink w ] ++
      [ FancyElimEq loc q x w' pe pf | w' <- shrink w ] ++
      [ FancyElimEq loc q' x w pe pf | q' <- shrink q ]
    FancyIntroU loc q x y p ->
      [ FancyIntroU loc q' x y p | q' <- shrink q ]
    FancyElimU loc q x e p ->
      [ FancyElimU loc q' x e' p | q' <- shrink q, e' <- shrink e ] ++
      [ FancyElimU loc q' x e p | q' <- shrink q ] ++
      [ FancyElimU loc q x e' p | e' <- shrink e ]
    FancyIntroE loc q x e p ->
      [ FancyIntroE loc q' x e' p | q' <- shrink q, e' <- shrink e ] ++
      [ FancyIntroE loc q' x e p | q' <- shrink q ] ++
      [ FancyIntroE loc q x e' p | e' <- shrink e ]
    FancyElimE loc q x w pe pf ->
      [ FancyElimE loc q' x w' pe pf |
          q' <- shrink q, w' <- shrink w ] ++
      [ FancyElimE loc q x w' pe pf | w' <- shrink w ] ++
      [ FancyElimE loc q' x w pe pf | q' <- shrink q ]
    FancyAssume loc t q ->
      [ FancyAssume loc t q' | q' <- shrink q ]
    FancyChain loc q ps ->
      [ FancyChain loc q' ps' | q' <- shrink q, ps' <- shrink ps ]

instance Arbitrary ChainRef where
  arbitrary = oneof
    [ ChainUse <$> arbitrary <*> arbitrary <*> ((map abs) <$> arbitrary)
    , ChainHyp <$> arbitrary <*> arbitrary
    , ChainAssume <$> arbitrary <*> (abs <$> arbitrary)
    ]

instance Arbitrary FancyProof where
  arbitrary = do
    m <- M.mapKeys abs <$> arbitrary
    k <- abs <$> arbitrary
    a <- arbitrary
    return $ FancyProof $ M.insert k a m

  shrink (FancyProof m) = map FancyProof $ shrink m

We will not directly validate fancy proofs – instead we will convert them to their canonical nonfancy form and validate that. The fancy form is just an alternative abstract (and later, concrete) syntax.

Deserialization of a fancy proof can fail in a few ways. A step might refer to a nonexistent step. A step might refer to itself or to a later step; we forbid this so that deserialization is guaranteed to terminate. We also demand that the step labels be natural numbers. The remaining error cases cover empty proofs and malformed equation chains.

data DeserializeError
  = StepNotPresent Int
  | NegativeRef Int
  | ForwardRef Int Int
  | EmptyProof
  | EmptyChain
  | ChainMatchError
  | ChainBadSub (Var Expr) (Sub Expr)
  deriving (Eq, Show)

deserialize :: FancyProof -> Either DeserializeError Proof
deserialize (FancyProof m) = case M.lookupMax m of
  Nothing -> Left EmptyProof
  Just (t,_) -> deserializeAt t
  where
    deserializeAt k = if k < 0
      then Left $ NegativeRef k
      else case M.lookup k m of
        Nothing -> Left $ StepNotPresent k
        Just t -> case t of
          FancyHyp loc name q -> Right $ Hyp loc name q
          FancyDis loc name q u -> do
            p <- deserializeBelowAt k u
            return $ Dis loc name q p
          FancyIntroEq loc q -> do
            return $ IntroEq loc q
          FancyElimEq loc q x e u v -> do
            pu <- deserializeBelowAt k u
            pv <- deserializeBelowAt k v
            return $ ElimEq loc q x e pu pv
          FancyIntroU loc q x y u -> do
            p <- deserializeBelowAt k u
            return $ IntroU loc q x y p
          FancyElimU loc q x e u -> do
            p <- deserializeBelowAt k u
            return $ ElimU loc q x e p
          FancyIntroE loc q x e u -> do
            p <- deserializeBelowAt k u
            return $ IntroE loc q x e p
          FancyElimE loc q x e u v -> do
            pu <- deserializeBelowAt k u
            pv <- deserializeBelowAt k v
            return $ ElimE loc q x e pu pv
          FancyAssume loc t q -> Right $ Assume loc t q
          FancyUse loc name q us -> do
            ps <- mapM (deserializeBelowAt k) us
            return $ Use loc name q ps
          FancyInvoke loc name q us t -> do
            ps <- mapM (deserializeBelowAt k) us
            return $ Invoke loc name q ps t
          FancyChain loc e ws ->
            deserializeChainAt k e (reverse ws)

    deserializeBelowAt k u =
      if u >= k
        then Left $ ForwardRef u k
        else do
          p <- deserializeAt u
          return p
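The reference checks can be seen in miniature on a toy listing. The following is a sketch under simplifying assumptions, not the module's code: `Tree`, `ToyError`, `Lines`, and `toyDeserialize` are hypothetical names, and each line carries only a string label and its child references.

```haskell
import qualified Data.Map as M

-- A toy rose tree standing in for a proof tree (hypothetical type).
data Tree = Node String [Tree]
  deriving (Eq, Show)

-- Toy counterparts of the reference-related failure modes.
data ToyError
  = ToyEmpty
  | ToyNegativeRef Int
  | ToyStepNotPresent Int
  | ToyForwardRef Int Int
  deriving (Eq, Show)

-- Each numbered line carries a label and references to its children.
type Lines = M.Map Int (String, [Int])

-- Rebuild the tree rooted at the highest-numbered line.
toyDeserialize :: Lines -> Either ToyError Tree
toyDeserialize m = case M.lookupMax m of
  Nothing -> Left ToyEmpty
  Just (k, _) -> at k
  where
    at k
      | k < 0 = Left (ToyNegativeRef k)
      | otherwise = case M.lookup k m of
          Nothing -> Left (ToyStepNotPresent k)
          Just (label, refs) -> Node label <$> mapM (below k) refs
    -- A reference must point strictly backward. Line numbers are bounded
    -- below by zero, so the recursion terminates.
    below k u
      | u >= k = Left (ToyForwardRef u k)
      | otherwise = at u
```

For instance, `toyDeserialize (M.fromList [(1,("A",[2])),(2,("B",[1]))])` starts at line 2, follows the reference to line 1, and then rejects line 1's reference to line 2 with `Left (ToyForwardRef 2 1)`.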

Let’s talk about deserializing chains. The idea here is that we’ve got a list of expressions \(E_1\), \(E_2\), …, \(E_k\), each one equal to the last, and we want to expand this to a proof tree with \(E_1 = E_k\) at the root. We can do this by recursively constructing trees for \(E_1 = E_2\), \(E_1 = E_3\) and so on as follows.

For the general case, suppose we can build a proof tree for an equation chain of length \(n\). Given one of length \(n+1\),

Letting \(x\) be fresh in \(E_1\), \(E_n\), and \(E_{n+1}\), we can build a proof tree for \(E_1 = E_{n+1}\) like so.

Abbreviating equation chains like this makes the reasoning much closer to the way we’d write an equational proof by hand – we can have both formal verification and readability.
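The shape of this expansion can be sketched with a toy proof type. This is only an illustration under loud assumptions: expressions are plain strings, `EqProof`, `conclusion`, and `expandChain` are hypothetical names, and the `Trans` constructor stands in for the eq-elim construction that the real code uses to glue two equations together.

```haskell
-- A toy proof of an equation between expressions (here just strings).
data EqProof
  = Step String String String   -- justification, left side, right side
  | Trans EqProof EqProof       -- from a = b and b = c, conclude a = c
  deriving (Eq, Show)

-- The equation a proof concludes, or Nothing if some Trans node joins
-- equations whose middle terms do not match.
conclusion :: EqProof -> Maybe (String, String)
conclusion (Step _ a b) = Just (a, b)
conclusion (Trans p q) = do
  (a, b) <- conclusion p
  (b', c) <- conclusion q
  if b == b' then Just (a, c) else Nothing

-- Expand the chain E1 = E2 = ... = Ek into one proof of E1 = Ek.
-- Each link carries the next expression and the justification for it.
-- The accumulator proves E1 = E2, then E1 = E3, and so on.
expandChain :: String -> [(String, String)] -> Maybe EqProof
expandChain _ [] = Nothing
expandChain e1 ((e2, why) : rest) = Just (go (Step why e1 e2) e2 rest)
  where
    go acc _ [] = acc
    go acc prev ((next, w) : more) =
      go (Trans acc (Step w prev next)) next more
```

With the chain `expandChain "a+0+0" [("a+0", "right identity"), ("a", "right identity")]`, the result is a `Trans` of two `Step` nodes whose `conclusion` is `Just ("a+0+0", "a")`. An empty chain yields `Nothing`, mirroring the `EmptyChain` error.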

    deserializeChainAt
      :: Int -> Expr -> [(Expr, Maybe (Var Expr, Expr), Bool, ChainRef)]
      -> Either DeserializeError Proof
    deserializeChainAt k e ws = case ws of
      [] -> Left EmptyChain
      (e2, h, flop, ref):rest -> do
        let
          (w,f) = case h of
            Just m -> m
            Nothing -> let g = fresh [ freeExprVars e, freeExprVars e2 ]
              in (g, EVar Q g)
        case rest of
          [] -> do
            u <- matchVar w f $ if flop then e2 else e
            v <- matchVar w f $ if flop then e else e2
            p <- deserializeChainRefAt k u v flop ref
            return $
              ElimEq Q (JEq Q e e2) w (JEq Q e f)
                p (IntroEq Q (JEq Q e e))
          (e3,_,_,_):_ -> do
            u <- matchVar w f $ if flop then e2 else e3
            v <- matchVar w f $ if flop then e3 else e2
            p <- deserializeChainRefAt k u v flop ref
            let x = fresh [ freeExprVars e, freeExprVars e2, freeExprVars e3 ]
            p2 <- deserializeChainAt k e rest
            return $
              ElimEq Q (JEq Q e e2) x (JEq Q e (EVar Q x))
                (ElimEq Q (JEq Q e3 e2) w (JEq Q e3 f)
                  p (IntroEq Q (JEq Q e3 e3)))
                p2
      where
        deserializeChainRefAt
          :: Int -> Expr -> Expr -> Bool -> ChainRef -> Either DeserializeError Proof
        deserializeChainRefAt k u v flop ref = do
          let
            p = JEq Q u v
          pf <- case ref of
            ChainHyp loc name -> Right $ Hyp loc name p
            ChainAssume loc i -> Right $ Assume loc i p
            ChainUse loc name ds -> do
              ps <- mapM (deserializeBelowAt k) ds
              return $ Use loc name p ps
          if flop
            then do
              let
                x = fresh [ freeExprVars u, freeExprVars v ]
              return $
                ElimEq Q (JEq Q v u) x (JEq Q (EVar Q x) u)
                  pf (IntroEq Q (JEq Q u u))
            else return pf

        matchVar
          :: Var Expr -> Expr -> Expr
          -> Either DeserializeError Expr
        matchVar w f e = case matchExpr f e of
          Nothing -> Left ChainMatchError
          Just s -> case applySub w s of
            Just u -> return u
            Nothing -> Left $ ChainBadSub w s

Serialization

In practice there isn’t really a good reason to turn a nonfancy proof into a fancy proof, since the most important thing we do with proofs is validate them.

But a good test of our deserialization function is that deserializing a serialized nonfancy proof should be the identity. In the spirit of overdoing it, let’s write that test.

In the following code, we’ll be implicitly assuming that the lines of a proof are numbered contiguously from 1 to \(n\). This won’t be checked, since this is just test code which should break if that invariant is not satisfied. This constraint is not necessary when we actually use the library, but makes the test much simpler.

We need a couple of utilities. First is catFancyProof, which concatenates two fancy proofs and updates the line labels in the second.

catFancyProof :: FancyProof -> FancyProof -> FancyProof
catFancyProof p@(FancyProof m1) (FancyProof m2) =
  let k = sizeOf p in
  FancyProof $ M.union m1 (fmap (bump k) $ M.mapKeys (k+) m2)
  where
    bump :: Int -> FancyProofLine -> FancyProofLine
    bump k = \case
      FancyUse loc name q ps -> FancyUse loc name q (map (k+) ps)
      FancyInvoke loc name q ps t -> FancyInvoke loc name q (map (k+) ps) t
      FancyHyp loc name q -> FancyHyp loc name q
      FancyDis loc name q p -> FancyDis loc name q (k+p)
      FancyIntroEq loc w -> FancyIntroEq loc w
      FancyElimEq loc q x w u v -> FancyElimEq loc q x w (k+u) (k+v)
      FancyIntroU loc q x y p -> FancyIntroU loc q x y (k+p)
      FancyElimU loc q x e p -> FancyElimU loc q x e (k+p)
      FancyIntroE loc q x e p -> FancyIntroE loc q x e (k+p)
      FancyElimE loc q x w u v -> FancyElimE loc q x w (k+u) (k+v)
      FancyAssume loc t q -> FancyAssume loc t q
      FancyChain loc w qs -> FancyChain loc w (map (f k) qs)

    bumpChainRef :: Int -> ChainRef -> ChainRef
    bumpChainRef k = \case
      ChainUse loc name us -> ChainUse loc name (map (k+) us)
      ChainHyp loc name -> ChainHyp loc name
      ChainAssume loc i -> ChainAssume loc i

    f k (x,w,flop,ref) = (x,w,flop,bumpChainRef k ref)

sizeOf :: FancyProof -> Int
sizeOf (FancyProof m) = case M.lookupMax m of
  Nothing -> 0
  Just (k,_) -> k
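The relabeling is easier to see on a stripped-down listing. The sketch below uses hypothetical names (`Lines`, `size`, `cat`) and assumes each line is just a label with a list of references; it mirrors the idea of shifting both the keys and the references of the second proof.

```haskell
import qualified Data.Map as M

-- Each numbered line carries a label and references to earlier lines.
type Lines = M.Map Int (String, [Int])

-- The highest line number in use, or 0 for an empty listing.
size :: Lines -> Int
size = maybe 0 fst . M.lookupMax

-- Append the second listing after the first, shifting both its labels
-- and its internal references by the size of the first.
cat :: Lines -> Lines -> Lines
cat m1 m2 = M.union m1 (M.map bump (M.mapKeys (k +) m2))
  where
    k = size m1
    bump (label, refs) = (label, map (k +) refs)
```

Concatenating `M.fromList [(1,("A",[])),(2,("B",[1]))]` with `M.fromList [(1,("C",[])),(2,("D",[1]))]` gives lines 1 to 4, where line 4 is `("D",[3])`: the reference moved along with the line it points to.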

We’re now prepared to serialize proofs. The result is a list of fancy proofs, because there’s generally more than one way to serialize a proof. For simplicity’s sake we don’t enumerate all of them.

serialize :: Proof -> [FancyProof]
serialize = \case
  Assume loc k p ->
    [ FancyProof $ M.fromList [ (1, FancyAssume loc k p) ] ]
  Hyp loc name p ->
    [ FancyProof $ M.fromList [ (1, FancyHyp loc name p) ] ]
  Dis loc name q p -> do
    m <- serialize p
    let k = sizeOf m
    [ append m $ FancyProof $ M.fromList
      [ (1, FancyDis loc name q k) ] ]
  IntroEq loc w ->
    [ FancyProof $ M.fromList [ (1, FancyIntroEq loc w) ] ]
  ElimEq loc w x q pe pf -> do
    me <- serialize pe
    mf <- serialize pf
    let ke = sizeOf me
    let kf = sizeOf mf
    [   append (catFancyProof me mf) $ FancyProof $ M.fromList
        [ (1, FancyElimEq loc w x q ke (ke+kf)) ]
      , append (catFancyProof mf me) $ FancyProof $ M.fromList
        [ (1, FancyElimEq loc w x q (ke+kf) kf) ]
      ]
  IntroU loc w x y p -> do
    m <- serialize p
    let k = sizeOf m
    [ append m $ FancyProof $ M.fromList
      [ (1, FancyIntroU loc w x y k) ] ]
  ElimU loc w x e p -> do
    m <- serialize p
    let k = sizeOf m
    [ append m $ FancyProof $ M.fromList
      [ (1, FancyElimU loc w x e k) ] ]
  IntroE loc w x e p -> do
    m <- serialize p
    let k = sizeOf m
    [ append m $ FancyProof $ M.fromList
      [ (1, FancyIntroE loc w x e k) ] ]
  ElimE loc w x q pe pf -> do
    me <- serialize pe
    mf <- serialize pf
    let ke = sizeOf me
    let kf = sizeOf mf
    [   append (catFancyProof me mf) $ FancyProof $ M.fromList
        [ (1, FancyElimE loc w x q ke (ke+kf)) ]
      , append (catFancyProof mf me) $ FancyProof $ M.fromList
        [ (1, FancyElimE loc w x q (ke+kf) kf) ]
      ]
  Use loc name q ps -> do
    ms <- mapM serialize ps
    let ks = map sizeOf ms
    let accum = map sum . tail . inits
    [ append
      (foldr catFancyProof (FancyProof mempty) ms)
        (FancyProof $ M.fromList
          [ (1, FancyUse loc name q (accum ks)) ]) ]
  Invoke loc name q ps t -> do
    ms <- mapM serialize ps
    let ks = map sizeOf ms
    let accum = map sum . tail . inits
    [ append
      (foldr catFancyProof (FancyProof mempty) ms)
        (FancyProof $ M.fromList
          [ (1, FancyInvoke loc name q (accum ks) t) ]) ]

  where
    append p@(FancyProof m1) (FancyProof m2) =
      let k = sizeOf p in
      FancyProof $ M.union m1 (M.mapKeys (k+) m2)

And we can test that deserializing a serialized proof returns the same proof. Going in the other direction is less straightforward, since fancy proofs are subject to syntactic validation.
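The round-trip property can be stated compactly on a toy model. Everything here is a hypothetical stand-in (`Tree`, `Lines`, `toLines`, `fromLines`, `roundTrips`), with string labels in place of judgements and `Maybe` in place of the error type. In the module itself the analogous check would presumably quantify over arbitrary proofs with QuickCheck and over every element of the list returned by `serialize`; that is an assumption, not something shown in the text.

```haskell
import qualified Data.Map as M

-- A toy rose tree standing in for a proof tree (hypothetical type).
data Tree = Node String [Tree]
  deriving (Eq, Show)

type Lines = M.Map Int (String, [Int])

-- Post-order numbering from 1: children first, root last.
toLines :: Tree -> Lines
toLines t = snd (go 0 t)
  where
    go n (Node label kids) =
      let (n', refs, m) = foldl step (n, [], M.empty) kids
          k = n' + 1
      in (k, M.insert k (label, refs) m)
    step (n, refs, m) kid =
      let (r, m') = go n kid
      in (r, refs ++ [r], M.union m m')

-- Rebuild from the highest-numbered line, following only references
-- that point strictly backward.
fromLines :: Lines -> Maybe Tree
fromLines m = M.lookupMax m >>= at . fst
  where
    at k = do
      (label, refs) <- M.lookup k m
      kids <- mapM (\u -> if u < k then at u else Nothing) refs
      Just (Node label kids)

-- The round-trip property: deserializing a serialized tree is the identity.
roundTrips :: Tree -> Bool
roundTrips t = fromLines (toLines t) == Just t
```

The opposite direction does not hold in general, even in the toy: a listing with a forward reference or a missing line has no tree to round-trip through.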