[spec] Extended subtype validity relation missing validity on supertype

The spec defines the following [generalised version](https://github.com/WebAssembly/spec/blob/f9c743aeccadabbcdf7b05508b95d753d9fa1bcd/specification/wasm-3.0/2.1-validation.types.spectec#L161) for validity of `subtype` that allow the supertype to be a general `typeuse` instead of just a type index `_IDX` (this corresponds to the extension of `typeuse` from just an index to also include `rec n` and `deftype`).:
```
rule Subtype_ok2:
  C |- SUB FINAL? typeuse* compttype : OK x i
  -- if |typeuse*| <= 1
  -- (if $before(typeuse, x, i))*
  -- (if $unrollht(C, typeuse) = SUB typeuse'* comptype')*
  ----
  -- Comptype_ok: C |- comptype : OK
  -- (Comptype_sub: C |- comptype <: comptype')*
```
But the rule is missing a check that the `typeuse` itself needs to be valid (`(Typeuse_ok: C |- typeuse : OK)*`). Note that the original version of `subtype_ok` does not require such a check, since only a type index `_IDX` is allowed. 

This only affects the metatheoretic soundness proofs, since the extended supertype definition cannot appear in real modules, and the above definition is only involved in the soundness proofs. I noticed that this definition is rendered in the appendix despite being formulated in 2.1, so maybe it can be fixed as a part of #2125 .

The specific soundness proof that got me stuck and raise this issue is a bit involved -- I include a description of the issue below.

<details> 
<summary>Long text of explanation </summary>

I've crafted a mechanised interpreter and have been trying to proving interpreter correctness against the opsem. Several instructions (e.g. ref.test, ref.cast, br_on_cast) require the matching relation to be dynamically checked at runtime and return different results depending on whether the matching holds. The interpreter calls a computable matching oracle (following the design of the reference interpreter's match.ml), and correctness of the "else" branch requires oracle completeness: if the declarative relation Heaptype_sub C ht1 ht2 holds, then the oracle returns true.

The completeness proof is by structural induction on Heaptype_sub. The critical case is Heaptype_sub/trans:
```
rule Heaptype_sub/trans:
  C |- ht_1 <: ht_2
  -- Heaptype_ok: C |- ht' : OK
  -- Heaptype_sub: C |- ht_1 <: ht'
  -- Heaptype_sub: C |- ht' <: ht_2
```
The inductive hypotheses give oracle results for ht_1 <: ht' and ht' <: ht_2, but combining them requires proving oracle transitivity -- which in turn requires Heaptype_ok on both endpoints (instead of just the middle heaptype) -- otherwise there are counterexamples with ill-formed heaptypes where the oracle is not transitive while the declarative rules are transitive (see below). Nevertheless, I don't consider this to be a bug for the declarative rules of heaptype matching, since it only manifests for ill-typed heaptypes at the endpoints. However, this forces the completeness theorem to carry Heaptype_ok as a precondition on both endpoints.

Now, Deftype_sub/super says:
```
rule Deftype_sub/super:
  C |- deftype_1 <: deftype_2
  -- if $unrolldt(deftype_1) = SUB final? typeuse* ct
  -- Heaptype_sub: C |- typeuse*[i] <: deftype_2
```
To prove oracle completeness for this case, it is needed to invoke the completeness result on the `Heaptype_sub` premise, which requires `Heaptype_ok C (.typeuse (typeuse*[i]))`. Since `typeuse*[i]` comes from `$unrolldt(deftype_1)`, this can only be from `Deftype_ok C deftype_1`. This is the case for `_IDX` supertypes (validity from context lookup) and `REC` supertypes (validity from the same `Rectype_ok`). But for inline `deftype` supertypes, `Subtype_ok2` only checks `$unrollht` (unrollability) without checking `Typeuse_ok` (validity), so `Deftype_ok` does not guarantee `Heaptype_ok` for the supertype. Compounded with the above requirement of heaptype_ok in the completeness proof, a counterexample can be crafted by plugging in the above ill-typed transitivity example into the supertype of a deftype.

Note that there's no concrete module-level counterexample, because the module format does not support generalised supertypes. The following is an encoding of the counterexample directly in the OCaml AST.
```ocaml
let debug_inline_supertype_witness () =
    let open Types in
    let open Match in

    let show_bool label b =
      Printf.printf "%s = %b\n" label b
    in

    let dt_func : deftype =
      DefT (RecT [
        SubT (NoFinal, [], FuncT ([], []))
      ], 0l)
    in

    let dt_bad : deftype =
      DefT (RecT [
        SubT (NoFinal, [Def dt_func], StructT [])
      ], 0l)
    in

    let dt_outer : deftype =
      DefT (RecT [
        SubT (NoFinal, [Def dt_bad], StructT [])
      ], 0l)
    in

    let c : Match.context = [] in

    let rt_func : reftype =
      (Null, UseHT (Def dt_func))
    in

    let rt_outer : reftype =
      (Null, UseHT (Def dt_outer))
    in

    let rt_abs_func : reftype =
      (Null, FuncHT)
    in

    let bad_sub_func =
      match_deftype c dt_bad dt_func
    in

    let outer_sub_func =
      match_deftype c dt_outer dt_func
    in

    let func_sub_abs =
      match_reftype c rt_func rt_abs_func
    in

    let outer_sub_abs =
      match_reftype c rt_outer rt_abs_func
    in

    Printf.printf "dt_func  = %s\n" (string_of_deftype dt_func);
    Printf.printf "dt_bad   = %s\n" (string_of_deftype dt_bad);
    Printf.printf "dt_outer = %s\n\n" (string_of_deftype dt_outer);

    show_bool "match_deftype [] dt_outer dt_func" outer_sub_func;
    show_bool "match_reftype [] (Null, UseHT (Def dt_func)) (Null, FuncHT)" func_sub_abs;
    show_bool "match_reftype [] (Null, UseHT (Def dt_outer)) (Null, FuncHT)" outer_sub_abs;

    print_endline "";
```
The first two matchings are true while the last one is false, and `dt_outer/dt_func` are both valid deftypes given the definition in the current appendix (as subtype validity does not check supertype validity).

</details>


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[spec] Extended subtype validity relation missing validity on supertype #2141

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[spec] Extended subtype validity relation missing validity on supertype #2141

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions