*start*
07201 00024 USt
Date: 7 Aug. 1981 4:41 pm PDT (Friday)
From: Mitchell.PA
Subject: Current Level 0/1 Interdoc status/rev. 15
To: Mitchell, Horning
Last edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
We envision an Interdoc script being processed in any manner equivalent to the
following:
Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.
BASIC INTERDOC
GRAMMAR
node ::= "{" labels expression* "}"
labels ::= [label* ":"]
label ::= "#" name
expression ::= [ lhs ] [ "'" | "." | op ] rhs
rhs ::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
[ "VAL" ] "(" expression* ")"
literal ::= Boolean | integer | hexint | real | string | label
name ::= id ( "." id)*
id ::= (letter | "" ) ( letter | "" | digit )* -- "" is the null id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs ::= name binding
binding ::= "=" | ":" | ":=" | "←"
op ::= "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
"LT" | " LE" | "EQ" | "NE" | "GE" | "GT"
SYNTACTIC EXAMPLES:
{#examplenode:
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
Environments bind expressions and "modes" to identifiers:
Null denotes the "empty" environment
[E | id←e, m] means "E with (e, m) bound to id"
E(id) denotes the value locally bound to id in E
Null(id) = id
[E | id'←e, m](id) = if id=id' then e else E(id)
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id'←e, m]) =
if id=id' then m else locBinding(id, E)
-- Basis
R<>(E) = nothing -- just what it says
T<>(E) = E -- Identity
-- Expression sequence
R<nothing e*>(E) = R<e*>(E) -- "nothing" disappears
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<"'" p>(E) = p
T<"'" p>(E) = E
R<literal>(E) = literal
T<literal>(E) = E
R<id>(E) = if valOf(id, E)=id then id else R<valOf(id, E)>(E)
T<id>(E) = if valOf(id, E)=id then E else T<valOf(id, E)>(E)
R<p "." id>(E) = R<id>(R<p>(E))
T<p "." id>(E) = if valOf(id, R<p>(E))=id then E else T<valOf(id, R<p>(E))>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<p1 op p2>(E) = apply(op, p1, p2, E)
T<p1 op p2>(E) = E
R<"NOT" p>(E) = if R<p>(E)=True then False else True
T<"NOT" p>(E) = E
R<n m op e>(E) = nothing -- Empty list
T<n m op e>(E) = bindq(n, m, apply(op, n, e, E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | Outer←E, Const]) "}"
T<"{" labels e* "}">(E) = (T<Sub e*>([Null | Outer←E, Const]))(Outer)
R<"(" e* ")">(E) = [T<"(" e* ")">(E) | Outer←Null, Const]
T<"(" e* ")">(E) = T<e*>([Null | Outer←E, Const])
R<"VAL(" e* ")">(E) = R<e*>(E)
T<"VAL(" e* ")">(E) = E
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding(Outer, E) ~= None => whereBound(id, E(Outer))
True => Null
valOf(id, E) = (whereBound(id, E))(id) -- Gets innermost value
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
bindq(id, m, e, E) =
bindingOf(id, E) = "=" => E
m = ":=" => assign(id, e, E)
True => [E | id←e, m]
bindq(id.n, m, e, E) = [E | id←bindq(n, m, e, R<id>(E)), bindingOf(id, E)]
assign(id, e, E) = locBinding(id, E) = ":" => [E | id←e, ":"]
bindingOf(id, E) = ":" => bindq(Outer.id, ":=", e, E)
True => E
apply(op, lhs, rhs, E) =
op = "" => R<rhs>(E)
op = "." => R<rhs>([R<lhs>(E) | Outer←E, Const])
op = "+" => R<lhs>(E)+R<rhs>(E)
. . .
Missing or in question:
literal sequences
binary & relational operators
-------------------
Expressions in an Interdoc script may denote
literal values:
Boolean: (F, T)
integer: ... -3, -2, -1, 0, 1, 2, 3, ...
real: 1.2E5, . . .
string: <this is a string>
label: #A123, #anId, #Paragraph
the empty environment: Null
the empty list: NIL
id: (the null id), bold, thisIsAnId, Helvetica, . . .
(unless bound, taken to denote a primitive)
environments
unevaluated expressions
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bindq(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in E(Outer). Perverse explicit bindings to outer might create loops,
leaving some ids undefined, but there seems to be little reason to clutter up the
semantics by forbidding such assignments.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.
Braces create a nested environment; if preceded by a dot, it is initialized to the
value of the name in the binding; if preceded by VAL, it is executed for value,
and the environment is then discarded.
Semantics of labels:
A label #id on a node in the dominant structure gives that node membership in
the set named by id. Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Last edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Last edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
*start*
07956 00024 USt
Date: 13 Aug. 1981 5:36 pm PDT (Thursday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 16
To: Mitchell, Horning
[Jim,
I did a fair amount of doodling, primarily in an attempt to transform the semantic
equations to a form where their correctness will be "more nearly obvious." Some
of the changes I like, but I'm not really wedded to any of them.]
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
In question:
treatment of primitive identifiers
side effects in expressions
operations on nested environments: font←.(size←10)
Missing:
literal sequences
-------------------
We envision an Interdoc script being processed in any manner equivalent to the
following:
Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.
BASIC INTERDOC
GRAMMAR
node ::= "{" labels expression* "}"
labels ::= [label* ":"]
label ::= "#" name
expression ::= [ lhs ] [ "'" | op ] rhs
rhs ::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
[ "ENV" ] "(" expression* ")"
literal ::= Boolean | integer | hexint | real | string | label
name ::= id ( "." id)*
id ::= (letter | "" ) ( letter | "" | digit )* -- "" is the null id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs ::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op ::= "." | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
"LT" | " LE" | "EQ" | "NE" | "GE" | "GT"
SYNTACTIC EXAMPLE:
{#examplenode:
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E,
if locBinding(id, E) ~= None
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
N. B.
T<construct>(E) = E , if no explicit value is given below.
-- Basis
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, e, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, e, E) -- Assign at right level
True => [E | id m e]
bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]
assign(id, e, E) = locBinding(id, E) = ":" => [E | id ":" e]
bindingOf(id, E) = ":" => bind("Outer".id, ":=", e, E)
True => E -- Can only assign to vars
-------------------
Expressions in an Interdoc script may denote
literal values:
Boolean: (F, T)
integer: ... -3, -2, -1, 0, 1, 2, 3, ...
real: 1.2E5, . . .
string: <this is a string>
label: #A123, #anId, #Paragraph
the empty environment: Null
the empty list: NIL
id: (the null id), bold, thisIsAnId, Helvetica, . . .
(unless bound, taken to denote a primitive)
environments
unevaluated expressions
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). Perverse explicit bindings to outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.
Braces create a nested environment; if preceded by a dot, it is initialized to the
value of the name in the binding; if preceded by VAL, it is executed for value,
and the environment is then discarded.
Semantics of labels:
A label #id on a node in the dominant structure gives that node membership in
the set named by id. Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
*start*
08870 00024 USt
Date: 17 Aug. 1981 11:40 am PDT (Monday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 17
To: Mitchell, Horning
[Jim,
For discussion this afternoon.
Jim H.]
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
-------------------
In question:
treatment of primitive identifiers (vs. labels)
side effects in expressions (why only in conditionals?)
operations on nested environments: font←.(size←10)
merging environments
reduced transformations
Missing:
literal sequences
operations on sequences (subscripting)
-------------------
We envision an Interdoc script being processed in any manner equivalent to the
following:
Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.
BASIC INTERDOC
GRAMMAR
node ::= "{" labels expression* "}"
labels ::= [label* ":"]
label ::= "#" name
expression ::= [ lhs ] [ "'" | op ] rhs
rhs ::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
[ "ENV" ] "(" expression* ")"
literal ::= Boolean | integer | hexint | real | string | label
name ::= id ( "." id)*
id ::= (letter | "" ) ( letter | "" | digit )* -- "" is the null id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs ::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op ::= "." | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
"LT" | " LE" | "EQ" | "NE" | "GE" | "GT"
SYNTACTIC EXAMPLE:
{#examplenode:
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Null
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
R&T<>(E) = ""; E
R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))
R&T<literal>(E) = literal; E
R&T<id>(E) = if bindingOf(id, E)=None then id; E else R&T<valOf(id, E)>(E)
R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R&T<e2*>(T<e1>(E)) else R&T<e3*>(T<e1>(E))
R&T<"NOT" p>(E) = if R<p>(E) then False else True; E
R&T<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E ?]); E
op = "+" => R<p1>(E)+R<p2>(E); E
. . .
R&T<n m e>(E) = ""; bind(n, m, R<e>(E), E)
R&T<n m "'" e>(E) = ""; bind(n, m, e, E)
R&T<n m op e>(E) = ""; bind(n, m, R<n op e>(E), E)
R&T<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))
R&T<"(" e* ")">(E) = R<e*>(E); E
R&T<"ENV(" e* ")">(E) = [T<e*>(E) | "Outer" = Null]; T<e*>(E) ?
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, e, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, e, E) -- Assign at right level
True => [E | id m e]
bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]
assign(id, e, E) = locBinding(id, E) = ":" => [E | id ":" e]
bindingOf(id, E) = ":" => bind("Outer".id, ":=", e, E)
True => E -- Can only assign to vars
-------------------
Expressions in an Interdoc script may denote
literal values:
Boolean: (F, T)
integer: ... -3, -2, -1, 0, 1, 2, 3, ...
real: 1.2E5, . . .
string: <this is a string>
label: #A123, #anId, #Paragraph
the empty environment: Null
the empty list: NIL
id: (the null id), bold, thisIsAnId, Helvetica, . . .
(unless bound, taken to denote a primitive)
environments
unevaluated expressions
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.
Parentheses create a nested environment; if preceded by a dot, it is initialized to
the value of the name in the binding; if not preceded by ENV, it is executed for
value, and the environment is then discarded.
Semantics of labels:
A label #id on a node in the dominant structure gives that node membership in
the set named by id. Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
*start*
09077 00024 USt
Date: 17 Aug. 1981 7:05 pm PDT (Monday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 18
To: Mitchell, Horning
[Jim,
Jim H.]
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
-------------------
In question:
operations on nested environments: font←.(size←10)
merging environments
reduced transformations
structured labels
getEnv operator
labels outside dominant structure
non-printing nodes
Missing:
operations on sequences and environments (subscripting and enumeration)
substitution of for Null, Nil, etc.
-------------------
We envision an Interdoc script being processed in any manner equivalent to the
following:
Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.
BASIC INTERDOC
GRAMMAR
node ::= "{" expression* "}"
expression ::= [ lhs ] [ "'" | op ] | name ":!" -- :! for label declaration
rhs ::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
[ "ENV" ] "(" expression* ")" | label
literal ::= Boolean | integer | hexint | real | string | label
name ::= id ( "." id)*
id ::= (letter | "" ) ( letter | "" | digit )* -- "" is the null id
label ::= name "!"
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs ::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op ::= "." | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
"LT" | " LE" | "EQ" | "NE" | "GE" | "GT"
SYNTACTIC EXAMPLE:
{node.example!
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Null
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
R&T<>(E) = ""; E
R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))
R&T<literal>(E) = literal; E
R&T<id>(E) = if bindingOf(id, E)=None then id; E else R&T<valOf(id, E)>(E)
R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R&T<e2*>(E) else R&T<e3*>(E)
R&T<"NOT" p>(E) = if R<p>(E) then False else True; E
R&T<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E ?]); E
op = "+" => R<p1>(E)+R<p2>(E); E
. . .
R&T<n m e>(E) = ""; bind(n, m, R<e>(E), E)
R&T<n m "'" e>(E) = ""; bind(n, m, e, E)
R&T<n m op e>(E) = ""; bind(n, m, R<n op e>(E), E)
R&T<"{" e* "}">(E) = "{" R<Sub e*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))
R&T<"(" e* ")">(E) = R&T<e*>(E)
R&T<"ENV(" e* ")">(E) = [T<e*>(E) | "Outer" = Null]; T<e*>(E) ?
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, e, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, e, E) -- Assign at right level
True => [E | id m e]
bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]
assign(id, e, E) = locBinding(id, E) = ":" => [E | id ":" e]
bindingOf(id, E) = ":" => bind("Outer".id, ":=", e, E)
True => E -- Can only assign to vars
-------------------
Expressions in an Interdoc script may denote
literal values:
Boolean: (F, T)
integer: ... -3, -2, -1, 0, 1, 2, 3, ...
real: 1.2E5, . . .
string: <this is a string>
label: A123!, anId!, Paragraph!
the empty environment: Null
the empty list: NIL
id: (the null id), bold, thisIsAnId, Helvetica, . . .
(unless bound, taken to denote a primitive)
environments
unevaluated expressions
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.
Parentheses create a nested environment; if preceded by a dot, it is initialized to
the value of the name in the binding; ? if not preceded by ENV, it is
executed for value, and the environment is then discarded.
Semantics of labels:
A label #id on a node in the dominant structure gives that node membership in
the set named by id. Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
*start*
09690 00024 USt
Date: 19 Aug. 1981 9:58 am PDT (Wednesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 19
To: Mitchell, Horning, Lampson
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
-------------------
In question:
subscripting: sequences, yes; nodes, labels, sets, environments, no
evaluation of subscript expressions
merging environments
labels outside dominant structure
non-printing nodes
generalize apply to non-literal operands (distribute over sequences?)
IF, NOT, apply when arguments are not literals, partial evaluation
bind op in environment?
Missing:
operations on sequences and environments (subscripting and enumeration)
substitution of for Null, Nil, etc.
-------------------
We envision an Interdoc script being processed in any manner equivalent to the
following:
Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.
BASIC INTERDOC
GRAMMAR
node ::= "{" expression* "}"
expression ::= [ lhs ] [ "'" | op ] | name ":!" -- :! for label declaration
rhs ::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
[ "ENV" ] "(" expression* ")" | label
literal ::= Boolean | integer | hexint | real | string | label
name ::= id ( "." id)*
id ::= (letter | "" ) ( letter | "" | digit )* -- "" is the null id
label ::= name "!"
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs ::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op ::= "%" | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
"LT" | " LE" | "EQ" | "NE" | "GE" | "GT"
SYNTACTIC EXAMPLE:
{node.example!
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Null
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
R&T<>(E) = ""; E
R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))
R&T<literal>(E) = literal; E
R&T<id>(E) = if bindingOf(id, E)=None then id; E else R&T<valOf(id, E)>(E)
R&T<p "." id>(E) = R&T<valOf(id, R<p>(E))>(E)
-- Subscript should go here
R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R&T<e2*>(E) else R&T<e3*>(E)
R&T<"NOT" p>(E) = if R<p>(E) then False else True; E
R&T<p1 "'" p2>(E) = p2; E
R&T<p1 "%" p2>(E) = R<p2>([R<p1>(E) | "Outer" = E]);
[T<p2>([R<p1>(E) | "Outer" = E]) | "Outer" = Null]
R&T<p1 op p2>(E) = apply(R<p1>(E), op, R<p2>(E)); E
R&T<n m e>(E) = ""; bind(n, m, R<e>(E), E)
<n m op e> = <n m n op e> -- Syntactic sugar
R&T<"{" e* "}">(E) = "{" R<Sub e*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))
R&T<"(" e* ")">(E) = R&T<e*>(E)
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
apply(arg1, op, arg2) =
op = "+" => arg1+arg2
. . .
bind(id, m, e, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, e, E) -- Assign at right level
True => [E | id m e]
bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]
assign(id, e, E) =
locBinding(id, E) = ":" => [E | id ":" e]
bindingOf(id, E) = ":" =>
[E | "Outer" "=" bind(id, ":=", e, locVal("Outer", E))]
True => E -- Can only assign to vars
-------------------
Expressions in an Interdoc script may denote
literal values:
Boolean: (F, T)
integer: ... -3, -2, -1, 0, 1, 2, 3, ...
real: 1.2E5, . . .
string: <this is a string>
label: A123!, anId!, Paragraph!
the empty environment: Null
the empty list: NIL
id: (the null id), bold, thisIsAnId, Helvetica, . . .
(unless bound, taken to denote a primitive)
environments
unevaluated expressions
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.
Parentheses create a nested environment; if preceded by a dot, it is initialized to
the value of the name in the binding; ? if not preceded by ENV, it is
executed for value, and the environment is then discarded.
Semantics of labels:
A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members. Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
*start*
10516 00024 USt
Date: 19 Aug. 1981 6:56 pm PDT (Wednesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 20
To: Mitchell, Horning, Lampson
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
-------------------
In question:
merging environments (OPEN)
declaration of "main" labels
labels outside dominant structure
non-printing nodes (semicolon?)
bind op names in environment?
structured primitive names, naming authorities
how to syntactically distinguish operator application
infix vs. prefix for general operators (APL?)
treatment of unbound qualified names
as id vs. binary op vs. sign on numbers
Missing:
enumeration over sequences and environments
substitution of for Null, Nil, etc., as appropriate
-------------------
We envision an Interdoc script being processed in any manner equivalent to the
following:
Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.
BASIC INTERDOC
GRAMMAR
node ::= "{" expression* "}"
expression ::= [ lhs ] rhs | name ":!" -- :! for label declaration
rhs ::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
[ "ENV" ] "(" expression* ")"
literal ::= Boolean | integer | hexint | real | string | label | atom
name ::= id ( "." id)*
id ::= (letter | "" ) ( letter | "" | digit )* -- "" is the null id
label ::= name "!"
atom ::= "$" id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs ::= name bindingMode [ "'" | op ]
bindingMode ::= "=" | ":" | ":=" | "←"
op ::= "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
"LT" | " LE" | "EQ" | "NE" | "GE" | "GT" | "SUB"
SYNTACTIC EXAMPLE:
{node.example!
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Null
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
R&T<literal>(E) = literal; E
R&T<id>(E) = if bindingOf(id, E)=None then "$" id; E else R&T<valOf(id, E)>(E)
R&T<p "." id>(E) = R&T<valOf(id, R<p>(E))>(E)
R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R&T<e2*>(E) else R&T<e3*>(E)
R&T<"NOT" p>(E) = NOT R<p>(E); E
R&T<p1 "'" p2>(E) = p2; E
R&T<p1 op p2>(E) = apply(R<p1>(E), op, R<p2>(E)); E
R&T<n m e>(E) = ""; bind(n, m, R<e>(E), E)
<n m op e> = <n m n op e> -- Syntactic sugar
R&T<>(E) = ""; E
R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))
R&T<"(" e* ")">(E) = R&T<e*>(E)
R&T<"ENV(" e* ")">(E) = [T<e*>([Null | "Outer" = E]) | "Outer" = Null]; E
R&T<"{" e* "}">(E) = "{" R<Sub e*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))
R&T<name ":!">(E) =
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
apply(arg1, op, arg2) =
op = "+" => arg1+arg2
. . .
op = "SUB" => arg1[arg2]
bind(id, m, e, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, e, E) -- Assign at right level
True => [E | id m e]
bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]
assign(id, e, E) =
locBinding(id, E) = ":" => [E | id ":" e]
bindingOf(id, E) = ":" =>
[E | "Outer" "=" bind(id, ":=", e, locVal("Outer", E))]
True => E -- Can only assign to vars
-------------------
id: (the null id), bold, thisIsAnId, Helvetica, . . .
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
primitive properties and operators: $id
the empty environment: Null
the empty list: NIL
unevaluated expressions
environments
sets (sequences) of nodes with given labels
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.
Parentheses are used purely for grouping (e.g., creating a sequence value for a
binding). ENV is used to create a new environment, which behaves much like
a record.
Semantics of labels:
A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members. Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its properties.
(Variant: all properties on the path back to the root.)
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
*start*
03100 00024 USt
Date: 20 Aug. 1981 12:29 pm PDT (Thursday)
From: Horning.pa
Subject: Minutes from the Midnight Skull Session
To: Mitchell, Lampson
cc: Horning
We started from the following list.
In question:
merging environments (OPEN)
declaration of "main" labels
labels outside dominant structure
non-printing nodes (semicolon?)
bind op names in environment?
structured primitive names, naming authorities
how to syntactically distinguish operator application
infix vs. prefix for general operators (APL?)
treatment of unbound qualified names
as id vs. binary op vs. sign on numbers
Missing:
enumeration over sequences and environments
substitution of for Null, Nil, etc., as appropriate
-------------------
Here is my transcription/recollection of their resolution:
merging environments (OPEN)
REJECT
declaration of "main" labels
ACCEPT (must define semantics precisely)
Note that we decided to distinguish between properties (marks) and labels.
For properties, the expression preceding # must evaluate to an external
($, atomic) name.
The label preceding ! must be a literal name, which is not evaluated.
(Analogous to lhs name in a binding.)
labels outside dominant structure
Do NOT cause their nodes to be included in labelled set (for templates).
non-printing nodes (semicolon?)
ACCEPT. More properly HIDDEN nodes; syntax still debatable.
bind op names in environment? structured primitive names, naming authorities
EXTERNAL names will be structured, to allow for NA's.
Binding will allow the use of freely-chosen local synonyms.
These decisions apply equally to properties and operators.
how to syntactically distinguish operator application
Some small variant of Cambridge Polish.
BUT, lookup operator in environment before application.
infix vs. prefix for general operators (APL?)
All prefix (function application syntax).
treatment of unbound qualified names
Should result in Null or ERROR, not an external name (fix equations).
as id vs. binary op vs. sign on numbers
REJECT as id.
No more infix ops.
No ambiguity between sign and op in functional notation
enumeration over sequences and environments
REJECT
substitution of for Null, Nil, etc., as appropriate
REJECT
OTHER NOTES
A "main" label can only be an id, not a qualified name.
An atom (external name) can be qualified.
Operator ids come out of the grammar; we need to ensure that we define the
semantic basis for SEQ/LIST, IF, NOT, ENV, PROG, QUOTE.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
The "safety" rules for editing partially understood scripts should be restated
entirely in terms of local properties (which may have been implicitly acquired
through Sub or other invocations).
We should check our characterset for disjointness with Interpress.DoubtfulChars.
*start*
11578 00024 USt
Date: 20 Aug. 1981 5:40 pm PDT (Thursday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 22
To: Mitchell, Horning, Lampson
[Jim,
I think I've fixed the syntax problems we discussed. However, we should discuss
how palatable this syntax is--I made a number of relatively arbitrary decisions
for the sake of getting something definite as soon as possible.
Jim H.]
Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
distinguish syntactically between properties (marks) and labels.
only the "main" id of a label is declarable.
eliminate as an id character.
eliminate op ids from grammar.
restructure the grammar for "functional" notation for operators.
update semantic equations for new grammar, etc.
fix treatment of unbound qualified names (now produce Nil).
-------------------
Not done:
State the formal semantics of labels and properties.
-------------------
We envision an Interdoc script being input and viewed in any manner
equivalent to the following:
Parse the script, alternately
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items, and
- transforming the environment as indicated by the binding items.
BASIC INTERDOC
GRAMMAR
item ::= primary | binding | id ":!" -- :! for label declaration
primary ::= id | primary "." id | literal | application | property | node
id ::= letter ( letter | digit )*
literal ::= Boolean | integer | hexint | real | string | label | external
label ::= name "!"
name ::= id ( "." id)*
external ::= "$" name
application ::= ( op | primary ) "(" item* ( "," item* )* ")"
op ::= "'" | "+" | "" | "*" | "/"
property ::= primary "#"
node ::= "{" item* "}"
binding ::= name [ op ] bindingMode ( primary | "(" item* ")" )
bindingMode ::= "=" | ":" | ":=" | "←"
SYNTACTIC EXAMPLE:
{Book.example! -- Places this in Book and Book.example
ExampleParagraph -- Invokes a definition
$UniqueMark12356# -- Adds a nonstandard property
a:='(NOT(EQ(margins.left, 120))) margins.right←100 r=*(12.5, pt)
IF(a, margins.left+←5 margins.right←5, margins.left+←10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Nil = ""
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
R&T<id>(E) = R&T<valOf(id, E)>(E)
R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)
R&T<literal>(E) = literal; E
R&T<op "(" arg* ")">(E) = operate(op, arg*, E)
R&T<p "(" arg* ")">(E) = operate(R<p>(E), arg*, E)
R&T<p "#">(E) = R<p>(E) "#"; E
R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))
R&T<n m p>(E) = ""; bind(n, m, R<p>(E), E)
<n op m p> = <n m op "(" n "," p ")" > -- Syntactic sugar
<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")" >
R&T<"(" item* ")">(E) = R&T<item*>(E)
R&T<>(E) = ""; E
R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
T<item*>(T<item1>(E))
R&T<id ":!">(E) =
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
operate(op, arg*, E) =
op = $QUOTE => arg*; E
op = $HIDE => ""; E
op = $ENV => [T<arg*>([Null | "Outer" = E]) | "Outer" = Null]; E
True => apply(op, eval(arg*, E))
apply(op, val1, ... , valn) =
op = $IF => if val1.R then val2 else val3
op = "+" OR op = $PLUS => val1.R + ... + valn.R; E
...
op = $LIST => val1
op = $SUBSCRIPT => val1[val2.R] -- val1: sequence, val2.R: int
eval("", E) = Nil
eval(arg1 arg*, E) = R&T<arg1>(E), eval(arg*, E)
bind(id, m, val, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, val, E) -- Assign at right
level
True => [E | id m val]
bind(id.n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]
assign(id, val, E) =
locBinding(id, E) = ":" => [E | id ":" val]
bindingOf(id, E) = ":" =>
[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
True => E -- Can only assign to vars
-------------------
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
external names: $name
the empty environment: Null
the empty list: NIL
sequences of values
unevaluated expressions
environments
sets (sequences) of nodes with given labels
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). The (implicit) "outermost" environment binds
each id to the external name $id.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.
Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator. The operator $ENV is
used to create a new environment, which behaves much like a record.
Semantics of labels:
A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members. Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and don't
remove any of them OR if you understand ALL properties of ALL nodes in the
path back to the root.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
We should check our characterset for disjointness with Interpress.DoubtfulChars.
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
*start*
11654 00024 USt
Date: 21 Aug. 1981 6:58 pm PDT (Friday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 23
To: Mitchell, Horning, Lampson
Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
restore $val.
move quoting to rhs, allow quoted primaries without parentheses.
allow an op to be the rhs of a definition.
eliminate the functions operate, apply, eval by back substitution.
change semantics of () to allow "record" construction without $env.
-------------------
Not done:
State the formal semantics of labels and properties.
Sets of properties, etc. (Cf. Mitchell's Font example.)
Sort out "records" vs. quoted bindings.
-------------------
We envision an Interdoc script being input and viewed in any manner
equivalent to the following:
Parse the script, alternately
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items, and
- transforming the environment as indicated by the binding items.
BASIC INTERDOC
GRAMMAR
item ::= primary | binding
primary ::= id | primary "." id | literal | application | property | node
id ::= letter ( letter | digit )*
literal ::= Boolean | integer | hexint | real | string | label | external
label ::= name "!"
name ::= id ( "." id)*
external ::= "$" name | op
op ::= "+" | "" | "*" | "/"
application ::= primary "(" item* ( "," item* )* ")"
property ::= primary "#"
node ::= "{" item* "}"
binding ::= name [ op ] bindingMode rhs | id ":!"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs ::= [ "'" ] ( primary | "(" item* ")" )
SYNTACTIC EXAMPLE:
{Book.example! -- Places this in Book and Book.example
ExampleParagraph -- Invokes a definition
$UniqueMark12356# -- Adds a nonstandard property
a:='NOT(EQ(margins.left, 120)) margins.right←100 r=*(12.5, pt)
IF(a, margins.left+←5 margins.right←5, margins.left+←10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Nil = ""
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
R&T<id>(E) = R&T<valOf(id, E)>(E)
R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)
R&T<literal>(E) = literal; E
R&T<p "(" arg1 "," ... "," argn ")">(E) =
CASE R<p>(E) OF
$if => if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
"+" => R<arg1>(E) + ... + R<argn>(E); E
...
$val => R<arg1>(E); E
$list => R&T<arg1>(E)
$sub => R&T<arg1>(E)[R<arg2>(E)]
-- Subscript, arg1: sequence | node, arg2.R: int
$hide => ""; E
R&T<p "#">(E) = R<p>(E) "#"; E
R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))
R&T<>(E) = ""; E
R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
T<item*>(T<item1>(E))
R&T<n m rhs>(E) = ""; bind(n, m, R<rhs>(E), E)
<n op m p> = <n m op "(" n "," p ")" > -- Syntactic sugar
<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")" >
R&T<"'" p>(E) = p; E
R&T<"(" item* ")">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
-- Construct a "record" environment value
R&T<"'(" item* ")">(E) = item*; E
R&T<id ":!">(E) = ??; E
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, val, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, val, E) -- Assign at right
level
True => [E | id m val]
bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]
assign(id, val, E) =
locBinding(id, E) = ":" => [E | id ":" val]
bindingOf(id, E) = ":" =>
[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
True => E -- Can only assign to vars
-------------------
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
external names: $name
the empty environment: Null
the empty list: NIL
sequences of values
unevaluated expressions
environments
sets (sequences) of nodes with given labels
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). The (implicit) "outermost" environment binds
each id to the external name $id.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.
Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator. The operator $ENV is
used to create a new environment, which behaves much like a record.
Semantics of labels:
A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members. Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and don't
remove any of them OR if you understand ALL properties of ALL nodes in the
path back to the root.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
We should check our characterset for disjointness with Interpress.DoubtfulChars.
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
distinguish syntactically between properties (marks) and labels.
only the "main" id of a label is declarable.
eliminate as an id character.
eliminate op ids from grammar.
restructure the grammar for "functional" notation for operators.
update semantic equations for new grammar, etc.
fix treatment of unbound qualified names (now produce Nil).
*start*
12386 00024 USt
Date: 24 Aug. 1981 6:42 pm PDT (Monday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 24
To: Mitchell, Horning
Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
"It's OK to edit a node if you understand ALL of its (local) properties, and
either don't remove any of them or also understand ALL properties
of its parent."
"Put in contents if: Put in environment if: ..."
Add connection syntax to syntactically rule out a+←'b.
-------------------
Not done:
Determine the (informal) semantics of labels.
State the formal semantics of labels and properties.
Sets of properties, etc. (Cf. Mitchell's Font example.)
SET/LIST operators ($append $union ?)
notation for list constants
Sort out "records" vs. quoted bindings.
Some syntactic marker to replace $env.
Consider style for use of temporary local definitions.
-------------------
We envision an Interdoc script being input and viewed in any manner
equivalent to the following:
Parse the script, alternately
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items, and
- transforming the environment as indicated by the binding items.
BASIC INTERDOC
GRAMMAR
item ::= primary | binding
primary ::= id | primary "." id | literal | application | property | node
id ::= letter ( letter | digit )*
literal ::= Boolean | integer | hexint | real | string | label | external
label ::= name "!"
name ::= id ( "." id)*
external ::= "$" name | op
op ::= "+" | "" | "*" | "/"
application ::= primary "(" item* ( "," item* )* ")"
property ::= primary "#"
node ::= "{" item* "}"
binding ::= name connection rhs | id ":!"
connection ::= bindingMode | op bindingMode | bindingMode "'"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs ::= primary | "(" item* ")"
SYNTACTIC EXAMPLE:
{Book.example! -- Places this in Book and Book.example
ExampleParagraph -- Invokes a definition
$UniqueMark12356# -- Adds a property
a:='NOT(EQ(margins.left, 120)) margins.right←100 r=*(12.5, pt)
IF(a, margins.left+←5 margins.right←5, margins.left+←10)
<text for this node>
}
SEMANTICS
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Nil = ""
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
R&T<id>(E) = R&T<valOf(id, E)>(E)
R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)
R&T<literal>(E) = literal; E
R&T<p "(" arg1 "," ... "," argn ")">(E) =
CASE R<p>(E) OF
$if => if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
"+" => R<arg1>(E) + ... + R<argn>(E); E
...
$val => R<arg1>(E); E
$list => R&T<arg1>(E)
$sub => R&T<arg1>(E)[R<arg2>(E)]
-- Subscript, arg1: sequence | node, arg2.R: int
$hide => ""; E
R&T<p "#">(E) = R<p>(E) "#"; E
R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))
R&T<>(E) = ""; E
R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
T<item*>(T<item1>(E))
R&T<n m rhs>(E) = ""; bind(n, m, R<rhs>(E), E)
<n op m p> = <n m op "(" n "," p ")" > -- Syntactic sugar
<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")" >
R&T<"'" p>(E) = p; E
R&T<"(" item* ")">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
-- Construct a "record" environment value
R&T<"'(" item* ")">(E) = item*; E
R&T<id ":!">(E) = ??; E
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, val, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, val, E) -- Assign at right
level
True => [E | id m val]
bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]
assign(id, val, E) =
locBinding(id, E) = ":" => [E | id ":" val]
bindingOf(id, E) = ":" =>
[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
True => E -- Can only assign to vars
-------------------
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
external names: $name
the empty environment: Null
the empty list: NIL
sequences of values
unevaluated expressions
environments
sets (sequences) of nodes with given labels
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). The (implicit) "outermost" environment binds
each id to the external name $id.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.
Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator. The operator $ENV is
used to create a new environment, which behaves much like a record.
Semantics of labels:
A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members. Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
We should check our characterset for disjointness with Interpress.DoubtfulChars.
Put in contents if: Put in environment if:
effect is local to node has scope
is directly edited is only indirectly edited
is to be bound locally needs delayed or global binding
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
distinguish syntactically between properties (marks) and labels.
only the "main" id of a label is declarable.
eliminate as an id character.
eliminate op ids from grammar.
restructure the grammar for "functional" notation for operators.
update semantic equations for new grammar, etc.
fix treatment of unbound qualified names (now produce Nil).
Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
restore $val.
move quoting to rhs, allow quoted primaries without parentheses.
allow an op to be the rhs of a definition.
eliminate the functions operate, apply, eval by back substitution.
change semantics of () to allow "record" construction without $env.
*start*
01734 00024 USt
Date: 25 Aug. 1981 9:28 am PDT (Tuesday)
From: Horning.pa
Subject: Interdoc Label Thoughts
To: Mitchell
cc: Lampson, Horning
Overnight, I've come to the conclusion that the reason we were having trouble
yesterday with the semantics of labels is that we were trying to attach too much
semantics to them--much as if we had gone beyond numbers to specify the use
of numbers in spline curves.
I propose that we go back to something much closer to Brian Reid's "link and
mark" semantics. I.e., the "meaning" of a reference, mark pair is simply: "record
the existence of a directed arc from here to there," without saying what an editor
would use such arcs for. They are simply the escape mechanism from a strict
tree structure.
I believe that we should keep the present "sequence" semantics for multiple
nodes marked with the same label (i.e., there is a directed arc from every
reference to a label to each node marked with that label.)
I also accept your stricture that no environment information should flow along
these arcs, so that we can simply ignore them when determining the
Reduced&Transformed values of scripts and environments.
We should continue to declare the scope of main labels, for all the previously
discussed reasons, but we should not try to use the environment to record label
values (as (un)evaluated nodes or whatever). We should thus syntactically
distinguish a label reference from a name invocation.
Except for the syntactic ambiguity, it would be tempting to adopt some
"symmetric" notation for references and marks, e.g., name> and >name .
However, I don't doubt our ability to find a satisfactory syntax once we have
agreed on the semantics.
Comments, improvements?
Jim H.
*start*
13389 00024 USt
Date: 25 Aug. 1981 11:34 am PDT (Tuesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 25
To: Mitchell, Horning
Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
Syntactically separate label references and name invocation.
Put in distinct syntax in rhs for environment construction.
Informal semantics of labels.
( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.
-------------------
Open questions:
Use of ( ... ) vs. [ ... ] (especially in application).
Sort out "records" vs. quoted bindings.
Sets of properties, etc. (Cf. Mitchell's Font example.)
SET/LIST operators ($append $union ?)
semantics of $list
Non-uniform semantics of quote.
Subtle distinctions between quoted and unquoted ( ... )
Semantics of quoted [ ... ] ?
We should check our characterset for disjointness with
Interpress.DoubtfulChars.
Not done:
State the formal semantics of labels and properties.
Consider style for use of temporary local definitions.
-------------------
We envision an Interdoc script being input and viewed in any manner
equivalent to the following:
Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items, and
- recording the links indicated by label references and marks.
BASIC INTERDOC
SYNTACTIC EXAMPLE:
{Book.example! -- Links to this from Book@ and Book.example@
ExampleParagraph -- Invokes a definition
$UniqueMark12356# -- Adds a property
a:='NOT[EQ[margins.left, 120]] margins.right←100 r=*[12.5, pt]
IF[a, margins.left+←5 margins.right←5, margins.left+←10]
<text for this node>
}
GRAMMAR
item ::= primary | binding | label
primary ::= literal | invocation | application | property | node | "(" item* ")"
literal ::= Boolean | integer | hexint | real | string | external | op
external ::= "$" name
name ::= id ( "." id)*
id ::= letter ( letter | digit )*
op ::= "+" | "" | "*" | "/"
invocation ::= id | primary "." id
application ::= primary "[" item* ( "," item* )* "]"
property ::= primary "#"
node ::= "{" item* "}"
binding ::= name connection rhs
connection ::= bindingMode | op bindingMode | bindingMode "'"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs ::= primary | "[" item* "]"
label ::= id ":!" | name "!" | name "@"
SEMANTICS
R&T<id>(E) = R&T<valOf(id, E)>(E)
R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)
R&T<literal>(E) = literal; E
R&T<p "(" arg1 "," ... "," argn ")">(E) =
CASE R<p>(E) OF
$if => if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
"+" => R<arg1>(E) + ... + R<argn>(E); E
...
$val => R<arg1>(E); E
$list => R&T<arg1>(E)
$subscript => R&T<arg1>(E)[R<arg2>(E)]
-- arg1: sequence | node, arg2.R: int
$hide => "" ; E
R&T<p "#">(E) = R<p>(E) "#"; E
R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))
R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; E -- List constructor
R&T<"[" item* "]">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
-- Construct a "record" environment value
R&T<>(E) = ""; E
R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
T<item*>(T<item1>(E))
R&T<n m rhs>(E) = "" ; bind(n, m, R<rhs>(E), E)
<n op m rhs> = <n m op "(" n "," rhs ")"> -- Syntactic sugar
<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")">
R&T<"'" p>(E) = p; E
R&T<"'(" item* ")">(E) = item*; E ??
R&T<"'[" item* "]">(E) = ??
R&T<label>(E) = label; E
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, val, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, val, E) -- Assign at right
level
True => [E | id m val]
bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]
assign(id, val, E) =
locBinding(id, E) = ":" => [E | id ":" val]
bindingOf(id, E) = ":" =>
[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
True => E -- Can only assign to vars
R denotes the expression reduction function:
R: expression > ( environment > expression )
T denotes the environment transformation function:
T: expression > ( environment > environment )
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T: ( expression, environment ) > ( expression, environment )
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Nil = ""
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
-------------------
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
external names: $name
the empty environment: Null
the empty list: NIL
sequences of values
unevaluated expressions
environments
sets (sequences) of nodes with given labels
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). The (implicit) "outermost" environment binds
each id to the external name $id.
The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.
Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator. The operator $ENV is
used to create a new environment, which behaves much like a record.
Semantics of labels:
A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source." The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets. The label represents a set of directed arcs, one from each of
its sources to each of its destinations. Multiple marks make the node a target of
in multiple labels. A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
Put in contents if: Put in environment if:
effect is local to node has scope
is directly edited is only indirectly edited
is to be bound locally needs delayed or global binding
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
distinguish syntactically between properties (marks) and labels.
only the "main" id of a label is declarable.
eliminate as an id character.
eliminate op ids from grammar.
restructure the grammar for "functional" notation for operators.
update semantic equations for new grammar, etc.
fix treatment of unbound qualified names (now produce Nil).
Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
restore $val.
move quoting to rhs, allow quoted primaries without parentheses.
allow an op to be the rhs of a definition.
eliminate the functions operate, apply, eval by back substitution.
change semantics of () to allow "record" construction without $env.
Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
"It's OK to edit a node if you understand ALL of its (local) properties, and
either don't remove any of them or also understand ALL properties
of its parent."
"Put in contents if: Put in environment if: ..."
Add connection syntax to syntactically rule out a+←'b.
*start*
15765 00024 USt
Date: 27 Aug. 1981 7:51 pm PDT (Thursday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 29
To: Mitchell, Horning
Edited by Jim H. on 26 Aug. 1981 7:11 pm PDT (Wednesday).
' ... ' in rhs
Restore infix operators, right to left.
Modify syntax to rule out more nonsense, add semantically meaningful
nonterminals.
Introduce special syntax for selections.
Eliminate side-effects for $subscript (actually, all applications).
Add application of defined functions.
Note that Value[ ... ] allows use of temporary (hidden) local definitions,
Nil[ ... ] allows placement of hidden nodes.
( ... ) creates list/sequence values (without hiding bindings).
Tidy up definition of assign, using bind("Outer." ...).
Introduce value nonterminal into grammar (rule out more nonsense).
rhs ::= ... | "[" [ lookup ] "|" binding* "]" .
Remove $ name from literal (to lookup).
Change nonterminal lookup to invocation.
-------------------
Open questions:
We should rethink our character assignments.
check our characterset for disjointness with
Interpress.DoubtfulChars.
enlarge op with a few more single-character operators? %, &, \
Possible node operators (purely in semantic domain, not operators?).
$properties: node, environment > sequence -- All #'s
$marks: node, environment > sequence -- All !'s
$references: node, environment > sequence -- All @'s
$contents: node, environment > sequence -- The rest (fringe)
Consider restricting $subscript just to sequences, not nodes.
Extend selection to CASE?
Not done:
-------------------
We envision an Interdoc script being input and viewed in any manner
equivalent to the following:
Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items (recording the
components of each node's environment in a form convenient to the editor), and
- recording the links indicated by label references and marks.
BASIC INTERDOC
SYNTACTIC EXAMPLE:
{Book.example! -- Links to this from Book@ and Book.example@
ExampleParagraph -- Invokes a definition
$UniqueMark12356# -- Adds a property
Font←[Font | size←10*pt face←bold]
a:='NOT[EQ[margins.left 120]]' margins.right←100 r=12.5*pt
(a | margins.left←+5 margins.right←5 | margins.left+←10) -- conditional: Algol68
<text for this node>
}
GRAMMAR
item ::= value | binding | property | label
value ::= term | node | sequence
term ::= primary | primary op term -- Ops apply right to left
primary ::= literal | invocation | application | selection
literal ::= Boolean | integer | hexint | real | string | op
op ::= "+" | "" | "*" | "/"
invocation ::= name | external
name ::= id ( "." id )*
id ::= letter ( letter | digit )*
external ::= "$" name
application ::= invocation "[" value* "]"
selection ::= "(" term "|" item* "|" item* ")" -- Algol 68 style conditional
node ::= "{" item* "}"
sequence ::= "(" item* ")"
binding ::= name bindingMode rhs
bindingMode ::= "=" | ":" | ":=" | "←"
rhs ::= value | op term | "'" item* "'" | "[" [ invocation ] "|" binding* "]"
property ::= invocation "#"
label ::= id ":!" | name "!" | name "@"
SEMANTICS
R: expression > environment > expression -- Reduction
T: expression > environment > environment -- Transformation
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
[Unless explicitly given below, T<construct>(E) = E.]
R<primary op term>(E) = R<primary>(E) op R<term>(E)
R<literal>(E) = literal
R&T<id>(E) = R&T<valOf(id, E)>(E)
R&T<name "." id>(E) = R&T<valOf(id, R<name>(E))>(E)
R<"$" name>(E) = "$" name
R<invocation "[" value1 ... valuek "]">(E) =
CASE R<invocation>(E) OF
"$equal" => R<value1>(E) = R<value2>(E)
"$greater" => R<value1>(E) > R<value2>(E)
. . .
"$subscript" => R<value1>(E)[R<value2>(E)]
-- value1: sequence | node, value2: int
ELSE => R<invocation>([E | "Value" "=" R<value1 ... valuek>(E)])
R&T<"(" term "|" item1* "|" item2* ")">(E) =
if R<term>(E) then R&T<item1*>(E) else R&T<item2*>(E)
R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" "=" E]) "}";
locVal("Outer", (T<"Sub" item*>([Null | "Outer" "=" E])))
R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; T<item*>(E)
R<>(E) = Nil
R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
T<item*>(T<item1>(E))
R&T<n m rhs>(E) = Nil; bind(n, m, R<rhs>(E), E)
<n m op term> = <n m n op term> -- Syntactic sugar
R<"'" item* "'">(E) = item* --Usable only in rhs of binding
R<"[" invocation "|" binding* "]">(E) =
[T<binding*>([R<invocation>(E) | "Outer" "=" E]) | "Outer" "=" Null]
R<"[|" binding* "]">(E) = [T<binding*>([Null | "Outer" "=" E]) | "Outer" "=" Null]
R<invocation "#">(E) = R<invocation>(E) "#"
R<label>(E) = label
-- Subsidiary definitions for R&T
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, val, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, val, E) -- Assign at right level
True => [E | id m val]
bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, valOf(id, E))]
assign(id, val, E) =
locBinding(id, E) = ":" => [E | id ":" val]
bindingOf(id, E) = ":" => bind("Outer." id, ":=", val, E)
True => E -- Can only assign to vars
NOTATION FOR ENVIRONMENTS
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Nil = ""
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
SEMANTICS OF PROPERTIES, MARKS, REFERENCES (uses)
P: expression > environment > expression -- Sequence of properties
M: expression > environment > expression -- Sequence of marks
U: expression > environment > expression -- Sequence of references
P&M&U<e>(E) denotes the triple P<e>(E); M<e>(E); U<e>(E)
[These functions all return the empty list, Nil, except as specified below.]
P<invocation "#">(E) = R<invocation>(E)
M<name "!">(E) = prefixes(name)
U<name "@">(E) = prefixes(name)
P&M&U<invocation>(E) = P&M&U<R<invocation>(E)>(E)
P&M&U<"(" item* ")">(E) = P&M&U<item*>(E)
P&M&U<item1 item*>(E) = P<item1>(E) P<item*>(T<item1>(E));
M<item1>(E) M<item*>(T<item1>(E));
U<item1>(E) U<item*>(T<item1>(E))
prefixes(id) = id
prefixes(name "." id) = name "." id prefixes(name)
VALUE SPACE
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
external names: $name
the empty environment: Null
the empty list: NIL
sequences of values
unevaluated expressions
environments
DISCUSSION
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). The (implicit) "outermost" environment binds
each id to the external name $id.
Nodes are delimited by brackets. The contents of each node are implicitly
prefixed by Sub, which will generally be bound in the containing environment
to a quoted expression performing an environment transformation, and perhaps
supplying some properties and marks.
Parentheses are used to denote a sequence value. to delimit the argument list of
an operator. Square brackets are used to denote a new environment value, which
behaves much like a record.
Semantics of labels:
A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source." The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets. The label represents a set of directed arcs, one from each of
its sources to each of its destinations. Multiple marks make the node a target of
in multiple labels. A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
Put in contents if: Put in environment if:
effect is local to node has scope
is directly edited is only indirectly edited
is to be bound locally needs delayed or global binding
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
CONSCIOUSLY POSTPONED
Lambda expressions.
Sets of properties, etc. (Cf. Mitchell's Font example.)
SET/LIST operators ($append $union ?)
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
distinguish syntactically between properties (marks) and labels.
only the "main" id of a label is declarable.
eliminate as an id character.
eliminate op ids from grammar.
restructure the grammar for "functional" notation for operators.
update semantic equations for new grammar, etc.
fix treatment of unbound qualified names (now produce Nil).
Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
restore $val.
move quoting to rhs, allow quoted primaries without parentheses.
allow an op to be the rhs of a definition.
eliminate the functions operate, apply, eval by back substitution.
change semantics of () to allow "record" construction without $env.
Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
"It's OK to edit a node if you understand ALL of its (local) properties, and
either don't remove any of them or also understand ALL properties
of its parent."
"Put in contents if: Put in environment if: ..."
Add connection syntax to syntactically rule out a+←'b.
Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
Syntactically separate label references and name invocation.
Put in distinct syntax in rhs for environment construction.
Informal semantics of labels.
( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.
Edited by Jim H. on 25 Aug. 1981 4:08 pm PDT (Tuesday).
Add sequence as a nonterminal to the syntax.
State the formal semantics of labels and properties.
Reorder presentation (hopefully to improve readability).
*start*
15962 00024 USt
Date: 28 Aug. 1981 2:09 pm PDT (Friday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 30
To: Mitchell, Horning, Guttag
Edited by Jim H. on 28 Aug. 1981 2:08 pm PDT (Friday).
[Changes since 25 August]
' ... ' in rhs
Restore infix operators, right to left.
Modify syntax to rule out more nonsense, add semantically meaningful
nonterminals.
Introduce special syntax for selections.
Eliminate side-effects for $subscript (actually, all applications).
Add application of defined functions.
Note that Value[ ... ] allows use of temporary (hidden) local definitions,
Nil[ ... ] allows placement of hidden nodes.
( ... ) creates list/sequence values (without hiding bindings).
Tidy up definition of assign, using bind("Outer." ...).
Introduce value nonterminal into grammar (rule out more nonsense).
rhs ::= ... | "[" [ lookup ] "|" binding* "]" .
Change nonterminal lookup to invocation.
Remove $ name from literal (to invocation).
Add node operators:
$properties -- All #'s
$marks -- All !'s
$references -- All @'s
$contents -- The rest (fringe)
Restrict $subscript just to sequences, not nodes.
-------------------
Open questions:
We should rethink our character assignments.
check our characterset for disjointness with
Interpress.DoubtfulChars.
enlarge op with a few more single-character operators? %, &, \
Possible node operators (purely in semantic domain, not operators?).
Extend selection to CASE?
Not done:
-------------------
We envision an Interdoc script being input and viewed in any manner
equivalent to the following:
Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items (recording the
components of each node's environment in a form convenient to the editor), and
- recording the links indicated by label references and marks.
BASIC INTERDOC
SYNTACTIC EXAMPLE:
{Book.example! -- Links to this from Book@ and Book.example@
ExampleParagraph -- Invokes a definition
$UniqueMark12356# -- Adds a property
Font←[Font | size←10*pt face←bold]
factorial←'(LT[Value 2] | 1 | Value* factorial(Value-1))'
a:='NOT[EQ[margins.left factorial[5]]]' margins.right←100 r=12.5*pt
(a | margins.left←+5 margins.right←5 | margins.left+←10) -- conditional: Algol68
<text for this node>
}
GRAMMAR
item ::= value | binding | property | label
value ::= term | node | sequence
term ::= primary | primary op term -- Ops apply right to left
primary ::= literal | invocation | application | selection
literal ::= Boolean | integer | hexint | real | string | op
op ::= "+" | "" | "*" | "/"
invocation ::= name | external
name ::= id ( "." id )*
id ::= letter ( letter | digit )*
external ::= "$" name
application ::= invocation "[" value* "]"
selection ::= "(" term "|" item* "|" item* ")" -- Algol 68 style conditional
node ::= "{" item* "}"
sequence ::= "(" item* ")"
binding ::= name bindingMode rhs
bindingMode ::= "=" | ":" | ":=" | "←"
rhs ::= value | op term | "'" item* "'" | "[" [ invocation ] "|" binding* "]"
property ::= invocation "#"
label ::= id ":!" | name "!" | name "@"
SEMANTICS
R: expression > environment > expression -- Reduction
T: expression > environment > environment -- Transformation
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
[Unless explicitly given below, T<construct>(E) = E.]
R<primary op term>(E) = R<primary>(E) op R<term>(E)
R<literal>(E) = literal
R&T<id>(E) = R&T<valOf(id, E)>(E)
R&T<name "." id>(E) = R&T<valOf(id, R<name>(E))>(E)
R<"$" name>(E) = "$" name
R<invocation "[" value* "]">(E) = apply(invocation, R<value*>(E), E)
apply(invocation, value*, E) =
CASE R<invocation>(E) OF
"$equal" => value1 = value2
"$greater" => value1 > value2
. . .
"$subscript" => value1[value2] -- value1: sequence, value2: int
"$contents" => C<value*>
"$properties" => P<value*>(E)
"$marks" => M<value*>(E)
"$references" => U<value*>(E)
ELSE => R<invocation>([E | "Value" "=" value*])
R&T<"(" term "|" item1* "|" item2* ")">(E) =
if R<term>(E) then R&T<item1*>(E) else R&T<item2*>(E)
R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" "=" E]) "}";
locVal("Outer", (T<"Sub" item*>([Null | "Outer" "=" E])))
R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; T<item*>(E)
R<>(E) = Nil
R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
T<item*>(T<item1>(E))
R&T<n m rhs>(E) = Nil; bind(n, m, R<rhs>(E), E)
<n m op term> = <n m n op term> -- Syntactic sugar
R<"'" item* "'">(E) = item* --Usable only in rhs of binding
R<"[" invocation "|" binding* "]">(E) =
[T<binding*>([R<invocation>(E) | "Outer" "=" E]) | "Outer" "=" Null]
R<"[|" binding* "]">(E) = [T<binding*>([Null | "Outer" "=" E]) | "Outer" "=" Null]
R<invocation "#">(E) = R<invocation>(E) "#"
R<label>(E) = label
-- Subsidiary definitions for R&T
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, val, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, val, E) -- Assign at right level
True => [E | id m val]
bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, valOf(id, E))]
assign(id, val, E) =
locBinding(id, E) = ":" => [E | id ":" val]
bindingOf(id, E) = ":" => bind("Outer." id, ":=", val, E)
True => E -- Can only assign to vars
NOTATION FOR ENVIRONMENTS
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Nil = ""
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
SEMANTICS OF PROPERTIES, MARKS, REFERENCES (uses), CONTENTS
P: expression > environment > expression -- Sequence of properties
M: expression > environment > expression -- Sequence of marks
U: expression > environment > expression -- Sequence of references
P&M&U<e>(E) denotes the triple P<e>(E); M<e>(E); U<e>(E)
[These functions all return the empty list, Nil, except as specified below.]
P<invocation "#">(E) = R<invocation>(E)
M<name "!">(E) = prefixes(name)
U<name "@">(E) = prefixes(name)
P&M&U<invocation>(E) = P&M&U<R<invocation>(E)>(E)
P&M&U<"(" item* ")">(E) = P&M&U<item*>(E)
P&M&U<item1 item*>(E) = P<item1>(E) P<item*>(T<item1>(E));
M<item1>(E) M<item*>(T<item1>(E));
U<item1>(E) U<item*>(T<item1>(E))
prefixes(id) = id
prefixes(name "." id) = name "." id prefixes(name)
C: expression > expression
C<value> = value
C<property> = C<label> = Nil
VALUE SPACE
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
external names: $name
the empty environment: Null
the empty list: NIL
sequences of values
unevaluated expressions
environments
DISCUSSION
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). The (implicit) "outermost" environment binds
each id to the external name $id.
Nodes are delimited by brackets. The contents of each node are implicitly
prefixed by Sub, which will generally be bound in the containing environment
to a quoted expression performing an environment transformation, and perhaps
supplying some properties and marks.
Parentheses are used to denote a sequence value. to delimit the argument list of
an operator. Square brackets are used to denote a new environment value, which
behaves much like a record.
Semantics of labels:
A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source." The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets. The label represents a set of directed arcs, one from each of
its sources to each of its destinations. Multiple marks make the node a target of
in multiple labels. A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
Put in contents if: Put in environment if:
effect is local to node has scope
is directly edited is only indirectly edited
is to be bound locally needs delayed or global binding
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
CONSCIOUSLY POSTPONED
Lambda expressions.
Sets of properties, etc. (Cf. Mitchell's Font example.)
SET/LIST operators ($append $union ?)
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
distinguish syntactically between properties (marks) and labels.
only the "main" id of a label is declarable.
eliminate as an id character.
eliminate op ids from grammar.
restructure the grammar for "functional" notation for operators.
update semantic equations for new grammar, etc.
fix treatment of unbound qualified names (now produce Nil).
Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
restore $val.
move quoting to rhs, allow quoted primaries without parentheses.
allow an op to be the rhs of a definition.
eliminate the functions operate, apply, eval by back substitution.
change semantics of () to allow "record" construction without $env.
Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
"It's OK to edit a node if you understand ALL of its (local) properties, and
either don't remove any of them or also understand ALL properties
of its parent."
"Put in contents if: Put in environment if: ..."
Add connection syntax to syntactically rule out a+←'b.
Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
Syntactically separate label references and name invocation.
Put in distinct syntax in rhs for environment construction.
Informal semantics of labels.
( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.
Edited by Jim H. on 25 Aug. 1981 4:08 pm PDT (Tuesday).
Add sequence as a nonterminal to the syntax.
State the formal semantics of labels and properties.
Reorder presentation (hopefully to improve readability).
*start*
14892 00024 USt
Date: 25 Aug. 1981 4:08 pm PDT (Tuesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 26
To: Mitchell, Horning
Edited by Jim H. on 25 Aug. 1981 4:08 pm PDT (Tuesday).
Add sequence as a nonterminal to the syntax.
State the formal semantics of labels and properties.
Reorder presentation (hopefully to improve readability).
-------------------
Open questions:
Use of ( ... ) vs. [ ... ] (especially in application).
Sort out "records" vs. quoted bindings.
Sets of properties, etc. (Cf. Mitchell's Font example.)
SET/LIST operators ($append $union ?)
semantics of $list
Non-uniform semantics of quote.
Subtle distinctions between quoted and unquoted ( ... )
Semantics of quoted [ ... ] ?
We should rethink our character assignments.
check our characterset for disjointness with
Interpress.DoubtfulChars.
use of various bracket pairs.
enlarge op with a few more single-character operators?
Possible node operators (purely in semantic domain, not operators?).
$properties: node, environment > sequence -- All #'s
$marks: node, environment > sequence -- All !'s
$references: node, environment > sequence -- All @'s
$contents: node, environment > sequence -- The rest (fringe)
Consider restricting $subscript just to sequences, not nodes.
Note that $if is the only operator that requires multiple sequences (hence
commas, different syntax for argument list and sequence). Worth
going back to treating it as a special case?
Not done:
Consider style for use of temporary local definitions.
-------------------
We envision an Interdoc script being input and viewed in any manner
equivalent to the following:
Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items, and
- recording the links indicated by label references and marks.
BASIC INTERDOC
SYNTACTIC EXAMPLE:
{Book.example! -- Links to this from Book@ and Book.example@
ExampleParagraph -- Invokes a definition
$UniqueMark12356# -- Adds a property
a:='NOT[EQ[margins.left, 120]] margins.right←100 r=*[12.5, pt]
IF[a, margins.left+←5 margins.right←5, margins.left+←10]
<text for this node>
}
GRAMMAR
item ::= primary | binding | label
primary ::= literal | invocation | application | property | node | sequence
literal ::= Boolean | integer | hexint | real | string | external | op
external ::= "$" name
name ::= id ( "." id)*
id ::= letter ( letter | digit )*
op ::= "+" | "" | "*" | "/"
invocation ::= id | primary "." id
application ::= primary "[" item* ( "," item* )* "]"
property ::= primary "#"
node ::= "{" item* "}"
sequence ::= "(" item* ")"
binding ::= name connection rhs
connection ::= bindingMode | op bindingMode | bindingMode "'"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs ::= primary | "[" item* "]"
label ::= id ":!" | name "!" | name "@"
SEMANTICS
R: expression > environment > expression -- Reduction
T: expression > environment > environment -- Transformation
R&T<e>(E) denotes the pair R<e>(E); T<e>(E)
R&T<literal>(E) = literal; E
R&T<id>(E) = R&T<valOf(id, E)>(E)
R&T<p "." id>(E) = R&T<valOf(id, R<p>(E))>(E)
R&T<p "[" arg1 "," ... "," argn "]">(E) =
CASE R<p>(E) OF
$if => if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
"+" => R<arg1>(E) + ... + R<argn>(E); E
...
$val => R<arg1>(E); E
$list => R&T<arg1>(E)
$subscript => R&T<arg1>(E)[R<arg2>(E)]
-- arg1: sequence | node, arg2.R: int
$hide => "" ; E
R&T<p "#">(E) = R<p>(E) "#"; E
R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))
R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; E -- List constructor
R&T<"[" item* "]">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
-- Construct a "record" environment value
R&T<>(E) = ""; E
R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
T<item*>(T<item1>(E))
R&T<n m rhs>(E) = "" ; bind(n, m, R<rhs>(E), E)
<n op m rhs> = <n m op "[" n "," rhs "]"> -- Syntactic sugar
<n op m "(" arg* ")"> = <n m op "[" n "," arg* "]"> ??
R&T<"'" p>(E) = p; E
R&T<"'(" item* ")">(E) = item*; E ??
R&T<"'[" item* "]">(E) = ??
R&T<label>(E) = label; E
bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding
valOf(id, E) = locVal(id, whereBound(id, E)) -- Gets innermost value
whereBound(id, E) = -- Finds innermost binding
locBinding(id, E) ~= None => E
locBinding("Outer", E) ~= None => whereBound(id, locVal("Outer", E))
True => Null
bind(id, m, val, E) =
bindingOf(id, E) = "=" => E -- Can't rebind constants
m = ":=" => assign(id, val, E) -- Assign at right level
True => [E | id m val]
bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]
assign(id, val, E) =
locBinding(id, E) = ":" => [E | id ":" val]
bindingOf(id, E) = ":" =>
[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
True => E -- Can only assign to vars
SEMANTICS OF PROPERTIES, MARKS, REFERENCES (uses)
P: expression > environment > expression -- Sequence of properties
M: expression > environment > expression -- Sequence of marks
U: expression > environment > expression -- Sequence of references
P&M&U<e>(E) denotes the triple P<e>(E); M<e>(E); U<e>(E)
These functions all return the empty list, Nil, except as specified below:
P<p "#">(E) = R<p>(E)
M<name "!">(E) = prefixes(name)
U<name "@">(E) = prefixes(name)
P&M&U<invocation>(E) = P&M&U<R<invocation>(E)>(E)
P&M&U<"(" item* ")">(E) = P&M&U<item*>(E)
P&M&U<item1 item*>(E) = P<item1>(E) P<item*>(T<item1>(E));
M<item1>(E) M<item*>(T<item1>(E));
U<item1>(E) U<item*>(T<item1>(E))
prefixes(id) = id
prefixes(name "." id) = name "." id prefixes(name)
NOTATION FOR ENVIRONMENTS
Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
Null denotes the "empty" environment
[E | id m e] means "E with id mode m bound to e"
locBinding(id, E) denotes the binding mode of id in E
locBinding(id, Null) = None
locBinding(id, [E | id' m e]) =
if id=id' then m else locBinding(id, E)
locVal(id, E) denotes the value locally bound to id in E
locVal(id, Null) = Nil = ""
locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)
VALUE SPACE
Expressions in an Interdoc script may denote
literal values:
Booleans: (F, T)
integers: ... -3, -2, -1, 0, 1, 2, 3, ...
reals: 1.2E5, . . .
strings: <this is a string>
labels: A123!, anId!, Paragraph.Example!
external names: $name
the empty environment: Null
the empty list: NIL
sequences of values
unevaluated expressions
environments
DISCUSSION
How semantics are associated with an entire document:
Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer). Most bindings take place directly in E. However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var). Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.
When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer"). The (implicit) "outermost" environment binds
each id to the external name $id.
Nodes are delimited by brackets. The contents of each node are implicitly
prefixed by Sub, which will generally be bound in the containing environment
to a quoted expression performing an environment transformation, and perhaps
supplying some properties and marks.
Parentheses are used to denote a sequence value. to delimit the argument list of
an operator. Square brackets are used to denote a new environment value, which
behaves much like a record.
Semantics of labels:
A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source." The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets. The label represents a set of directed arcs, one from each of
its sources to each of its destinations. Multiple marks make the node a target of
in multiple labels. A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.
OTHER NOTES
It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes. (E.g., text is not always there to be
"shown".)
Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.
The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.
Put in contents if: Put in environment if:
effect is local to node has scope
is directly edited is only indirectly edited
is to be bound locally needs delayed or global binding
STANDARD CARD
WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.
GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.
STANDARDIZE CONCEPTS, NOT NAMES.
HISTORY LOG
Bring the syntax up front.
Further develop parallelism between grammar and semantic equations.
Write semantic equations in terms of concrete syntax.
Quote general expressions.
V, E, C > R, T, E .
[...] > <...> for quotation of script expressions.
(E | id←e, m) > [E | id←e, m] for local binding.
Introduce primary to disambiguate expression* , factor lhs from binding.
Introduce Sub component to initialize nodes.
Debug semantics of braces and dot.
Mode > binding.
Debug semantics of <id> (fix up indirection).
Add VAL.
Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday): Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings. Moved history log to end of message.
Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax. Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c).
Fixed up semantic equations to reflect this. Exchanged the use of {}s and ()s.
Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.
Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
E(id) > locVal(id, E) --Remove conflict with f(E).
Outer > "Outer"
Const > "="
id lookup rule modified (R & T<id>)
[E | id←e, m] > [E | id m e]
"." as infix op
expressions are evaluated left-to-right (except for binding operator)
Reverse VAL/ENV default for parens.
bindq > bind
binding > bindingMode
expand definition of apply inline
default T<construct>(E) = E
add comments to semantic equations
-------------------
R<>(E) = Nothing -- The empty expression
-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)) -- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E)) -- Composition
R<literal>(E) = literal
R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)
R<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))
R<"NOT" p>(E) = if R<p>(E) then False else True
R<p1 op p2>(E) =
op = "." => R<p2>([R<p1>(E) | "Outer" = E])
op = "+" => R<p1>(E)+R<p2>(E)
. . .
R<n m op e>(E) = Nothing -- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)
R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))
R<"(" e* ")">(E) = R<e*>(E)
R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])
-------------------
Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
R&T<>
Nothing > ""
Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
Remove side-effects from all expressions.
Parentheses purely for grouping (don't hide environment transformations).
#label > label !
labels within nodes
Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
Rewrite <n m op e> as syntactic sugar.
structured labels
re-introduce apply function in R&T<p1 op p2>
correct syntax for "."
% for opening an environment (also replaces ENV?)
Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
Drop "%"; ENV() is now the only environment-constructing operator.
Add SUB operator (first operand: sequence only, second: number only).
Add atoms, as distinct from ids.
Fix lhs op rhs syntax.
Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
distinguish syntactically between properties (marks) and labels.
only the "main" id of a label is declarable.
eliminate as an id character.
eliminate op ids from grammar.
restructure the grammar for "functional" notation for operators.
update semantic equations for new grammar, etc.
fix treatment of unbound qualified names (now produce Nil).
Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
restore $val.
move quoting to rhs, allow quoted primaries without parentheses.
allow an op to be the rhs of a definition.
eliminate the functions operate, apply, eval by back substitution.
change semantics of () to allow "record" construction without $env.
Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
"It's OK to edit a node if you understand ALL of its (local) properties, and
either don't remove any of them or also understand ALL properties
of its parent."
"Put in contents if: Put in environment if: ..."
Add connection syntax to syntactically rule out a+←'b.
Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
Syntactically separate label references and name invocation.
Put in distinct syntax in rhs for environment construction.
Informal semantics of labels.
( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.