File [_CD6_]<interscript>mail>interdoc-t-aug81.mail!1

*start*
07201 00024 USt
Date: 7 Aug. 1981 4:41 pm PDT (Friday)
From: Mitchell.PA
Subject: Current Level 0/1 Interdoc status/rev. 15
To: Mitchell, Horning

Last edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

We envision an Interdoc script being processed in any manner equivalent to the
following:

Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.

				BASIC INTERDOC

GRAMMAR

node	::= "{" labels expression* "}"
labels	::= [label* ":"]
label	::= "#" name
expression ::= [ lhs ] [ "'"  | "." | op ] rhs
rhs	::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
	     [ "VAL" ] "(" expression* ")" 
literal	::= Boolean | integer | hexint | real | string | label
name	::= id ( "." id)*
id	::= (letter | "" ) ( letter | "" | digit )*	-- "" is the null id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs	::= name binding
binding ::= "=" | ":" | ":=" | "←"
op	::= "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
	   "LT" | " LE" | "EQ" | "NE" | "GE" | "GT" 

SYNTACTIC EXAMPLES:

{#examplenode:
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

Environments bind expressions and "modes" to identifiers:
	Null denotes the "empty" environment
	[E | id←e, m] means "E with (e, m) bound to id"
	E(id) denotes the value locally bound to id in E
		Null(id) = id
		[E | id'←e, m](id) = if id=id' then e else E(id)
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id'←e, m]) =
			if id=id' then m else locBinding(id, E)

							-- Basis
R<>(E) = nothing						-- just what it says
T<>(E) = E							-- Identity

							-- Expression sequence
R<nothing e*>(E) = R<e*>(E)				-- "nothing" disappears
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<"'" p>(E) = p
T<"'" p>(E) = E

R<literal>(E) = literal
T<literal>(E) = E

R<id>(E) = if valOf(id, E)=id then id else R<valOf(id, E)>(E)
T<id>(E) = if valOf(id, E)=id then E else T<valOf(id, E)>(E)

R<p "." id>(E) = R<id>(R<p>(E))
T<p "." id>(E) = if valOf(id, R<p>(E))=id then E else T<valOf(id, R<p>(E))>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<p1 op p2>(E) = apply(op, p1, p2, E)
T<p1 op p2>(E) = E

R<"NOT" p>(E) = if R<p>(E)=True then False else True
T<"NOT" p>(E) = E

R<n m op e>(E) = nothing						-- Empty list
T<n m op e>(E) = bindq(n, m, apply(op, n, e, E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | Outer←E, Const]) "}"
T<"{" labels e* "}">(E) = (T<Sub e*>([Null | Outer←E, Const]))(Outer)

R<"(" e* ")">(E) = [T<"(" e* ")">(E) | Outer←Null, Const]
T<"(" e* ")">(E) = T<e*>([Null | Outer←E, Const])

R<"VAL(" e* ")">(E) = R<e*>(E)
T<"VAL(" e* ")">(E) = E


whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding(Outer, E) ~= None	=> whereBound(id, E(Outer))
	True					=> Null

valOf(id, E) = (whereBound(id, E))(id)		-- Gets innermost value

bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

bindq(id, m, e, E) =
	bindingOf(id, E) = "="	=> E
	m = ":="			=> assign(id, e, E)
	True				=> [E | id←e, m]

bindq(id.n, m, e, E) = [E | id←bindq(n, m, e, R<id>(E)), bindingOf(id, E)]

assign(id, e, E) = locBinding(id, E) = ":"	=> [E | id←e, ":"]
		     bindingOf(id, E) = ":"	=> bindq(Outer.id, ":=", e, E)
		     True			=> E 

apply(op, lhs, rhs, E) =
	op = ""	=> R<rhs>(E)
	op = "." 	=> R<rhs>([R<lhs>(E) | Outer←E, Const])
	op = "+"	=> R<lhs>(E)+R<rhs>(E)
	. . .


Missing or in question:
	literal sequences
	binary & relational operators

-------------------
Expressions in an Interdoc script may denote
	literal values:
		Boolean: (F, T)
		integer: ... -3, -2, -1, 0, 1, 2, 3, ...
		real: 1.2E5, . . .
		string: <this is a string>
		label: #A123, #anId, #Paragraph
		the empty environment: Null
		the empty list: NIL
		id:  (the null id), bold, thisIsAnId, Helvetica, . . .
			(unless bound, taken to denote a primitive)
	environments
	unevaluated expressions


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bindq(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in E(Outer).  Perverse explicit bindings to outer might create loops,
leaving some ids undefined, but there seems to be little reason to clutter up the
semantics by forbidding such assignments.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.

Braces create a nested environment; if preceded by a dot, it is initialized to the
value of the name in the binding; if preceded by VAL, it is executed for value,
and the environment is then discarded.


Semantics of labels:

A label #id on a node in the dominant structure gives that node membership in
the set named by id.  Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Last edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Last edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.
*start*
07956 00024 USt
Date: 13 Aug. 1981 5:36 pm PDT (Thursday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 16
To: Mitchell, Horning

[Jim,

I did a fair amount of doodling, primarily in an attempt to transform the semantic
equations to a form where their correctness will be "more nearly obvious."  Some
of the changes I like, but I'm not really wedded to any of them.]

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
In question:
	treatment of primitive identifiers
	side effects in expressions
	operations on nested environments: font←.(size←10)

Missing:
	literal sequences

-------------------

We envision an Interdoc script being processed in any manner equivalent to the
following:

Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.

				BASIC INTERDOC

GRAMMAR

node	::= "{" labels expression* "}"
labels	::= [label* ":"]
label	::= "#" name
expression ::= [ lhs ] [ "'"  | op ] rhs
rhs	::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
	     [ "ENV" ] "(" expression* ")" 
literal	::= Boolean | integer | hexint | real | string | label
name	::= id ( "." id)*
id	::= (letter | "" ) ( letter | "" | digit )*	-- "" is the null id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs	::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op	::= "." | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
	   "LT" | " LE" | "EQ" | "NE" | "GE" | "GT" 

SYNTACTIC EXAMPLE:

{#examplenode:
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E,
			if locBinding(id, E) ~= None
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)

N. B.
	T<construct>(E) = E , if no explicit value is given below.

							-- Basis
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, e, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":="			=> assign(id, e, E)	-- Assign at right level
	True				=> [E | id m e]

bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]

assign(id, e, E) = locBinding(id, E) = ":"	=> [E | id ":" e]
		     bindingOf(id, E) = ":"	=> bind("Outer".id, ":=", e, E)
		     True			=> E 		-- Can only assign to vars


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Boolean: (F, T)
		integer: ... -3, -2, -1, 0, 1, 2, 3, ...
		real: 1.2E5, . . .
		string: <this is a string>
		label: #A123, #anId, #Paragraph
		the empty environment: Null
		the empty list: NIL
		id:  (the null id), bold, thisIsAnId, Helvetica, . . .
			(unless bound, taken to denote a primitive)
	environments
	unevaluated expressions


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  Perverse explicit bindings to outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.

Braces create a nested environment; if preceded by a dot, it is initialized to the
value of the name in the binding; if preceded by VAL, it is executed for value,
and the environment is then discarded.


Semantics of labels:

A label #id on a node in the dominant structure gives that node membership in
the set named by id.  Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.


*start*
08870 00024 USt
Date: 17 Aug. 1981 11:40 am PDT (Monday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 17
To: Mitchell, Horning

[Jim,

For discussion this afternoon.

Jim H.]

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

-------------------
In question:
	treatment of primitive identifiers (vs. labels)
	side effects in expressions (why only in conditionals?)
	operations on nested environments: font←.(size←10)
	merging environments
	reduced transformations

Missing:
	literal sequences
	operations on sequences (subscripting)

-------------------

We envision an Interdoc script being processed in any manner equivalent to the
following:

Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.

				BASIC INTERDOC

GRAMMAR

node	::= "{" labels expression* "}"
labels	::= [label* ":"]
label	::= "#" name
expression ::= [ lhs ] [ "'"  | op ] rhs
rhs	::= [ "NOT" ] primary ( op primary )*
primary ::= literal | id | primary "." id | conditional | node |
	     [ "ENV" ] "(" expression* ")" 
literal	::= Boolean | integer | hexint | real | string | label
name	::= id ( "." id)*
id	::= (letter | "" ) ( letter | "" | digit )*	-- "" is the null id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs	::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op	::= "." | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
	   "LT" | " LE" | "EQ" | "NE" | "GE" | "GT" 

SYNTACTIC EXAMPLE:

{#examplenode:
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Null
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


R&T<>(E) = ""; E

R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))

R&T<literal>(E) = literal; E

R&T<id>(E) = if bindingOf(id, E)=None then id; E else R&T<valOf(id, E)>(E)

R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R&T<e2*>(T<e1>(E)) else R&T<e3*>(T<e1>(E))

R&T<"NOT" p>(E) = if R<p>(E) then False else True; E

R&T<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E ?]); E
	op = "+"	=> R<p1>(E)+R<p2>(E); E
	. . .

R&T<n m e>(E) 	= ""; bind(n, m, R<e>(E), E)
R&T<n m "'" e>(E) 	= ""; bind(n, m, e, E)
R&T<n m op e>(E) 	= ""; bind(n, m, R<n op e>(E), E)

R&T<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}";
			  locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))

R&T<"(" e* ")">(E) = R<e*>(E); E

R&T<"ENV(" e* ")">(E) = [T<e*>(E) | "Outer" = Null]; T<e*>(E) ?


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, e, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":="			=> assign(id, e, E)	-- Assign at right level
	True				=> [E | id m e]

bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]

assign(id, e, E) = locBinding(id, E) = ":"	=> [E | id ":" e]
		     bindingOf(id, E) = ":"	=> bind("Outer".id, ":=", e, E)
		     True			=> E 		-- Can only assign to vars


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Boolean: (F, T)
		integer: ... -3, -2, -1, 0, 1, 2, 3, ...
		real: 1.2E5, . . .
		string: <this is a string>
		label: #A123, #anId, #Paragraph
		the empty environment: Null
		the empty list: NIL
		id:  (the null id), bold, thisIsAnId, Helvetica, . . .
			(unless bound, taken to denote a primitive)
	environments
	unevaluated expressions


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.

Parentheses create a nested environment; if preceded by a dot, it is initialized to
the value of the name in the binding; if not preceded by ENV, it is executed for
value, and the environment is then discarded.


Semantics of labels:

A label #id on a node in the dominant structure gives that node membership in
the set named by id.  Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

*start*
09077 00024 USt
Date: 17 Aug. 1981 7:05 pm PDT (Monday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 18
To: Mitchell, Horning

[Jim,


Jim H.]

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

-------------------
In question:
	operations on nested environments: font←.(size←10)
	merging environments
	reduced transformations
	structured labels
	getEnv operator
	labels outside dominant structure
	non-printing nodes

Missing:
	operations on sequences and environments (subscripting and enumeration)
	substitution of  for Null, Nil, etc.

-------------------

We envision an Interdoc script being processed in any manner equivalent to the
following:

Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.

				BASIC INTERDOC

GRAMMAR

node	::= "{" expression* "}"
expression ::= [ lhs ] [ "'"  | op ] | name ":!"	-- :! for label declaration
rhs	::= [ "NOT" ] primary ( op primary )* 
primary ::= literal | id | primary "." id | conditional | node |
	     [ "ENV" ] "(" expression* ")" | label
literal	::= Boolean | integer | hexint | real | string | label
name	::= id ( "." id)*
id	::= (letter | "" ) ( letter | "" | digit )*	-- "" is the null id
label	::= name "!"
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs	::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op	::= "." | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
	   "LT" | " LE" | "EQ" | "NE" | "GE" | "GT" 

SYNTACTIC EXAMPLE:

{node.example!
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Null
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


R&T<>(E) = ""; E

R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))

R&T<literal>(E) = literal; E

R&T<id>(E) = if bindingOf(id, E)=None then id; E else R&T<valOf(id, E)>(E)

R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R&T<e2*>(E) else R&T<e3*>(E)

R&T<"NOT" p>(E) = if R<p>(E) then False else True; E

R&T<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E ?]); E
	op = "+"	=> R<p1>(E)+R<p2>(E); E
	. . .

R&T<n m e>(E) 	= ""; bind(n, m, R<e>(E), E)
R&T<n m "'" e>(E) 	= ""; bind(n, m, e, E)
R&T<n m op e>(E) 	= ""; bind(n, m, R<n op e>(E), E)

R&T<"{" e* "}">(E) = "{" R<Sub e*>([Null | "Outer" = E]) "}";
			  locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))

R&T<"(" e* ")">(E) = R&T<e*>(E)

R&T<"ENV(" e* ")">(E) = [T<e*>(E) | "Outer" = Null]; T<e*>(E) ?


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, e, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":="			=> assign(id, e, E)	-- Assign at right level
	True				=> [E | id m e]

bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]

assign(id, e, E) = locBinding(id, E) = ":"	=> [E | id ":" e]
		     bindingOf(id, E) = ":"	=> bind("Outer".id, ":=", e, E)
		     True			=> E 		-- Can only assign to vars


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Boolean: (F, T)
		integer: ... -3, -2, -1, 0, 1, 2, 3, ...
		real: 1.2E5, . . .
		string: <this is a string>
		label: A123!, anId!, Paragraph!
		the empty environment: Null
		the empty list: NIL
		id:  (the null id), bold, thisIsAnId, Helvetica, . . .
			(unless bound, taken to denote a primitive)
	environments
	unevaluated expressions


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.

Parentheses create a nested environment; if preceded by a dot, it is initialized to
the value of the name in the binding; ? if not preceded by ENV, it is
executed for value, and the environment is then discarded.


Semantics of labels:

A label #id on a node in the dominant structure gives that node membership in
the set named by id.  Multiple labels place the node in multiple sets, and a
unique label on a node places it in a singleton set, i.e., identifies it uniquely.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""


*start*
09690 00024 USt
Date: 19 Aug. 1981 9:58 am PDT (Wednesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 19
To: Mitchell, Horning, Lampson


Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

-------------------
In question:
	subscripting: sequences, yes; nodes, labels, sets, environments, no
		evaluation of subscript expressions
	merging environments
	labels outside dominant structure
	non-printing nodes
	generalize apply to non-literal operands (distribute over sequences?)
	IF, NOT, apply when arguments are not literals, partial evaluation
	bind op in environment?

Missing:
	operations on sequences and environments (subscripting and enumeration)
	substitution of  for Null, Nil, etc.

-------------------

We envision an Interdoc script being processed in any manner equivalent to the
following:

Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.

				BASIC INTERDOC

GRAMMAR

node	::= "{" expression* "}"
expression ::= [ lhs ] [ "'" | op ] | name ":!"	-- :! for label declaration
rhs	::= [ "NOT" ] primary ( op primary )* 
primary ::= literal | id | primary "." id | conditional | node |
	     [ "ENV" ] "(" expression* ")" | label
literal	::= Boolean | integer | hexint | real | string | label
name	::= id ( "." id)*
id	::= (letter | "" ) ( letter | "" | digit )*	-- "" is the null id
label	::= name "!"
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs	::= name bindingMode
bindingMode ::= "=" | ":" | ":=" | "←"
op	::= "%" | "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
	   "LT" | " LE" | "EQ" | "NE" | "GE" | "GT" 

SYNTACTIC EXAMPLE:

{node.example!
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Null
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


R&T<>(E) = ""; E

R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))

R&T<literal>(E) = literal; E

R&T<id>(E) = if bindingOf(id, E)=None then id; E else R&T<valOf(id, E)>(E)

R&T<p "." id>(E) = R&T<valOf(id, R<p>(E))>(E)

-- Subscript should go here

R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R&T<e2*>(E) else R&T<e3*>(E)

R&T<"NOT" p>(E) = if R<p>(E) then False else True; E

R&T<p1 "'" p2>(E)	= p2; E
R&T<p1 "%" p2>(E)	= R<p2>([R<p1>(E) | "Outer" = E]);
			 [T<p2>([R<p1>(E) | "Outer" = E]) | "Outer" = Null]
R&T<p1 op p2>(E)	= apply(R<p1>(E), op, R<p2>(E)); E

R&T<n m e>(E) = ""; bind(n, m, R<e>(E), E)
	<n m op e> = <n m n op e>			-- Syntactic sugar

R&T<"{" e* "}">(E) = "{" R<Sub e*>([Null | "Outer" = E]) "}";
			  locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))

R&T<"(" e* ")">(E) = R&T<e*>(E)


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

apply(arg1, op, arg2) =
	op = "+"	=> arg1+arg2
	. . .

bind(id, m, e, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, e, E)	-- Assign at right level
	True				=> [E | id m e]

bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]

assign(id, e, E) =
	locBinding(id, E) = ":"	=> [E | id ":" e]
	bindingOf(id, E) = ":"	=>
				[E | "Outer" "=" bind(id, ":=", e, locVal("Outer", E))]
	True				=> E 			-- Can only assign to vars


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Boolean: (F, T)
		integer: ... -3, -2, -1, 0, 1, 2, 3, ...
		real: 1.2E5, . . .
		string: <this is a string>
		label: A123!, anId!, Paragraph!
		the empty environment: Null
		the empty list: NIL
		id:  (the null id), bold, thisIsAnId, Helvetica, . . .
			(unless bound, taken to denote a primitive)
	environments
	unevaluated expressions


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound to an environment transformation in the containing environment.

Parentheses create a nested environment; if preceded by a dot, it is initialized to
the value of the name in the binding; ? if not preceded by ENV, it is
executed for value, and the environment is then discarded.


Semantics of labels:

A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members.  Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes


*start*
10516 00024 USt
Date: 19 Aug. 1981 6:56 pm PDT (Wednesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 20
To: Mitchell, Horning, Lampson


Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.

-------------------
In question:
	merging environments (OPEN)
	declaration of "main" labels
	labels outside dominant structure
	non-printing nodes (semicolon?)
	bind op names in environment?
		structured primitive names, naming authorities
	how to syntactically distinguish operator application
	infix vs. prefix for general operators (APL?)
	treatment of unbound qualified names
	 as id vs. binary op vs. sign on numbers

Missing:
	enumeration over sequences and environments
	substitution of  for Null, Nil, etc., as appropriate

-------------------

We envision an Interdoc script being processed in any manner equivalent to the
following:

Parse the script, alternately
- reducing each expression to "primitives" by evaluating constant subexpressions
and replacing names by the values to which they are bound in the current
environment, and
- transforming the environment as indicated by the expressions.

				BASIC INTERDOC

GRAMMAR

node	::= "{" expression* "}"
expression ::= [ lhs ] rhs | name ":!"	-- :! for label declaration
rhs	::= [ "NOT" ] primary ( op primary )* 
primary ::= literal | id | primary "." id | conditional | node |
	     [ "ENV" ] "(" expression* ")"
literal	::= Boolean | integer | hexint | real | string | label | atom
name	::= id ( "." id)*
id	::= (letter | "" ) ( letter | "" | digit )*	-- "" is the null id
label	::= name "!"
atom	::= "$" id
conditional ::= "IF(" expression "," expression* [ "," expression* ] ")"
lhs	::= name bindingMode [  "'" | op ]
bindingMode ::= "=" | ":" | ":=" | "←"
op	::= "+" | "" | "*" | "/" | "MOD" | "AND" | "OR" |
	   "LT" | " LE" | "EQ" | "NE" | "GE" | "GT" | "SUB"

SYNTACTIC EXAMPLE:

{node.example!
a:='NOT margins.left EQ 120 margins.left←100 r=12.5*pt
IF(a, leftMargin←+5, leftMargin←+10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Null
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


R&T<literal>(E) = literal; E

R&T<id>(E) = if bindingOf(id, E)=None then "$" id; E else R&T<valOf(id, E)>(E)

R&T<p "." id>(E) = R&T<valOf(id, R<p>(E))>(E)

R&T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R&T<e2*>(E) else R&T<e3*>(E)

R&T<"NOT" p>(E) = NOT R<p>(E); E

R&T<p1 "'" p2>(E)	= p2; E
R&T<p1 op p2>(E)	= apply(R<p1>(E), op, R<p2>(E)); E

R&T<n m e>(E) = ""; bind(n, m, R<e>(E), E)
	<n m op e> = <n m n op e>			-- Syntactic sugar

R&T<>(E) = ""; E

R&T<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E)); T<e*>(T<e1>(E))

R&T<"(" e* ")">(E) = R&T<e*>(E)

R&T<"ENV(" e* ")">(E) = [T<e*>([Null | "Outer" = E]) | "Outer" = Null]; E

R&T<"{" e* "}">(E) = "{" R<Sub e*>([Null | "Outer" = E]) "}";
			  locVal("Outer", (T<Sub e*>([Null | "Outer" = E])))

R&T<name ":!">(E) = 


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

apply(arg1, op, arg2) =
	op = "+"	=> arg1+arg2
	. . .
	op = "SUB"	=> arg1[arg2]

bind(id, m, e, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, e, E)	-- Assign at right level
	True				=> [E | id m e]

bind(id.n, m, e, E) = [E | id bindingOf(id, E) bind(n, m, e, R<id>(E))]

assign(id, e, E) =
	locBinding(id, E) = ":"	=> [E | id ":" e]
	bindingOf(id, E) = ":"	=>
				[E | "Outer" "=" bind(id, ":=", e, locVal("Outer", E))]
	True				=> E 			-- Can only assign to vars


-------------------
		id:  (the null id), bold, thisIsAnId, Helvetica, . . .
Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		primitive properties and operators: $id
		the empty environment: Null
		the empty list: NIL
	unevaluated expressions
	environments
	sets (sequences) of nodes with given labels


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", p, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  Perverse explicit bindings to Outer might create
loops, leaving some ids undefined, but there seems to be little reason to clutter
up the semantics by forbidding such assignments.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.

Parentheses are used purely for grouping (e.g., creating a sequence value for a
binding).  ENV is used to create a new environment, which behaves much like
a record.


Semantics of labels:

A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members.  Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its properties.
	(Variant: all properties on the path back to the root.)


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

*start*
03100 00024 USt
Date: 20 Aug. 1981 12:29 pm PDT (Thursday)
From: Horning.pa
Subject: Minutes from the Midnight Skull Session
To: Mitchell, Lampson
cc: Horning

We started from the following list.

In question:
	merging environments (OPEN)
	declaration of "main" labels
	labels outside dominant structure
	non-printing nodes (semicolon?)
	bind op names in environment?
		structured primitive names, naming authorities
	how to syntactically distinguish operator application
	infix vs. prefix for general operators (APL?)
	treatment of unbound qualified names
	 as id vs. binary op vs. sign on numbers

Missing:
	enumeration over sequences and environments
	substitution of  for Null, Nil, etc., as appropriate

-------------------

Here is my transcription/recollection of their resolution:

 merging environments (OPEN)
	REJECT
 declaration of "main" labels
	ACCEPT (must define semantics precisely)
	Note that we decided to distinguish between properties (marks) and labels.
	For properties, the expression preceding # must evaluate to an external
		($, atomic) name.
	The label preceding ! must be a literal name, which is not evaluated.
		(Analogous to lhs name in a binding.)
 labels outside dominant structure
	Do NOT cause their nodes to be included in labelled set (for templates).
 non-printing nodes (semicolon?)
	ACCEPT.  More properly HIDDEN nodes; syntax still debatable.
 bind op names in environment? structured primitive names, naming authorities
	EXTERNAL names will be structured, to allow for NA's.
	Binding will allow the use of freely-chosen local synonyms.
	These decisions apply equally to properties and operators.
 how to syntactically distinguish operator application
	Some small variant of Cambridge Polish.
	BUT, lookup operator in environment before application.
 infix vs. prefix for general operators (APL?)
	All prefix (function application syntax).
 treatment of unbound qualified names
	Should result in Null or ERROR, not an external name (fix equations).
  as id vs. binary op vs. sign on numbers
	REJECT as id.
	No more infix ops.
	No ambiguity between sign and op in functional notation
 enumeration over sequences and environments
	REJECT
 substitution of  for Null, Nil, etc., as appropriate
	REJECT

			OTHER NOTES

A "main" label can only be an id, not a qualified name.

An atom (external name) can be qualified.

Operator ids come out of the grammar; we need to ensure that we define the
semantic basis for SEQ/LIST, IF, NOT, ENV, PROG, QUOTE.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

The "safety" rules for editing partially understood scripts should be restated
entirely in terms of local properties (which may have been implicitly acquired
through Sub or other invocations).

We should check our characterset for disjointness with Interpress.DoubtfulChars.


*start*
11578 00024 USt
Date: 20 Aug. 1981 5:40 pm PDT (Thursday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 22
To: Mitchell, Horning, Lampson

[Jim,

I think I've fixed the syntax problems we discussed.  However, we should discuss
how palatable this syntax is--I made a number of relatively arbitrary decisions
for the sake of getting something definite as soon as possible.

Jim H.]


Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
	resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
	distinguish syntactically between properties (marks) and labels.
	only the "main" id of a label is declarable.
	eliminate  as an id character.
	eliminate op ids from grammar.
	restructure the grammar for "functional" notation for operators.
	update semantic equations for new grammar, etc.
	fix treatment of unbound qualified names (now produce Nil).

-------------------
Not done:
	State the formal semantics of labels and properties.

-------------------

We envision an Interdoc script being input and viewed in any manner
equivalent to the following:

Parse the script, alternately
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items, and 
- transforming the environment as indicated by the binding items.

				BASIC INTERDOC

GRAMMAR

item ::=  primary | binding | id ":!"	-- :! for label declaration
primary ::= id | primary "." id | literal | application | property | node 
id	::= letter ( letter | digit )*
literal	::= Boolean | integer | hexint | real | string | label | external
label	::= name "!"
name	::= id ( "." id)*
external ::= "$" name
application ::= ( op | primary ) "(" item*  ( "," item* )* ")"
op	::= "'" | "+" | "" | "*" | "/" 
property ::= primary "#"
node	::= "{" item* "}"
binding ::= name [ op ] bindingMode ( primary | "(" item* ")" )
bindingMode ::= "=" | ":" | ":=" | "←"

SYNTACTIC EXAMPLE:

{Book.example!				-- Places this in Book and Book.example
ExampleParagraph				-- Invokes a definition
$UniqueMark12356#			-- Adds a nonstandard property
a:='(NOT(EQ(margins.left, 120))) margins.right←100 r=*(12.5, pt)
IF(a, margins.left+←5 margins.right←5, margins.left+←10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Nil = ""
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


R&T<id>(E) = R&T<valOf(id, E)>(E)

R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)

R&T<literal>(E) = literal; E

R&T<op "(" arg* ")">(E) = operate(op, arg*, E)
R&T<p "(" arg* ")">(E) = operate(R<p>(E), arg*, E)

R&T<p "#">(E) = R<p>(E) "#"; E

R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
			  locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))

R&T<n m p>(E) = ""; bind(n, m, R<p>(E), E)
	<n op m p> = <n m op "(" n "," p ")" >			-- Syntactic sugar
	<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")" >

R&T<"(" item* ")">(E) = R&T<item*>(E)

R&T<>(E) = ""; E

R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
			    T<item*>(T<item1>(E))

R&T<id ":!">(E) = 


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

operate(op, arg*, E) = 
	op = $QUOTE	=> arg*; E
	op = $HIDE		=> ""; E
	op = $ENV		=> [T<arg*>([Null | "Outer" = E]) | "Outer" = Null]; E
	True			=> apply(op, eval(arg*, E))

apply(op, val1, ... , valn) =
	op = $IF			=> if val1.R then val2 else val3
	op = "+" OR op = $PLUS 	=> val1.R + ... + valn.R; E
	...
	op = $LIST			=> val1
	op = $SUBSCRIPT		=> val1[val2.R]   -- val1: sequence, val2.R: int

eval("", E) = Nil
eval(arg1 arg*, E) = R&T<arg1>(E), eval(arg*, E)

bind(id, m, val, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, val, E)	-- Assign at right
level
	True				=> [E | id m val]

bind(id.n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]

assign(id, val, E) =
	locBinding(id, E) = ":"	=> [E | id ":" val]
	bindingOf(id, E) = ":"	=>
				[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
	True				=> E 			-- Can only assign to vars


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		external names: $name
		the empty environment: Null
		the empty list: NIL
	sequences of values
	unevaluated expressions
	environments
	sets (sequences) of nodes with given labels


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  The (implicit) "outermost" environment binds
each id to the external name $id.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.

Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator.  The operator $ENV is
used to create a new environment, which behaves much like a record.


Semantics of labels:

A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members.  Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and don't
remove any of them OR if you understand ALL properties of ALL nodes in the
path back to the root.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

We should check our characterset for disjointness with Interpress.DoubtfulChars.


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.


*start*
11654 00024 USt
Date: 21 Aug. 1981 6:58 pm PDT (Friday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 23
To: Mitchell, Horning, Lampson

Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
	restore $val.
	move quoting to rhs, allow quoted primaries without parentheses.
	allow an op to be the rhs of a definition.
	eliminate the functions operate, apply, eval by back substitution.
	change semantics of () to allow "record" construction without $env.

-------------------
Not done:
	State the formal semantics of labels and properties.
	Sets of properties, etc. (Cf. Mitchell's Font example.)
	Sort out "records" vs. quoted bindings.

-------------------

We envision an Interdoc script being input and viewed in any manner
equivalent to the following:

Parse the script, alternately
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items, and 
- transforming the environment as indicated by the binding items.

				BASIC INTERDOC

GRAMMAR

item		::= primary | binding
primary	::= id | primary "." id | literal | application | property | node 
id		::= letter ( letter | digit )*
literal		::= Boolean | integer | hexint | real | string | label | external
label		::= name "!"
name		::= id ( "." id)*
external	::= "$" name | op
op		::= "+" | "" | "*" | "/" 
application	::= primary "(" item*  ( "," item* )* ")"
property	::= primary "#"
node		::= "{" item* "}"
binding	::= name [ op ] bindingMode rhs | id ":!"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs		::= [ "'" ] ( primary | "(" item* ")" )

SYNTACTIC EXAMPLE:

{Book.example!				-- Places this in Book and Book.example
ExampleParagraph				-- Invokes a definition
$UniqueMark12356#			-- Adds a nonstandard property
a:='NOT(EQ(margins.left, 120)) margins.right←100 r=*(12.5, pt)
IF(a, margins.left+←5 margins.right←5, margins.left+←10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Nil = ""
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


R&T<id>(E) = R&T<valOf(id, E)>(E)

R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)

R&T<literal>(E) = literal; E

R&T<p "(" arg1 "," ... "," argn ")">(E) =
    CASE R<p>(E) OF
	$if		=> if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
	"+"	 	=> R<arg1>(E) + ... + R<argn>(E); E
	...
	$val		=> R<arg1>(E); E
	$list		=> R&T<arg1>(E)
	$sub		=> R&T<arg1>(E)[R<arg2>(E)]
					-- Subscript, arg1: sequence | node, arg2.R: int
	$hide		=> ""; E

R&T<p "#">(E) = R<p>(E) "#"; E

R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
	      locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))

R&T<>(E) = ""; E

R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
			    T<item*>(T<item1>(E))

R&T<n m rhs>(E) = ""; bind(n, m, R<rhs>(E), E)
	<n op m p> = <n m op "(" n "," p ")" >			-- Syntactic sugar
	<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")" >

R&T<"'" p>(E) = p; E

R&T<"(" item* ")">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
					-- Construct a "record" environment value

R&T<"'(" item* ")">(E) = item*; E

R&T<id ":!">(E) = ??; E


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, val, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, val, E)	-- Assign at right
level
	True				=> [E | id m val]

bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]

assign(id, val, E) =
	locBinding(id, E) = ":"	=> [E | id ":" val]
	bindingOf(id, E) = ":"	=>
				[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
	True				=> E 			-- Can only assign to vars


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		external names: $name
		the empty environment: Null
		the empty list: NIL
	sequences of values
	unevaluated expressions
	environments
	sets (sequences) of nodes with given labels


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  The (implicit) "outermost" environment binds
each id to the external name $id.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.

Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator.  The operator $ENV is
used to create a new environment, which behaves much like a record.


Semantics of labels:

A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members.  Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and don't
remove any of them OR if you understand ALL properties of ALL nodes in the
path back to the root.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

We should check our characterset for disjointness with Interpress.DoubtfulChars.


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.

Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
	resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
	distinguish syntactically between properties (marks) and labels.
	only the "main" id of a label is declarable.
	eliminate  as an id character.
	eliminate op ids from grammar.
	restructure the grammar for "functional" notation for operators.
	update semantic equations for new grammar, etc.
	fix treatment of unbound qualified names (now produce Nil).


*start*
12386 00024 USt
Date: 24 Aug. 1981 6:42 pm PDT (Monday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 24
To: Mitchell, Horning

Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
	"It's OK to edit a node if you understand ALL of its (local) properties, and
		either don't remove any of them or also understand ALL properties
		of its parent."
	"Put in contents if:				Put in environment if: ..."
	Add connection syntax to syntactically rule out a+←'b.


-------------------
Not done:
	Determine the (informal) semantics of labels.
	State the formal semantics of labels and properties.
	Sets of properties, etc. (Cf. Mitchell's Font example.)
		SET/LIST operators ($append $union ?)
		notation for list constants
	Sort out "records" vs. quoted bindings.
	Some syntactic marker to replace $env.
	Consider style for use of temporary local definitions.

-------------------

We envision an Interdoc script being input and viewed in any manner
equivalent to the following:

Parse the script, alternately
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items, and 
- transforming the environment as indicated by the binding items.

				BASIC INTERDOC

GRAMMAR

item		::= primary | binding
primary	::= id | primary "." id | literal | application | property | node 
id		::= letter ( letter | digit )*
literal		::= Boolean | integer | hexint | real | string | label | external
label		::= name "!"
name		::= id ( "." id)*
external	::= "$" name | op
op		::= "+" | "" | "*" | "/" 
application	::= primary "(" item*  ( "," item* )* ")"
property	::= primary "#"
node		::= "{" item* "}"
binding	::= name connection rhs | id ":!"
connection	::= bindingMode | op bindingMode | bindingMode "'"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs		::= primary | "(" item* ")"

SYNTACTIC EXAMPLE:

{Book.example!				-- Places this in Book and Book.example
ExampleParagraph				-- Invokes a definition
$UniqueMark12356#			-- Adds a property
a:='NOT(EQ(margins.left, 120)) margins.right←100 r=*(12.5, pt)
IF(a, margins.left+←5 margins.right←5, margins.left+←10)
<text for this node>
}

SEMANTICS

R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Nil = ""
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


R&T<id>(E) = R&T<valOf(id, E)>(E)

R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)

R&T<literal>(E) = literal; E

R&T<p "(" arg1 "," ... "," argn ")">(E) =
    CASE R<p>(E) OF
	$if		=> if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
	"+"	 	=> R<arg1>(E) + ... + R<argn>(E); E
	...
	$val		=> R<arg1>(E); E
	$list		=> R&T<arg1>(E)
	$sub		=> R&T<arg1>(E)[R<arg2>(E)]
					-- Subscript, arg1: sequence | node, arg2.R: int
	$hide		=> ""; E

R&T<p "#">(E) = R<p>(E) "#"; E

R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
	      locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))

R&T<>(E) = ""; E

R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
			    T<item*>(T<item1>(E))

R&T<n m rhs>(E) = ""; bind(n, m, R<rhs>(E), E)
	<n op m p> = <n m op "(" n "," p ")" >			-- Syntactic sugar
	<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")" >

R&T<"'" p>(E) = p; E

R&T<"(" item* ")">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
					-- Construct a "record" environment value

R&T<"'(" item* ")">(E) = item*; E

R&T<id ":!">(E) = ??; E


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, val, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, val, E)	-- Assign at right
level
	True				=> [E | id m val]

bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]

assign(id, val, E) =
	locBinding(id, E) = ":"	=> [E | id ":" val]
	bindingOf(id, E) = ":"	=>
				[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
	True				=> E 			-- Can only assign to vars


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		external names: $name
		the empty environment: Null
		the empty list: NIL
	sequences of values
	unevaluated expressions
	environments
	sets (sequences) of nodes with given labels


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  The (implicit) "outermost" environment binds
each id to the external name $id.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.

Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator.  The operator $ENV is
used to create a new environment, which behaves much like a record.


Semantics of labels:

A label name! on a node gives that node membership in the sets identified by
name (and its prefixes); the "main" identifier of a set name must be declared at
the root of a subtree containing all its members.  Multiple labels place the node
in multiple sets, and a unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

We should check our characterset for disjointness with Interpress.DoubtfulChars.

Put in contents if:				Put in environment if:
	effect is local to node			has scope
	is directly edited				is only indirectly edited
	is to be bound locally			needs delayed or global binding


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.

Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
	resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
	distinguish syntactically between properties (marks) and labels.
	only the "main" id of a label is declarable.
	eliminate  as an id character.
	eliminate op ids from grammar.
	restructure the grammar for "functional" notation for operators.
	update semantic equations for new grammar, etc.
	fix treatment of unbound qualified names (now produce Nil).

Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
	restore $val.
	move quoting to rhs, allow quoted primaries without parentheses.
	allow an op to be the rhs of a definition.
	eliminate the functions operate, apply, eval by back substitution.
	change semantics of () to allow "record" construction without $env.


*start*
01734 00024 USt
Date: 25 Aug. 1981 9:28 am PDT (Tuesday)
From: Horning.pa
Subject: Interdoc Label Thoughts
To: Mitchell
cc: Lampson, Horning

Overnight, I've come to the conclusion that the reason we were having trouble
yesterday with the semantics of labels is that we were trying to attach too much
semantics to them--much as if we had gone beyond numbers to specify the use
of numbers in spline curves.

I propose that we go back to something much closer to Brian Reid's "link and
mark" semantics.  I.e., the "meaning" of a reference, mark pair is simply: "record
the existence of a directed arc from here to there," without saying what an editor
would use such arcs for.  They are simply the escape mechanism from a strict
tree structure.

I believe that we should keep the present "sequence" semantics for multiple
nodes marked with the same label (i.e., there is a directed arc from every
reference to a label to each node marked with that label.)

I also accept your stricture that no environment information should flow along
these arcs, so that we can simply ignore them when determining the
Reduced&Transformed values of scripts and environments.

We should continue to declare the scope of main labels, for all the previously
discussed reasons, but we should not try to use the environment to record label
values (as (un)evaluated nodes or whatever).  We should thus syntactically
distinguish a label reference from a name invocation.

Except for the syntactic ambiguity, it would be tempting to adopt some
"symmetric" notation for references and marks, e.g., name> and >name .
However, I don't doubt our ability to find a satisfactory syntax once we have
agreed on the semantics.

Comments, improvements?

Jim H.

*start*
13389 00024 USt
Date: 25 Aug. 1981 11:34 am PDT (Tuesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 25
To: Mitchell, Horning

Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
	Syntactically separate label references and name invocation.
	Put in distinct syntax in rhs for environment construction.
	Informal semantics of labels.
	( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.

-------------------
Open questions:
	Use of ( ... ) vs. [ ... ] (especially in application).
	Sort out "records" vs. quoted bindings.
	Sets of properties, etc. (Cf. Mitchell's Font example.)
		SET/LIST operators ($append $union ?)
		semantics of $list
	Non-uniform semantics of quote.
		Subtle distinctions between quoted and unquoted ( ... )
		Semantics of quoted [ ... ] ?
	We should check our characterset for disjointness with
		Interpress.DoubtfulChars.


Not done:
	State the formal semantics of labels and properties.
	Consider style for use of temporary local definitions.

-------------------

We envision an Interdoc script being input and viewed in any manner
equivalent to the following:

Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items, and
- recording the links indicated by label references and marks.

				BASIC INTERDOC

SYNTACTIC EXAMPLE:

{Book.example!		      -- Links to this from Book@ and Book.example@
ExampleParagraph				-- Invokes a definition
$UniqueMark12356#			-- Adds a property
a:='NOT[EQ[margins.left, 120]] margins.right←100 r=*[12.5, pt]
IF[a, margins.left+←5 margins.right←5, margins.left+←10]
<text for this node>
}

GRAMMAR

item		::= primary | binding | label
primary	::= literal | invocation | application | property | node | "(" item* ")"
literal		::= Boolean | integer | hexint | real | string | external | op
external	::= "$" name
name		::= id ( "." id)*
id		::= letter ( letter | digit )*
op		::= "+" | "" | "*" | "/"
invocation	::= id | primary "." id
application	::= primary "[" item*  ( "," item* )* "]"
property	::= primary "#"
node		::= "{" item* "}"
binding	::= name connection rhs
connection	::= bindingMode | op bindingMode | bindingMode "'"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs		::= primary | "[" item* "]"
label		::= id ":!" | name "!" | name "@"

SEMANTICS

R&T<id>(E) = R&T<valOf(id, E)>(E)

R&T<p "." id>(E) = R&T<locVal(id, R<p>(E))>(E)

R&T<literal>(E) = literal; E

R&T<p "(" arg1 "," ... "," argn ")">(E) =
    CASE R<p>(E) OF
	$if		=> if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
	"+"	 	=> R<arg1>(E) + ... + R<argn>(E); E
	...
	$val		=> R<arg1>(E); E
	$list		=> R&T<arg1>(E)
	$subscript	=> R&T<arg1>(E)[R<arg2>(E)]
						-- arg1: sequence | node, arg2.R: int
	$hide		=> "" ; E

R&T<p "#">(E) = R<p>(E) "#"; E

R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
	      locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))

R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; E			-- List constructor

R&T<"[" item* "]">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
					-- Construct a "record" environment value

R&T<>(E) = ""; E

R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
			    T<item*>(T<item1>(E))

R&T<n m rhs>(E) = "" ; bind(n, m, R<rhs>(E), E)
	<n op m rhs> = <n m op "(" n "," rhs ")">		-- Syntactic sugar
	<n op m "(" arg* ")"> = <n m op "(" n "," arg* ")">

R&T<"'" p>(E) = p; E

R&T<"'(" item* ")">(E) = item*; E  ??

R&T<"'[" item* "]">(E) = ??

R&T<label>(E) = label; E


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, val, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, val, E)	-- Assign at right
level
	True				=> [E | id m val]

bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]

assign(id, val, E) =
	locBinding(id, E) = ":"	=> [E | id ":" val]
	bindingOf(id, E) = ":"	=>
				[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
	True				=> E 			-- Can only assign to vars


R denotes the expression reduction function:
	R: expression > ( environment > expression )

T denotes the environment transformation function:
	T: expression > ( environment > environment )

R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)
	R&T: ( expression, environment ) > ( expression, environment )

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Nil = ""
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)


-------------------
Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		external names: $name
		the empty environment: Null
		the empty list: NIL
	sequences of values
	unevaluated expressions
	environments
	sets (sequences) of nodes with given labels


How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  The (implicit) "outermost" environment binds
each id to the external name $id.

The contents of each node are implicitly prefixed by Sub, which will generally
be bound in the containing environment to a quoted expression performing an
environment transformation, and perhaps supplying some properties.

Parentheses are used for grouping (e.g., creating a sequence value for a
binding), and to delimit the argument list of an operator.  The operator $ENV is
used to create a new environment, which behaves much like a record.


Semantics of labels:

A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source."  The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets.  The label represents a set of directed arcs, one from each of
its sources to each of its destinations.  Multiple marks make the node a target of
in multiple labels.  A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

Put in contents if:				Put in environment if:
	effect is local to node			has scope
	is directly edited				is only indirectly edited
	is to be bound locally			needs delayed or global binding


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.

Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
	resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
	distinguish syntactically between properties (marks) and labels.
	only the "main" id of a label is declarable.
	eliminate  as an id character.
	eliminate op ids from grammar.
	restructure the grammar for "functional" notation for operators.
	update semantic equations for new grammar, etc.
	fix treatment of unbound qualified names (now produce Nil).

Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
	restore $val.
	move quoting to rhs, allow quoted primaries without parentheses.
	allow an op to be the rhs of a definition.
	eliminate the functions operate, apply, eval by back substitution.
	change semantics of () to allow "record" construction without $env.

Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
	"It's OK to edit a node if you understand ALL of its (local) properties, and
		either don't remove any of them or also understand ALL properties
		of its parent."
	"Put in contents if:				Put in environment if: ..."
	Add connection syntax to syntactically rule out a+←'b.


*start*
15765 00024 USt
Date: 27 Aug. 1981 7:51 pm PDT (Thursday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 29
To: Mitchell, Horning

Edited by Jim H. on 26 Aug. 1981 7:11 pm PDT (Wednesday).
	' ... ' in rhs
	Restore infix operators, right to left.
	Modify syntax to rule out more nonsense, add semantically meaningful
		nonterminals.
	Introduce special syntax for selections.
	Eliminate side-effects for $subscript (actually, all applications).
	Add application of defined functions.
	Note that Value[ ... ] allows use of temporary (hidden) local definitions,
		Nil[ ... ] allows placement of hidden nodes.
	( ... ) creates list/sequence values (without hiding bindings).
	Tidy up definition of assign, using bind("Outer." ...).
	Introduce value nonterminal into grammar (rule out more nonsense).
	rhs	::= ... | "[" [ lookup ] "|" binding* "]" .
	Remove $ name from literal (to lookup).
	Change nonterminal lookup to invocation.

-------------------
Open questions:
	We should rethink our character assignments.
		check our characterset for disjointness with
			Interpress.DoubtfulChars.
		enlarge op with a few more single-character operators?  %, &, \
	Possible node operators (purely in semantic domain, not operators?).
		$properties: node, environment > sequence	-- All #'s
		$marks: node, environment > sequence		-- All !'s
		$references: node, environment > sequence	-- All @'s
		$contents: node, environment > sequence	-- The rest (fringe)
	Consider restricting $subscript just to sequences, not nodes.
	Extend selection to CASE?

Not done:

-------------------

We envision an Interdoc script being input and viewed in any manner
equivalent to the following:

Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items (recording the
components of each node's environment in a form convenient to the editor), and
- recording the links indicated by label references and marks.

				BASIC INTERDOC

SYNTACTIC EXAMPLE:

{Book.example!		      -- Links to this from Book@ and Book.example@
ExampleParagraph				-- Invokes a definition
$UniqueMark12356#			-- Adds a property
Font←[Font | size←10*pt face←bold]
a:='NOT[EQ[margins.left 120]]' margins.right←100 r=12.5*pt
(a | margins.left←+5 margins.right←5 | margins.left+←10) -- conditional: Algol68
<text for this node>
}

GRAMMAR

item		::= value | binding | property | label
value		::= term | node | sequence
term		::= primary | primary op term	-- Ops apply right to left
primary	::= literal | invocation | application | selection
literal		::= Boolean | integer | hexint | real | string | op
op		::= "+" | "" | "*" | "/"
invocation	::= name | external
name		::= id ( "." id )*
id		::= letter ( letter | digit )*
external	::= "$" name
application	::= invocation "[" value* "]"
selection	::= "(" term "|" item* "|" item* ")"	-- Algol 68 style conditional
node		::= "{" item* "}"
sequence	::= "(" item* ")"
binding	::= name bindingMode rhs
bindingMode ::= "=" | ":" | ":=" | "←"
rhs		::= value | op term | "'" item* "'" | "[" [ invocation ] "|" binding* "]"
property	::= invocation "#"
label		::= id ":!" | name "!" | name "@"

SEMANTICS

R: expression > environment > expression			-- Reduction
T: expression > environment > environment			-- Transformation
R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)

[Unless explicitly given below, T<construct>(E) = E.]

R<primary op term>(E) = R<primary>(E) op R<term>(E)

R<literal>(E) = literal

R&T<id>(E) = R&T<valOf(id, E)>(E)

R&T<name "." id>(E) = R&T<valOf(id, R<name>(E))>(E)

R<"$" name>(E) = "$" name

R<invocation "[" value1 ... valuek "]">(E) =
    CASE R<invocation>(E) OF
	"$equal"	=> R<value1>(E) = R<value2>(E)
	"$greater"	=> R<value1>(E) > R<value2>(E)
	. . .
	"$subscript"	=> R<value1>(E)[R<value2>(E)]
						-- value1: sequence | node, value2: int
    ELSE		=> R<invocation>([E | "Value" "=" R<value1 ... valuek>(E)])

R&T<"(" term "|" item1* "|" item2* ")">(E) =
			if R<term>(E) then R&T<item1*>(E) else R&T<item2*>(E)

R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" "=" E]) "}";
	      locVal("Outer", (T<"Sub" item*>([Null | "Outer" "=" E])))

R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; T<item*>(E)

R<>(E) = Nil

R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
					  T<item*>(T<item1>(E))

R&T<n m rhs>(E) = Nil; bind(n, m, R<rhs>(E), E)
	<n m op term> = <n m n op term>		-- Syntactic sugar

R<"'" item* "'">(E) = item*			--Usable only in rhs of binding

R<"[" invocation "|" binding* "]">(E) =
	[T<binding*>([R<invocation>(E) | "Outer" "=" E]) | "Outer" "=" Null]
R<"[|" binding* "]">(E) = [T<binding*>([Null | "Outer" "=" E]) | "Outer" "=" Null]

R<invocation "#">(E) = R<invocation>(E) "#"

R<label>(E) = label

	-- Subsidiary definitions for R&T

bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, val, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, val, E) -- Assign at right level
	True				=> [E | id m val]

bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, valOf(id, E))]

assign(id, val, E) =
	locBinding(id, E) = ":"	=> [E | id ":" val]
	bindingOf(id, E) = ":"	=> bind("Outer." id, ":=", val, E)
	True				=> E 			-- Can only assign to vars

NOTATION FOR ENVIRONMENTS

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Nil = ""
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)

SEMANTICS OF PROPERTIES, MARKS, REFERENCES (uses)

P: expression > environment > expression		-- Sequence of properties
M: expression > environment > expression		-- Sequence of marks
U: expression > environment > expression		-- Sequence of references
P&M&U<e>(E) denotes the triple  P<e>(E); M<e>(E); U<e>(E)

[These functions all return the empty list, Nil, except as specified below.]

P<invocation "#">(E) = R<invocation>(E)

M<name "!">(E) = prefixes(name)

U<name "@">(E) = prefixes(name)

P&M&U<invocation>(E) = P&M&U<R<invocation>(E)>(E)

P&M&U<"(" item* ")">(E) = P&M&U<item*>(E)

P&M&U<item1 item*>(E) = P<item1>(E) P<item*>(T<item1>(E));
				M<item1>(E) M<item*>(T<item1>(E));
				U<item1>(E) U<item*>(T<item1>(E))

prefixes(id) = id
prefixes(name "." id) = name "." id prefixes(name)

VALUE SPACE

Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		external names: $name
		the empty environment: Null
		the empty list: NIL
	sequences of values
	unevaluated expressions
	environments

DISCUSSION

How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  The (implicit) "outermost" environment binds
each id to the external name $id.

Nodes are delimited by brackets.  The contents of each node are implicitly
prefixed by Sub, which will generally be bound in the containing environment
to a quoted expression performing an environment transformation, and perhaps
supplying some properties and marks.

Parentheses are used to denote a sequence value.  to delimit the argument list of
an operator.  Square brackets are used to denote a new environment value, which
behaves much like a record.


Semantics of labels:

A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source."  The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets.  The label represents a set of directed arcs, one from each of
its sources to each of its destinations.  Multiple marks make the node a target of
in multiple labels.  A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

Put in contents if:				Put in environment if:
	effect is local to node			has scope
	is directly edited				is only indirectly edited
	is to be bound locally			needs delayed or global binding


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


			CONSCIOUSLY POSTPONED

Lambda expressions.

Sets of properties, etc. (Cf. Mitchell's Font example.)
	SET/LIST operators ($append $union ?)



				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.

Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
	resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
	distinguish syntactically between properties (marks) and labels.
	only the "main" id of a label is declarable.
	eliminate  as an id character.
	eliminate op ids from grammar.
	restructure the grammar for "functional" notation for operators.
	update semantic equations for new grammar, etc.
	fix treatment of unbound qualified names (now produce Nil).

Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
	restore $val.
	move quoting to rhs, allow quoted primaries without parentheses.
	allow an op to be the rhs of a definition.
	eliminate the functions operate, apply, eval by back substitution.
	change semantics of () to allow "record" construction without $env.

Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
	"It's OK to edit a node if you understand ALL of its (local) properties, and
		either don't remove any of them or also understand ALL properties
		of its parent."
	"Put in contents if:				Put in environment if: ..."
	Add connection syntax to syntactically rule out a+←'b.

Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
	Syntactically separate label references and name invocation.
	Put in distinct syntax in rhs for environment construction.
	Informal semantics of labels.
	( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.

Edited by Jim H. on 25 Aug. 1981 4:08 pm PDT (Tuesday).
	Add sequence as a nonterminal to the syntax.
	State the formal semantics of labels and properties.
	Reorder presentation (hopefully to improve readability).


*start*
15962 00024 USt
Date: 28 Aug. 1981 2:09 pm PDT (Friday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 30
To: Mitchell, Horning, Guttag

Edited by Jim H. on 28 Aug. 1981 2:08 pm PDT (Friday).
  [Changes since 25 August]
	' ... ' in rhs
	Restore infix operators, right to left.
	Modify syntax to rule out more nonsense, add semantically meaningful
		nonterminals.
	Introduce special syntax for selections.
	Eliminate side-effects for $subscript (actually, all applications).
	Add application of defined functions.
	Note that Value[ ... ] allows use of temporary (hidden) local definitions,
		Nil[ ... ] allows placement of hidden nodes.
	( ... ) creates list/sequence values (without hiding bindings).
	Tidy up definition of assign, using bind("Outer." ...).
	Introduce value nonterminal into grammar (rule out more nonsense).
	rhs	::= ... | "[" [ lookup ] "|" binding* "]" .
	Change nonterminal lookup to invocation.
	Remove $ name from literal (to invocation).
	Add node operators:
		$properties						-- All #'s
		$marks						-- All !'s
		$references						-- All @'s
		$contents						-- The rest (fringe)
	Restrict $subscript just to sequences, not nodes.


-------------------
Open questions:
	We should rethink our character assignments.
		check our characterset for disjointness with
			Interpress.DoubtfulChars.
		enlarge op with a few more single-character operators?  %, &, \
	Possible node operators (purely in semantic domain, not operators?).
	Extend selection to CASE?

Not done:

-------------------

We envision an Interdoc script being input and viewed in any manner
equivalent to the following:

Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items (recording the
components of each node's environment in a form convenient to the editor), and
- recording the links indicated by label references and marks.

				BASIC INTERDOC

SYNTACTIC EXAMPLE:

{Book.example!		      -- Links to this from Book@ and Book.example@
ExampleParagraph				-- Invokes a definition
$UniqueMark12356#			-- Adds a property
Font←[Font | size←10*pt face←bold]
factorial←'(LT[Value 2] | 1 | Value* factorial(Value-1))'
a:='NOT[EQ[margins.left factorial[5]]]' margins.right←100 r=12.5*pt
(a | margins.left←+5 margins.right←5 | margins.left+←10) -- conditional: Algol68
<text for this node>
}

GRAMMAR

item		::= value | binding | property | label
value		::= term | node | sequence
term		::= primary | primary op term	-- Ops apply right to left
primary	::= literal | invocation | application | selection
literal		::= Boolean | integer | hexint | real | string | op
op		::= "+" | "" | "*" | "/"
invocation	::= name | external
name		::= id ( "." id )*
id		::= letter ( letter | digit )*
external	::= "$" name
application	::= invocation "[" value* "]"
selection	::= "(" term "|" item* "|" item* ")"	-- Algol 68 style conditional
node		::= "{" item* "}"
sequence	::= "(" item* ")"
binding	::= name bindingMode rhs
bindingMode ::= "=" | ":" | ":=" | "←"
rhs		::= value | op term | "'" item* "'" | "[" [ invocation ] "|" binding* "]"
property	::= invocation "#"
label		::= id ":!" | name "!" | name "@"

SEMANTICS

R: expression > environment > expression			-- Reduction
T: expression > environment > environment			-- Transformation
R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)

[Unless explicitly given below, T<construct>(E) = E.]

R<primary op term>(E) = R<primary>(E) op R<term>(E)

R<literal>(E) = literal

R&T<id>(E) = R&T<valOf(id, E)>(E)

R&T<name "." id>(E) = R&T<valOf(id, R<name>(E))>(E)

R<"$" name>(E) = "$" name

R<invocation "[" value* "]">(E) = apply(invocation, R<value*>(E), E)
apply(invocation, value*, E) =
    CASE R<invocation>(E) OF
	"$equal"	=> value1 = value2
	"$greater"	=> value1 > value2
	. . .
	"$subscript"	=> value1[value2]	-- value1: sequence, value2: int
	"$contents"	=> C<value*>
	"$properties"	=> P<value*>(E)
	"$marks"	=> M<value*>(E)
	"$references" => U<value*>(E)
    ELSE		=> R<invocation>([E | "Value" "=" value*])

R&T<"(" term "|" item1* "|" item2* ")">(E) =
			if R<term>(E) then R&T<item1*>(E) else R&T<item2*>(E)

R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" "=" E]) "}";
	      locVal("Outer", (T<"Sub" item*>([Null | "Outer" "=" E])))

R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; T<item*>(E)

R<>(E) = Nil

R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
					  T<item*>(T<item1>(E))

R&T<n m rhs>(E) = Nil; bind(n, m, R<rhs>(E), E)
	<n m op term> = <n m n op term>		-- Syntactic sugar

R<"'" item* "'">(E) = item*			--Usable only in rhs of binding

R<"[" invocation "|" binding* "]">(E) =
	[T<binding*>([R<invocation>(E) | "Outer" "=" E]) | "Outer" "=" Null]
R<"[|" binding* "]">(E) = [T<binding*>([Null | "Outer" "=" E]) | "Outer" "=" Null]

R<invocation "#">(E) = R<invocation>(E) "#"

R<label>(E) = label

	-- Subsidiary definitions for R&T

bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, val, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, val, E) -- Assign at right level
	True				=> [E | id m val]

bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, valOf(id, E))]

assign(id, val, E) =
	locBinding(id, E) = ":"	=> [E | id ":" val]
	bindingOf(id, E) = ":"	=> bind("Outer." id, ":=", val, E)
	True				=> E 			-- Can only assign to vars

NOTATION FOR ENVIRONMENTS

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Nil = ""
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)

SEMANTICS OF PROPERTIES, MARKS, REFERENCES (uses), CONTENTS

P: expression > environment > expression		-- Sequence of properties
M: expression > environment > expression		-- Sequence of marks
U: expression > environment > expression		-- Sequence of references
P&M&U<e>(E) denotes the triple  P<e>(E); M<e>(E); U<e>(E)

[These functions all return the empty list, Nil, except as specified below.]

P<invocation "#">(E) = R<invocation>(E)

M<name "!">(E) = prefixes(name)

U<name "@">(E) = prefixes(name)

P&M&U<invocation>(E) = P&M&U<R<invocation>(E)>(E)

P&M&U<"(" item* ")">(E) = P&M&U<item*>(E)

P&M&U<item1 item*>(E) = P<item1>(E) P<item*>(T<item1>(E));
				M<item1>(E) M<item*>(T<item1>(E));
				U<item1>(E) U<item*>(T<item1>(E))

prefixes(id) = id
prefixes(name "." id) = name "." id prefixes(name)

C: expression > expression

C<value> = value
C<property> = C<label> = Nil


VALUE SPACE

Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		external names: $name
		the empty environment: Null
		the empty list: NIL
	sequences of values
	unevaluated expressions
	environments

DISCUSSION

How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  The (implicit) "outermost" environment binds
each id to the external name $id.

Nodes are delimited by brackets.  The contents of each node are implicitly
prefixed by Sub, which will generally be bound in the containing environment
to a quoted expression performing an environment transformation, and perhaps
supplying some properties and marks.

Parentheses are used to denote a sequence value.  to delimit the argument list of
an operator.  Square brackets are used to denote a new environment value, which
behaves much like a record.


Semantics of labels:

A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source."  The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets.  The label represents a set of directed arcs, one from each of
its sources to each of its destinations.  Multiple marks make the node a target of
in multiple labels.  A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

Put in contents if:				Put in environment if:
	effect is local to node			has scope
	is directly edited				is only indirectly edited
	is to be bound locally			needs delayed or global binding


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


			CONSCIOUSLY POSTPONED

Lambda expressions.

Sets of properties, etc. (Cf. Mitchell's Font example.)
	SET/LIST operators ($append $union ?)



				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.

Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
	resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
	distinguish syntactically between properties (marks) and labels.
	only the "main" id of a label is declarable.
	eliminate  as an id character.
	eliminate op ids from grammar.
	restructure the grammar for "functional" notation for operators.
	update semantic equations for new grammar, etc.
	fix treatment of unbound qualified names (now produce Nil).

Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
	restore $val.
	move quoting to rhs, allow quoted primaries without parentheses.
	allow an op to be the rhs of a definition.
	eliminate the functions operate, apply, eval by back substitution.
	change semantics of () to allow "record" construction without $env.

Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
	"It's OK to edit a node if you understand ALL of its (local) properties, and
		either don't remove any of them or also understand ALL properties
		of its parent."
	"Put in contents if:				Put in environment if: ..."
	Add connection syntax to syntactically rule out a+←'b.

Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
	Syntactically separate label references and name invocation.
	Put in distinct syntax in rhs for environment construction.
	Informal semantics of labels.
	( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.

Edited by Jim H. on 25 Aug. 1981 4:08 pm PDT (Tuesday).
	Add sequence as a nonterminal to the syntax.
	State the formal semantics of labels and properties.
	Reorder presentation (hopefully to improve readability).


*start*
14892 00024 USt
Date: 25 Aug. 1981 4:08 pm PDT (Tuesday)
From: Horning.pa
Subject: Current Level 0/1 Interdoc status/rev. 26
To: Mitchell, Horning

Edited by Jim H. on 25 Aug. 1981 4:08 pm PDT (Tuesday).
	Add sequence as a nonterminal to the syntax.
	State the formal semantics of labels and properties.
	Reorder presentation (hopefully to improve readability).

-------------------
Open questions:
	Use of ( ... ) vs. [ ... ] (especially in application).
	Sort out "records" vs. quoted bindings.
	Sets of properties, etc. (Cf. Mitchell's Font example.)
		SET/LIST operators ($append $union ?)
		semantics of $list
	Non-uniform semantics of quote.
		Subtle distinctions between quoted and unquoted ( ... )
		Semantics of quoted [ ... ] ?
	We should rethink our character assignments.
		check our characterset for disjointness with
			Interpress.DoubtfulChars.
		use of various bracket pairs.
		enlarge op with a few more single-character operators?
	Possible node operators (purely in semantic domain, not operators?).
		$properties: node, environment > sequence	-- All #'s
		$marks: node, environment > sequence		-- All !'s
		$references: node, environment > sequence	-- All @'s
		$contents: node, environment > sequence	-- The rest (fringe)
	Consider restricting $subscript just to sequences, not nodes.
	Note that $if is the only operator that requires multiple sequences (hence
		commas, different syntax for argument list and sequence).  Worth
		going back to treating it as a special case?


Not done:
	Consider style for use of temporary local definitions.

-------------------

We envision an Interdoc script being input and viewed in any manner
equivalent to the following:

Parse the script, repeatedly
- reducing each expression to its "dominant structure," containing only literals,
by replacing identifiers by the values to which they are bound in the current
environment, by applying operators, and by removing binding items,
- transforming the environment as indicated by the binding items, and
- recording the links indicated by label references and marks.

				BASIC INTERDOC

SYNTACTIC EXAMPLE:

{Book.example!		      -- Links to this from Book@ and Book.example@
ExampleParagraph				-- Invokes a definition
$UniqueMark12356#			-- Adds a property
a:='NOT[EQ[margins.left, 120]] margins.right←100 r=*[12.5, pt]
IF[a, margins.left+←5 margins.right←5, margins.left+←10]
<text for this node>
}

GRAMMAR

item		::= primary | binding | label
primary	::= literal | invocation | application | property | node | sequence
literal		::= Boolean | integer | hexint | real | string | external | op
external	::= "$" name
name		::= id ( "." id)*
id		::= letter ( letter | digit )*
op		::= "+" | "" | "*" | "/"
invocation	::= id | primary "." id
application	::= primary "[" item*  ( "," item* )* "]"
property	::= primary "#"
node		::= "{" item* "}"
sequence	::= "(" item* ")"
binding	::= name connection rhs
connection	::= bindingMode | op bindingMode | bindingMode "'"
bindingMode ::= "=" | ":" | ":=" | "←"
rhs		::= primary | "[" item* "]"
label		::= id ":!" | name "!" | name "@"

SEMANTICS

R: expression > environment > expression			-- Reduction
T: expression > environment > environment			-- Transformation
R&T<e>(E) denotes the pair  R<e>(E); T<e>(E)

R&T<literal>(E) = literal; E

R&T<id>(E) = R&T<valOf(id, E)>(E)

R&T<p "." id>(E) = R&T<valOf(id, R<p>(E))>(E)

R&T<p "[" arg1 "," ... "," argn "]">(E) =
    CASE R<p>(E) OF
	$if		=> if R<arg1>(E) then R&T<arg2>(E) else R&T<arg3>(E)
	"+"	 	=> R<arg1>(E) + ... + R<argn>(E); E
	...
	$val		=> R<arg1>(E); E
	$list		=> R&T<arg1>(E)
	$subscript	=> R&T<arg1>(E)[R<arg2>(E)]
						-- arg1: sequence | node, arg2.R: int
	$hide		=> "" ; E

R&T<p "#">(E) = R<p>(E) "#"; E

R&T<"{" item* "}">(E) = "{" R<"Sub" item*>([Null | "Outer" = E]) "}";
	      locVal("Outer", (T<"Sub" item*>([Null | "Outer" = E])))

R&T<"(" item* ")">(E) = "(" R<item*>(E) ")" ; E			-- List constructor

R&T<"[" item* "]">(E) = [T<item*>([Null | "Outer" = E]) | "Outer" = Null]; E
					-- Construct a "record" environment value

R&T<>(E) = ""; E

R&T<item1 item*>(E) = R<item1>(E) R<item*>(T<item1>(E));
			    T<item*>(T<item1>(E))

R&T<n m rhs>(E) = "" ; bind(n, m, R<rhs>(E), E)
	<n op m rhs> = <n m op "[" n "," rhs "]">		-- Syntactic sugar
	<n op m "(" arg* ")"> = <n m op "[" n "," arg* "]">	??

R&T<"'" p>(E) = p; E

R&T<"'(" item* ")">(E) = item*; E  ??

R&T<"'[" item* "]">(E) = ??

R&T<label>(E) = label; E


bindingOf(id, E) = locBinding(id, whereBound(id, E)) -- Gets innermost binding

valOf(id, E) = locVal(id, whereBound(id, E))		-- Gets innermost value

whereBound(id, E) =				-- Finds innermost binding
	locBinding(id, E) ~= None		=> E
	locBinding("Outer", E) ~= None	=> whereBound(id, locVal("Outer", E))
	True					=> Null

bind(id, m, val, E) =
	bindingOf(id, E) = "="	=> E			-- Can't rebind constants
	m = ":=" 			=> assign(id, val, E) -- Assign at right level
	True				=> [E | id m val]

bind(id "." n, m, val, E) = [E | id bindingOf(id, E) bind(n, m, val, R<id>(E))]

assign(id, val, E) =
	locBinding(id, E) = ":"	=> [E | id ":" val]
	bindingOf(id, E) = ":"	=>
				[E | "Outer" "=" bind(id, ":=", val, locVal("Outer", E))]
	True				=> E 			-- Can only assign to vars

SEMANTICS OF PROPERTIES, MARKS, REFERENCES (uses)

P: expression > environment > expression		-- Sequence of properties
M: expression > environment > expression		-- Sequence of marks
U: expression > environment > expression		-- Sequence of references
P&M&U<e>(E) denotes the triple  P<e>(E); M<e>(E); U<e>(E)

These functions all return the empty list, Nil, except as specified below:

P<p "#">(E) = R<p>(E)

M<name "!">(E) = prefixes(name)

U<name "@">(E) = prefixes(name)

P&M&U<invocation>(E) = P&M&U<R<invocation>(E)>(E)

P&M&U<"(" item* ")">(E) = P&M&U<item*>(E)

P&M&U<item1 item*>(E) = P<item1>(E) P<item*>(T<item1>(E));
				M<item1>(E) M<item*>(T<item1>(E));
				U<item1>(E) U<item*>(T<item1>(E))

prefixes(id) = id
prefixes(name "." id) = name "." id prefixes(name)

NOTATION FOR ENVIRONMENTS

Environments bind identifiers to expressions, in various modes ("=", ":", ":=", "←"):
	Null denotes the "empty" environment
	[E | id m e] means "E with id mode m bound to e"
	locBinding(id, E) denotes the binding mode of id in E
		locBinding(id, Null) = None
		locBinding(id, [E | id' m e]) =
			if id=id' then m else locBinding(id, E)
	locVal(id, E) denotes the value locally bound to id in E
		locVal(id, Null) = Nil = ""
		locVal(id, [E | id' m e]) = if id=id' then e else locVal(id, E)

VALUE SPACE

Expressions in an Interdoc script may denote
	literal values:
		Booleans: (F, T)
		integers: ... -3, -2, -1, 0, 1, 2, 3, ...
		reals: 1.2E5, . . .
		strings: <this is a string>
		labels: A123!, anId!, Paragraph.Example!
		external names: $name
		the empty environment: Null
		the empty list: NIL
	sequences of values
	unevaluated expressions
	environments

DISCUSSION

How semantics are associated with an entire document:

Each environment, E, initially contains only its "inherited" environment (bound
to the id Outer).  Most bindings take place directly in E.  However, the value
of a bind(id, ":=", val, E) will change E by rebinding id in the "innermost"
environment (following the chain of Outers) in which it is bound, if that
binding has the binding ":" (Var).  Identifiers bound with binding "=" (Const)
may not be rebound in inner environments.

When an id is referred to and locBinding(id, E)=None, then the value is sought
recursively in locVal("Outer").  The (implicit) "outermost" environment binds
each id to the external name $id.

Nodes are delimited by brackets.  The contents of each node are implicitly
prefixed by Sub, which will generally be bound in the containing environment
to a quoted expression performing an environment transformation, and perhaps
supplying some properties and marks.

Parentheses are used to denote a sequence value.  to delimit the argument list of
an operator.  Square brackets are used to denote a new environment value, which
behaves much like a record.


Semantics of labels:

A mark name! on a node makes that node a "target" of the label name (and its
prefixes); a reference name@ makes it a "source."  The "main" identifier of a
label must be declared (using name:!) at the root of a subtree containing all its
sources and targets.  The label represents a set of directed arcs, one from each of
its sources to each of its destinations.  Multiple marks make the node a target of
in multiple labels.  A unique label on a node places it in a singleton set, i.e.,
identifies it uniquely.


				OTHER NOTES

It should be clarified that the "view" of the dominant structure is ALWAYS
controlled by the properties of its nodes.  (E.g., text is not always there to be
"shown".)

Conservative rules for editor treatment of script subtrees created by other editors:
-It's OK to display a node if you understand at least one of its properties.
-It's OK to edit a node if you understand ALL of its (local) properties, and either
don't remove any of them or also understand ALL properties of its parent.
-It's OK to copy a node if that doesn't move any labels outside their scope, and
you understand ALL properties of its new parent.
-it's OK to delete a (subtree rooted at a) node if you understand ALL properties
of its parent.

The presentation of this material could be clarified by a table that relates
constructions in the notation to their intended uses and meanings.

Put in contents if:				Put in environment if:
	effect is local to node			has scope
	is directly edited				is only indirectly edited
	is to be bound locally			needs delayed or global binding


				STANDARD CARD

 WE ARE DESIGNING A STANDARD FOR INTERCHANGE, NOT EDITING.

 GENSYM IS AN EDITOR, NOT AN INTERCHANGE, FUNCTION.

 STANDARDIZE CONCEPTS, NOT NAMES.


				HISTORY LOG

 Bring the syntax up front.
 Further develop parallelism between grammar and semantic equations.
 Write semantic equations in terms of concrete syntax.
 Quote general expressions.
 V, E, C > R, T, E .
 [...] > <...> for quotation of script expressions.
 (E | id←e, m) > [E | id←e, m] for local binding.
 Introduce primary to disambiguate expression* , factor lhs from binding.
 Introduce Sub component to initialize nodes.
 Debug semantics of braces and dot.
 Mode > binding.
 Debug semantics of <id> (fix up indirection).
 Add VAL. 

Edited by Mitchell, 30 July 1981 9:21 pm PDT (Thursday):  Changed
grammar to allow more complete expression syntax; couldn't use "<" or ">" as
operators because they delimit strings.  Moved history log to end of message.

Edited by Mitchell, 31 July 1981 12:20 pm PDT (Friday)
Simplified expression syntax.  Expressions with embedded binary operators are
simply interpreted in a right-to-left fashion; e.g., x←a*b+c means x←a*(b+c). 
Fixed up semantic equations to reflect this.  Exchanged the use of {}s and ()s.

Edited by Mitchell, 7 Aug. 1981 4:40 pm PDT (Friday)
Fixed error in semantics when exchanging the use of {}s and ()s.

Edited by Horning 13 Aug. 1981 4:47 pm PDT (Thursday).
	E(id) > locVal(id, E) 	--Remove conflict with f(E).
	Outer > "Outer"
	Const > "="
	id lookup rule modified (R & T<id>)
	[E | id←e, m] > [E | id m e]
	"." as infix op
	expressions are evaluated left-to-right (except for binding operator)
	Reverse VAL/ENV default for parens.
	bindq > bind
	binding > bindingMode
	expand definition of apply inline
	default T<construct>(E) = E
	add comments to semantic equations

-------------------
R<>(E) = Nothing						-- The empty expression

							-- Expression sequence
R<e1 e*>(E) = R<e1>(E) R<e*>(T<e1>(E))			-- List insert
T<e1 e*>(E) = T<e*>(T<e1>(E))				-- Composition

R<literal>(E) = literal

R<id>(E) = if bindingOf(id, E)=None then id else R<valOf(id, E)>(E)
T<id>(E) = if bindingOf(id, E)=None then E else T<valOf(id, E)>(E)

R<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then R<e2*>(T<e1>(E)) else R<e3*>(T<e1>(E))
T<"IF(" e1 "," e2* "," e3* ")">(E) =
	if R<e1>(E) then T<e2*>(T<e1>(E)) else T<e3*>(T<e1>(E))

R<"NOT" p>(E) = if R<p>(E) then False else True

R<p1 op p2>(E) = 
	op = "." 	=> R<p2>([R<p1>(E) | "Outer" = E])
	op = "+"	=> R<p1>(E)+R<p2>(E)
	. . .

R<n m op e>(E) = Nothing						-- Empty list
T<n m e>(E) = bind(n, m, R<e>(E), E)
T<n m "'" e>(E) = bind(n, m, e, E)
T<n m op e>(E) = bind(n, m, R<n op e>(E), E)

R<"{" labels e* "}">(E) = "{" labels R<Sub e*>([Null | "Outer" = E]) "}"
T<"{" labels e* "}">(E) = locVal("Outer", (T<"ENV("Sub e*")">(E)))

R<"(" e* ")">(E) = R<e*>(E)

R<"ENV(" e* ")">(E) = [T<"ENV(" e* ")">(E) | "Outer" = Null]
T<"ENV(" e* ")">(E) = T<e*>([Null | "Outer" = E])

-------------------

Edited by Jim Horning 17 Aug. 1981 10:49 am PDT (Monday)
	R&T<>
	Nothing > ""

Edited by Jim H. on 17 Aug. 1981 4:58 pm PDT (Monday)
	Remove side-effects from all expressions.
	Parentheses purely for grouping (don't hide environment transformations).
	#label > label !
	labels within nodes

Edited by Jim H. on 19 Aug. 1981 9:52 am PDT (Wednesday).
	Rewrite <n m op e> as syntactic sugar.
	structured labels
	re-introduce apply function in R&T<p1 op p2>
	correct syntax for "."
	% for opening an environment (also replaces ENV?)

Edited by Jim H. on 19 Aug. 1981 6:55 pm PDT (Wednesday).
	Drop "%"; ENV() is now the only environment-constructing operator.
	Add SUB operator (first operand: sequence only, second: number only).
	Add atoms, as distinct from ids.
	Fix lhs op rhs syntax.

Edited by Jim H. on 20 Aug. 1981 5:29 pm PDT (Thursday).
	resolve pending questions as per message of 20 Aug. 1981 12:29 pm PDT.
	distinguish syntactically between properties (marks) and labels.
	only the "main" id of a label is declarable.
	eliminate  as an id character.
	eliminate op ids from grammar.
	restructure the grammar for "functional" notation for operators.
	update semantic equations for new grammar, etc.
	fix treatment of unbound qualified names (now produce Nil).

Edited by Jim H. on 21 Aug. 1981 6:58 pm PDT (Friday).
	restore $val.
	move quoting to rhs, allow quoted primaries without parentheses.
	allow an op to be the rhs of a definition.
	eliminate the functions operate, apply, eval by back substitution.
	change semantics of () to allow "record" construction without $env.

Edited by Jim H. on 24 Aug. 1981 6:08 pm PDT (Monday).
	"It's OK to edit a node if you understand ALL of its (local) properties, and
		either don't remove any of them or also understand ALL properties
		of its parent."
	"Put in contents if:				Put in environment if: ..."
	Add connection syntax to syntactically rule out a+←'b.

Edited by Jim H. on 25 Aug. 1981 11:33 am PDT (Tuesday).
	Syntactically separate label references and name invocation.
	Put in distinct syntax in rhs for environment construction.
	Informal semantics of labels.
	( ... ) > [ ... ] in applications; permitting ( ... ) as a primary.