RefText.mesa
Copyright © 1985 by Xerox Corporation. All rights reserved.
MBrown on September 16, 1983 2:52 pm
Paul Rovner on August 8, 1983 11:48 am
Russ Atkinson on January 29, 1985 6:57:40 pm PST
Doug Wyatt, February 26, 1985 3:26:40 pm PST
DIRECTORY
Basics USING [Comparison],
PrincOps USING [zLI1, zLI0, zBNDCK],
Rope USING [ActionType, Compare, Equal, Find, Map, Match, SkipOver, SkipTo, ROPE, Text];
RefText: CEDAR DEFINITIONS
IMPORTS Rope
= BEGIN
Introduction
This interface includes some simple procedures for REF TEXT. As much as possible the operations are parallel to the Rope interface's operations on ROPE.
When reading from a REF TEXT, the package treats NIL and "" equivalently (Length[NIL] = 0, Fetch[NIL, 0] raises BoundsFault). But Appending to a NIL REF TEXT raises PointerFault.
Only the New operation below allocates collectable storage. If you are doing large numbers of REF TEXT allocations, maybe you should be using ROPE.
In a "piece" defined by [s: REF TEXT, start: NAT, len: NAT], len is interpreted as follows:
IF start > s.length THEN BoundsFault ELSE len ← MIN[len, s.length-start].
The resulting len value is called the "effective len" below.
A boolean "case" parameter should be understood to mean "case significant".
If case, upper case characters are treated as distinct from lower case characters.
If ~case, upper case characters are converted to lower case before comparison.
PureText: TYPE = REF READONLY TEXT;
ROPE: TYPE = Rope.ROPE;
MaxLen: NAT = NAT.LAST;
Creating TEXT
New: PROC[nChars: NAT] RETURNS[REF TEXT];
Allocates a TEXT with length = 0, maxLength = nChars, and returns a REF to it.
Some programs allocate and discard TEXT frequently. To improve the performance of these programs, the RefText package manages a small pool of "scratch" TEXTs. The expected usage of this pool is for a client to get a scratch TEXT using the ObtainScratch procedure, manipulate this TEXT for awhile, and then return it to the pool using ReleaseScratch. A client who retains a REF to a scratch TEXT after releasing it is not playing by the rules (the same TEXT will surely be handed out to someone else), but this is still "safe" in the Cedar sense (the storage invariants are not compromised by the error).
Error: ERROR[ec: ErrorCode];
ErrorCode: TYPE = { clientModifiedReleasedText };
This error may be raised by ObtainScratch, below.
ObtainScratch: PROC[nChars: NAT] RETURNS[REF TEXT];
! Error [clientModifiedReleasedText]
Returns a REF to a TEXT from with length = 0, maxLength >= nChars. This TEXT is generally obtained from a pool of TEXTs.
A call to ObtainScratch is less expensive than New, but ObtainScratch should only be called with the expectation of calling ReleaseScratch (below) later on the resulting TEXT. (It is ok for a client to occasionally "forget" to release a TEXT obtained with this procedure, so for instance there is no need to call ReleaseScratch in UNWIND catch phrases unless UNWIND is expected most of the time!)
Raises Error [clientModifiedReleasedText] if the TEXT it wanted to return has been tampered with (in a detectable way) since it was released with ReleaseScratch; this indicates that some client is not playing by the rules.
ReleaseScratch: PROC[t: REF TEXT];
Caller asserts that it has no further interest in the TEXT pointed to by t.
Noop if t was not obtained from scratch pool.
line: NAT = 100;
page: NAT = 512;
A couple common arguments for ObtainScratch.
Writing TEXT
Append: PROC[to: REF TEXT, from: PureText, start: NAT ← 0, len: NAT ← MaxLen]
RETURNS
[REF TEXT];
! PointerFault (if to = NIL)
! BoundsFault (if to.length > to.maxLength or length of result text would exceed MaxLen)
AppendTextRope: PROC[to: REF TEXT, from: Rope.Text, start: NAT ← 0, len: NAT ← MaxLen]
RETURNS[REF TEXT] = INLINE {
! PointerFault (if to = NIL)
! BoundsFault (if to.length > to.maxLength or length of result text would exceed MaxLen)
RETURN[Append[to, TrustTextRopeAsText[from], start, len]];
};
AppendRope: PROC[to: REF TEXT, from: ROPE, start: INT ← 0, len: NAT ← MaxLen]
RETURNS
[REF TEXT];
! PointerFault (if to = NIL)
! BoundsFault (if to.length > to.maxLength or length of result text would exceed MaxLen)
AppendChar: PROC[to: REF TEXT, from: CHAR]
RETURNS
[REF TEXT];
! PointerFault (if to = NIL)
! BoundsFault (if to.length > to.maxLength or length of result text would exceed MaxLen)
InlineAppendChar: PROC[to: REF TEXT, from: CHAR]
RETURNS
[REF TEXT] = INLINE {
! PointerFault (if to = NIL)
! BoundsFault (if to.length > to.maxLength or length of result text would exceed MaxLen)
IF to.length >= to.maxLength THEN RETURN [AppendChar[to, from]];
to[to.length] ← from;
to.length ← to.length + 1;
RETURN [to];
};
ReserveChars: PROC[to: REF TEXT, nChars: NAT]
RETURNS
[REF TEXT];
! PointerFault (if to = NIL)
! BoundsFault (if length of result text would exceed MaxLen)
The client wishes to append nChars characters to the REF TEXT to without overflowing it.
The result text t satisfies (Compare[to, t] = $equal AND to.maxLength >= to.length + nChars).
InlineReserveChars: PROC[to: REF TEXT, nChars: NAT]
RETURNS
[REF TEXT] = INLINE {
! PointerFault (if to = NIL)
! BoundsFault (if length of result text would exceed MaxLen)
IF LOOPHOLE[to.maxLength, INTEGER]-LOOPHOLE[nChars, INTEGER] <
LOOPHOLE[to.length, INTEGER] THEN RETURN [ReserveChars[to, nChars]];
RETURN [to];
};
Reading TEXT
Compare: PROC[s1, s2: PureText, case: BOOLTRUE] RETURNS[Basics.Comparison] = INLINE {
returns lexicographic comparison of the REF TEXT contents
case => case of characters is significant
RETURN [Rope.Compare[TrustTextAsRope[s1], TrustTextAsRope[s2], case]];
};
Equal: PROC[s1, s2: PureText, case: BOOLTRUE] RETURNS[BOOL] = INLINE {
returns s1 = s2 (true iff s1 and s2 contain same sequence of characters, modulo the case parameter)
RETURN [Rope.Equal[TrustTextAsRope[s1], TrustTextAsRope[s2], case]];
};
Fetch: PROC[base: PureText, index: NAT] RETURNS[CHAR] = INLINE {
! BoundsFault (if base = NIL or index > base.length)
fetches indexed character from given REF TEXT.
IF base = NIL OR index > base.length THEN BoundsFault[];
RETURN [base[index]];
};
Find: PROC[s1, s2: PureText, pos1: NAT ← 0, case: BOOLTRUE] RETURNS[INTEGER] = INLINE {
returns position in s1 where s2 occurs (starts looking at pos1)
does NOT do *-matching (use Match below for this)
returns -1 if not found (including pos1 >= Length[s1])
RETURN [Rope.Find[TrustTextAsRope[s1], TrustTextAsRope[s2], pos1, case]];
};
Length: PROC[base: PureText] RETURNS[NAT] = INLINE {
returns the length of the REF TEXT (0 if NIL).
RETURN [IF base = NIL THEN 0 ELSE base.length];
};
ActionType: TYPE = PROC[CHAR] RETURNS[BOOL];
Type of procedure applied to each character by Map.
Map: PROC[s: PureText, start: NAT ← 0, len: NAT ← MaxLen, action: ActionType]
RETURNS
[quit: BOOL] = INLINE {
! BoundsFault (if start > s.length)
Applies the action to each char in the given piece of s, in ascending order, until action[char] = TRUE or no more chars.
Returns TRUE iff stopped by action[char] = TRUE.
RETURN [Rope.Map[TrustTextAsRope[s], start, len, action]];
};
Match: PROC[pattern, object: PureText, case: BOOLTRUE] RETURNS[BOOL] = INLINE {
Returns TRUE iff object matches the pattern, where the pattern may contain * to indicate that 0 or more characters will match.
If case is true, then case matters.
RETURN [Rope.Match[TrustTextAsRope[pattern], TrustTextAsRope[object], case]];
};
SkipTo: PROC[s: PureText, pos: NAT ← 0, skip: PureText] RETURNS[NAT] = INLINE {
Examine s[pos .. s.length), and return the lowest index in this range such that s[i] is contained in the "skip" string. If no such character exists, return s.length.
RETURN [Rope.SkipTo[TrustTextAsRope[s], pos, TrustTextAsRope[skip]]];
};
SkipOver: PROC[s: PureText, pos: NAT ← 0, skip: PureText] RETURNS[NAT] = INLINE {
Examine s[pos .. s.length), and return the lowest index in this range such that s[i] is NOT contained in the skip string. If no such character exists, return s.length.
RETURN [Rope.SkipOver[TrustTextAsRope[s], pos, TrustTextAsRope[skip]]];
};
Miscellaneous
TrustTextAsRope: PROC[text: PureText] RETURNS[Rope.Text] = TRUSTED INLINE {
It is sometimes OK to treat a REF TEXT as a ROPE, provided that
1. You do NOT alter the text while there is a ROPE reference to the object
2. You do not care about the runtime type of the object
RETURN [LOOPHOLE[text]];
};
TrustTextRopeAsText: PROC[rope: Rope.Text] RETURNS[PureText] = TRUSTED INLINE {
It is sometimes OK to treat a Rope.Text as a REF TEXT.
You must promise not to modify it!
RETURN [LOOPHOLE[rope]];
};
BoundsFault: PROC = TRUSTED MACHINE CODE {
Raises ERROR RuntimeError.BoundsFault.
PrincOps.zLI1; PrincOps.zLI0; PrincOps.zBNDCK;
};
END.