§2.1 The nature of documents
Documents facilitate a broad range of human activities by organizing and preserving information for future use. As physical encodings of information, they are cultural artifacts, so that their use is as much a function of their physical properties as their information content. They have traditionally been realized as marks on paper, but increasingly, computational technology has been altering their ªlook and feel.º
Is the traditional notion of a document giving way to a new conception? Which aspects of documents are changing in response to the new technology, and which remain unaltered? The answer to these questions is to be found in understanding the role and place of information in document processing.
f Encoding of information
Information must be represented or encoded in a physical substrate; it must be embodied. These encodings vary along three important dimensions: their perceptibility, permanence, and accessibility.
Perceptibility: Some encodings are directly available to human perception, while others are not. An encoding of information that can be directly perceived by humans might be called an external encoding, or a presentation. Examples are speech (an acoustic encoding) and marks on paper. Encodings not directly perceptible (internal encodings) include computer files and human thoughts (which are unavailable to all but the thinker).
Permanence: Orthogonal to perceptibility is the durability of an encoding, which is determined by the characteristics of the physical substrate. Paper can last several hundred years, while vellum can last a thousand or more. Speech, at the other extreme, is evanescent. A substrate capable of preserving an encoding can be thought of as storage medium.
Accessibility: As is well known in computer science, every encoding scheme biases the form and speed of access. This is equally true for perceptible and non"perceptible encodings, for permanent and impermanent ones.
f Transformations of information
To this essentially static account should be added a dynamic notion, that of the transformation of information. A transformation is a process that operates on one encoding to produce another. Both encodings may be embodied in the same type of substrate, but some of the more interesting cases are those in which the substrates are different. Of particular interest are those cases in which an internal encoding is transformed into an external one (the case of access, for example), or the inverse (in document capture).
f Documents as encodings of information
Viewed in this light, the traditional paper document can be seen to have a single encoding, which is both external and (relatively) permanent. The paper substrate constitutes the storage medium. This fact would have remained unremarkable, were it not for the changes brought by computational technology. For with computers came the ability $ in fact the need $ to distinguish two classes of document encodings, external and internal, and the need to transform each into the other.
As a result, the external encoding (e.g. marks on paper) is coming to be viewed as transient and subordinate to the internal encodings (e.g. computer files), which are viewed as permanent [O. North, 1986] (even though the former may in fact have the same absolute permanence as the latter). There are two important consequences of this split. First, it becomes possible to generate multiple external encodings (not just different layouts of the same information, but encodings in different media $ speech as well as text) from the same external encoding. Second, while internal encodings are not directly accessible to humans, certain forms are directly accessible to computers, so document processing is no longer inextricably wedded to the external encoding, nor to processing by humans exclusively.
f Constancy and change
To return to the earlier question, what has changed and what has not? Documents, it seems clear, are still (relatively) permanent, external encodings of information. But these are no longer the ªwhole truth,º for in the computational medium they are created by transformation of an internal encoding. The current emphasis on document recognition is exactly the concern for performing the reverse transformation, a need created by the splitting into two encodings, one ªperceptible,º the other ªpermanent.º
Consider the tradeoffs between paper and electronic documents. Just as paper is the dominant medium where it has advantages, electronic forms of documents will predominate where they have advantages.
f Paper documents
In the recent past, document processing was performed by people working with paper documents. This mode of processing is still the way that most people handle documents. The plain paper segment of this medium has been the basis for Xerox's success.
As a document medium, paper has many advantages over alternative media. Paper is relatively cheap (especially in small quantities), is quite portable, relatively permanent, requires no special equipment to read (or to write in small quantities), has a large base of trained users, and has widely accepted standards for written language. Other media (e.g. voice) also have an independent place in the current culture.
f Electronic documents
Electronic forms can combine elements of many media: visual and auditory, static and dynamic, words and images. Electronic documents currently have a substantial cost advantage in the large. But the primary advantages of electronic documents have to do with processing. With the advent of computers it has become far easier and cheaper to modify documents, transmit them from place to place, search for information, extract and merge information, and perform filing and retrieval operations.
Electronic documents still have disadvantages, and the future of document processing will depend on how some of those disadvantages are overcome. Electronic documents currently require a high investment in equipment relative to paper, they are not yet as permanent, they have less social and legal acceptance, their use requires extra training, and embryonic standards limit interchange among disparate systems.
It is inevitable that electronic and paper documents will coexist for quite some time, since each medium has advantages. In fact, the introduction of electronic document processing is reducing the cost of producing paper documents, thereby extending the advantage of paper in some areas. There will be substantial benefit, and substantial business, in converting documents among media.
f Documents vs. data
How will document processing differ from data processing? The two come from different roots, and have differed quite significantly in the past. While they will continue to differ in emphasis, it seems likely that their roots will become more heavily intertwined, and, in particular, that document processing systems will need to interface to data processing systems. As Paul Allaire noted in his remarks to The Research Board in March, 1987, ª ... the data processing world and the document world must and will be brought together. For corporate investments in information processing to become fully productive, these systems must be able to operate on all the information used by the corporation. And since most of that information is in the form of documents, this means connecting the document base to the MIS [ data processing ] environment.º He also observed that the technology to make this possible exists today, and that products are beginning to appear.
The revolution that is taking place in document processing is based on changing technology. It is now vastly easier to transfer information from one medium to another, and to use new media. To understand the future of document processing it is important to consider not only the information encoded in documents, but also their physical media and the social contexts of their use.
§2.2 Visions of document processing in the future
§2.2.1 The paper metaphor and beyond
Our vision begins with the familiar ªpaper metaphor.º Paper will remain a primary means for presenting documents to human readers, and since electronic reprographic products will displace conventional reprographic products, electronic reprographics is a business that Xerox must be in.
Beyond paper our vision expands the notion of document to include forms of information and techniques of information presentation that do not have paper analogs, and it encompasses more capable systems for processing documents and the information that they contain.
f Capture and creation
Information will come from an increasing variety of sources. Simple capture of keystroked text and mouse motions will be supplemented by scanned text and images, by communications with other document and data processing systems, and by voice, audio, and video input.
f Transformation and modification
Documents will be edited and modified by both people and machines. We expect increased technical assistance in such areas as version history and control, annotations, and coordination of changes in documents with several authors.
f Understanding
Documents will be increasingly understood by machines. This will involve understanding their social context and purposes, as well as the information that they express explicitly. Machines will become capable of automatic translation, intelligent retrieval, extraction and reformulation of document content, and recognition of human"oriented forms of documents.
This will enable a proliferation of intelligent applications packages, for example, to assist in decision making, to streamline authorization procedures, and to improve financial accounting and control.
f Storage and retrieval
Documents will be stored in document bases whose size will dwarf today's data bases. Users will be able to access these document bases using loose descriptions of desired information. Documents will contain machine"understood references to other documents, so that users can navigate around in a web of interconnected documents and inquire how they interrelate.
Multiple storage media will be used, since conversion of all information in an office to any single medium is unlikely. Paper, microfilm, magnetic tape, and optical and magnetic disk storage will all be used. Information on all these media will be coordinated through a single coherent retrieval system.
f Dissemination
Much more information will be available online, both locally and through communications links.
There will be on"demand production of abstracted, annotated, or specially"formatted versions of documents at the point of use.
Automatic routing of documents will become commonplace. For example, a document might be routed to a person based on his own interest profile or as required by office work flow procedures.
Workstations will be linked with the telephone system to allow direct access to and from virtually everywhere. Workstations will incorporate multinational character sets, so that documents can be produced in many languages. Much of the day's mail may be heard in the car on the way to work, via a radio telephone link.
f Other features
Color will be commonplace, both in electronic and paper forms of documents.
Documents will be active, and include such computations as simulations, spreadsheets, animations, and data base queries. Their activity may be triggered by changes in the document base, the passage of time, or other external events.
Electronic documents will become legal entities. Digital signatures will be used for authentication and authorization.
Expanded uses of alternative document forms will change the way we use paper. In many cases documents will be generated, serve their purpose, and be destroyed or archived, without ever existing in paper form. On the other hand, inexpensive printers will greatly increase our ability to produce paper when paper is the most suitable medium.
§2.2.2 Competitive environment and influence on Xerox
We expect a continued high level of competition. There will be no ªsafe'' markets for Xerox; even high volume printing will be competitive. This means that Xerox will have to pay great attention to the requirements of its customers. Many customers will simply want present"day functionality at a much lower price. Others will demand faster systems with broader ranges of applications and functionality. Neither group will have much patience with unreliability or complex user interfaces.
There will be an increasing premium on compatibility with external standards.
There will be rapid changes in technology, and therefore rapid changes in customer requirements. We view technology as the enabler of change in document processing. Technology alters the way we do business since it introduces new methods, and alters the relative costs of old methods. Special attention to new requirements will be essential, since old assumptions will be frequently wrong. This places a premium on flexibility, and will force upward compatibility for archival documents. It will be essential to have short development cycles.
We believe, although we have not substantiated the belief in this report, that profits will come from the value added through better systems integration and better understanding of customer requirements.
§2.2.3 Reflections on the future
It is critical that Xerox develop an understanding of the fundamental parameters of document processing so as to move flexibly in this changing world. Basic research can help develop this understanding.
§2.3 The technological base for our vision
This section lists specific research areas that focus on document processing, including both systems issues and components; it is a statement of the application's requirements rather than a list of specific projects.
f Document system architecture and system design
The technical issues of document processing systems are not dramatically different from those of other large distributed interactive computer based systems. A capable document processing system will support the user through the integration of its individual components / capabilities. It must be modular so that customers can choose subsets of features and extensible so that new object types can be added to the system. The architecture must be open and scalable so that it can connect with a heterogeneous and rapidly changing environment of hardware, software, protocols, standards and languages. The system should also support groups of people working together on a given task in real time.
f Document base
There is general agreement that any document system must support storage and retrieval of the large amounts of heterogeneous and heavily linked data that constitute our notion of documents. The system design/architecture must be able to handle very large objects (digital forms of image, voice and video). Electronic storage will be organized in a performance hierarchy with buffered, immediate, and archival levels. Data will migrate automatically among these levels to improve both reliability and access time.
This document base should support massive amounts of hierarchical file storage, with distribution and migration of information across the internet. It will require DBMS retrieval capabilities, extendible to allow retrieval by content and use of the document, and should support autonomous activity by servers that help users achieve standing goals. As described in much more detail in the Spinrad report on document systems, documents must be active and polystructured, with links from a document to its history, and to other (pieces of) documents. Documents should support nontextual information: images, voice annotations, and video segments.
f Communications
A document processing system must support the communication of documents between system elements and users. A document system requires high performance, high bandwidth point to point connectivity with a network infrastructure supporting the distribution of documents: e. g., authentication and access control, and gateways to alternate communication architectures.
The system should also include standard, easily used mechanisms for supporting document"related communications: electronic mail supporting forms and multiple body types, easily processed language (like Courier or more functional) for describing and implementing communications, and ''smart'' routing of documents between people (by role of the intended receiver as well as by interest in content of the document).
f Document transformation processing
Much of the added value of a document processing system over file storage lies in its ability to carry out or assist transformations of the data form and content. It should be able to hide the boundary between inter"document and intra"document structure, and support compression for storage and retrieval. It might be able to index a document by content and attributes, reason about this content, and use its analysis to create alternate, derived representations for different purposes. Servers on the net will allow intelligent processing that otherwise will be too expensive to implement in individual workstations.
f Human interface
Both the richness and the evolution of a document processing system will be most evident in its support for a variety of representations for the system's data and control mechanisms. An effective document processing system must include sophisticated input and output support for hardcopy, voice and video. Users and developers should be able to extend and customize the interface for specific applications, and for level of expertise of the user.
f Social and anthropological issues
To support its users, a document system must incorporate knowledge of how documents are used and processed, both within the system and by its human users. This requires an understanding of current human document processing work flow and use of models that include users and tasks. Tools must support and enhance productivity and understandability of the activities.