Daniel C. Swinehart
Computer Science Laboratory
Xerox Palo Alto Research Center
Etherphones have been designed to support the ready implementation of additional applications. Most enhanced telephone options and other voice services can be entirely implemented in the workstation environment. Those that prove particularly useful may migrate to a server in order to increase their availability from stand-alone telephones and from a wide range of operating system environments and workstation types. For server-based functions, permanent user-specified options can be stored in a database. These methods form a flexible basis for experimentation with voice in an integrated personal information environment.
1. Introduction
The Etherphone system is experimental facility that has been developed at the Xerox Palo Alto Research Center. Its objective is to explore methods for extending existing electronic office environments with the facilities needed to handle the digital transmission, storage, and manipulation of voice. The system has been designed to permit the exploration of a wide range of telephony, voice mail, voice annotation, and other multi-media document applications, while preserving simple conventional telephone behavior as a subset of its capabilities.
From the beginning, although the project has required the construction of a complete system, our emphasis has been not on the particular choices of switching or voice transmission methods, terminal equipment, or network architectures, but on functionality. More specifically, it has been our aim to provide a flexible environment for novel applications, fully-integrated into the office workstation environment, across a wide range of voice applications: telephony, recorded voice and music, synthesized voice and music. Our overall approach should extend to include speech transcription and the control of video recording, switching, and transmission, although we have not explored these areas.
Finally, our intent was to produce not only applications but also programmable facilities that would allow other developers to include sophisticated voice functions in their own applicaitons, without having to redevelop all of the underlying facilities.
This is an area of active research. We should mention some of the more relevant activities here. Many others have investigated the challenges of transmitting voice as datagrams on local area networks [4, 1, 5, 24]. Some notable approaches to high-level methods for programming telephone and voice functions include work by Ruiz [13], Richards et al [13], DeTreville [3], and Herman et al [6]. Advanced applications of recorded voice and document annotation have been extensively reported [8, 23, 11, 25]. A few products containing advanced telephone management and integrated voice capabilities are beginning to appear [8, 9, 10]. The Speech Research Group at MIT's Media Laboratory have produced remarkable results in their explorations of intelligent voice-directed interfaces to telephones, answering machines, and office system in general. [15, 16]. However, only recently have other broad-based experimental testbeds for office voice applications begun to emerge, most notably the Mice system [6] and the Island project [1].
Earlier publications on the Etherphone system have emphasized the system objectives, the overall hardware and software organization [19, 21], the specific software methodologies used to handle recorded voice [22], and the user interface designs that support voice annotation and editing [2] and scripted documents [29]. In this report, our intent is to demonstrate how the Etherphone system can be used to improve the nature of the telephone in the office setting. The emphasis is primarily on telephone control and management functions, referring to the annotation and editing capabilities only as they relate to these functions. The exposition now continues with the presentation of a series of typical office situations, and the way the Etherphone system deals with them. A brief implementation section outlines the general approach to development and execution of these applications.
References
[1] S. Ades. An Architecture for Integrated Services on the Local Area Network. Ph.D. Thesis, Cambridge University, February 1987.
[2] S. Ades and D. C. Swinehart. "Voice annotation and editing in a workstation environment," Proceedings AVIOS Voice Applications '86, September 1986, pages 13-28.
[3] J. DeTreville. Phoan: "An Intelligent System for Distributed Control Synthesis." Publication information as yet unknown.
[4] J. DeTreville and W. David Sincoskie. "A Distributed Experimental Communications System." IEEE Journal on Selected Areas in Communications SAC-1(6):1070-1075, December 1983.
[5] T. A. Gonsalves. "Packet-Voice Communication on an Ethernet Local Computer Network: an Experimental Study." Proceedings SIGCOMM 1983 Symposium on Communications Architectures and Protocols, Austin TX, March 1983, pages 178-185.
[6] Herman, G., Ordun, M., Riley, C., and Woodbury, L. "The Modular Integrated Communications Environment (MICE): a system for prototyping and evaluating communications services." Bell Communications Research, 435 South Street, Morristown NJ 07960. To appear.
[7] K. Lantz. "An Experiment in Integrated Multimedia Conferencing." Proceedings Conference on Computer-Supported Cooperative Work, Austin, TX, December, 1986.
[8] R. Nicholson. "Integrating voice in the office world." BYTE 8(12):177-184, December 1983.
[9] [A reference to Northern Telecom's Meridian telephone is intended here.]
[10] [A reference to a Panasonic multi-featured telephone is intended here.]
[11] J. K. Reynolds, J. B. Postel, A. R. Katz, G. G. Finn, and A. L. DeSchon. "The DARPA experimental multimedia mail system." Computer 18(10):82-89, October 1985.
[12] J. T. Richards, S. J. Boies, and J. D. Gould. "Rapid Prototyping and System Development: Examination of an Interface Toolkit for Voice and Telephony Applications." Proceedings CHI'86 Conference on Human Factors in Computing Systems, Boston, Mass., April 1986, pages 216-220.
[13] Ruiz, A. Voice and telephony applications for the office workstation. Proc. 1st International Conference on Computer Workstations, San Jose, CA, November 1985, 158-163.
[14] S. K. Sarin. Interactive On-line Conferences. Ph.D. thesis, Massachusetts Institute of Technology, Report MIT/LCS/TR-330.
[15] Schmandt, C. and Arons, B. Phone Slave: A Graphical Telecommunications Interface. Proc. Society for Information Display 1984 International Symposium, June 1984.
[16] C. Schmandt and B. Arons. "Voice Interaction in an Integrated Office and Telecommunications Environment." Proceedings 1985 Conference of American Voice Input/Output Society, October 1985.
[17] M. Stefik, G. Foster, D. Bobrow, K. Kahn, S. Lanning, and S. Suchman. "Beyond the Chalkboard: Computer Support for Collaboration and Problem Solving in Meetings." Comm. ACM 30(1):32-47, January 1987.
[18] [A reference to a paper by David Stodolsky on conference call floor control is intended here. It is this work that inspired the floor control notion in this paper.]
[19] D. C. Swinehart, L. C. Stewart, and S. M. Ornstein. "Adding voice to an office computer network."
Proceedings IEEE GlobeCom '83, November 1983. Also available as Xerox Palo Alto Research Center, Technical Report CSL-83-8, February 1984.
[20] D. C. Swinehart, P. T. Zellweger, R. J. Beach, and R. B. Hagmann. "A structural view of the Cedar programming environment." ACM Transactions on Programming Languages and Systems 8(4):419-490, October 1986.
[21] D. C. Swinehart, D. B. Terry, and P. T. Zellweger. "An experimental environment for voice system development." IEEE Office Knowledge Engineering Newsletter, February 1987.
[22] D. B. Terry, D. C. Swinehart. "Managing Voice Stored Voice in the Etherphone System." to appear.
[23] R. H. Thomas, H. C. Forsdick, T. R. Crowley, R. W. Schaaf, R. S. Tomlinsin, V. M. Travers, and G. G. Robertson. "Diamond: A multimedia message system built on a distributed architecture." Computer 18(12):65-78, December 1985.
[24] F. A. Tobagi and N. Gonzalez-Cowley. "On CSMA-CD Networks and Voice Communication." Proceedings Infocom, 1982, pages 122-1278.
[25] H. Wilder and N. Maxemchuk. "Virtual Editing II: the User Interface," Proceedings of the SIGOA Conference on Office Automation Systems, Philadelphia, Penn. 1982.
[26] Xerox Corporation, Intel Corporation, Digital Equipment Corporation. The Ethernet: A Local Area Network; Data Link Layer and Physical Layer Specifications. Version 2.0, November 82.
[27] Xerox Corporation. The Dorado: A high-performance personal computer—Three papers. Xerox Palo Alto Research Center, Report No. CSL-81-1, January 1981.
[28] [A reference describing the Xerox 6085 Professional workstation is intended here.]
[29] Zellweger, P. Scripted Documents. To appear.