[Indigo]<Yggdrasil>Documentation>YggdrasilDealer5Oct88.slides!1

Dealer: Yggdrasil Status

October 5, 1988

Yggdrasil

A large scale hypertext database system

Bob Hagmann

Introduction

Database effort in CSL

research

support our filing needs

basis for retrieval experiments

store images, programs, text, audio, mail, ...

document processing

... and System 33

System 33

three months

scanning

document recognition

storage

retrieval

conversions

protocols

printing ?

Yggdrasil

years

"client server" database

so good that all the bits end up here

Project Name

Ygg|dra|sil n. Also Yg|dra|sil. Norse Mythology. The great ash tree that holds together earth, heaven, and hell by its roots and branches. [Old Norse, probably ``the horse of Yggr'': Yggr, name of Odin, from yggr, variant of uggr, frightful (see ugly) + drasill, horse.]

Alternative Name: Hypertext

Hy|per|text n. (pronounced as if spelled Rtext) Hypertext without the hype.

Historical Perspective on Stored Information (European view)

Dark ages (guilds, oral tradition, bards, manuscripts)

Moors fall => Arabic libraries into Christian hands

Monks translate Arabic to Latin

Constantinople falls => Renaissance.

Printing press

Public libraries and public education

Large publishing industry

Computers

boxes of cards

file systems

access methods for files

hierarchical and network databases

relational databases

???

Information Storage Needs

``Store the bits and get out of my way''

Major execution on the server not a requirement

Three principal sources of needs

Distributed Notecards in SSL

Large capacity and high performance file server

Software storage for programming environments

Others

Voice project in CSL

Storage of scanned images

Mail storage

Library support

Support evolving standards for file systems and document retrieval

...

Technology Change

Optical disks

Optical disk jukeboxes

High capacity magnetic disks

Decreasing cost of main memory

Fast commercial microprocessors/workstations

High capacity optical/magnetic tapes

Scanners

FDDI communications

Fax

Electronic printers

CD ROM

Information services

=> deal with change and scale

Project

Build a large scale hypertext database server

No user interface -- this is a database

SUN, Dragon, and beyond

Ethernet or FDDI

Lots of memory (100's of megabytes)

Lots of MIPS (10-100's)

Modest number of processors (1-16)

Lots of magnetic disk (10's of gigabytes)

Lots of optical disk (a terabyte)

TCP/IP and XNS

Written in Cedar

Mach and Camelot

Mach

Rick Rashid's project at CMU

UNIX compatible

Lightweight processes, message based, multiprocessor, external pagers

Camelot

Transaction facility on top of Mach

Logging, commit message protocols, recoverable storage management, media recovery, backup, name service, ...

Might be throw away

Mach, Camelot, and Cedar

Cedarboot runs under Mach!

... but debugging, PCR, communications, threads, fd's

Six Key Ideas

Objects (≈ documents)

Typed links

Objects have properties (≈ attributes)

Containers group objects

Indices automatically built on properties

Documents can be named
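The six ideas above can be sketched as a small in-memory model. This is only an illustration, not the Yggdrasil design: the names `Obj`, `Container`, `lookup`, and `by_name` are hypothetical, links are modeled as simple (type, target) pairs, and property values are assumed to be hashable.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Obj:
    """An object (roughly, a document): opaque contents, properties, typed links."""
    oid: int
    contents: bytes = b""
    properties: dict = field(default_factory=dict)
    links: list = field(default_factory=list)  # (link_type, target_oid) pairs


class Container:
    """Groups objects and indexes every property value as objects are added."""

    def __init__(self):
        self.objects = {}              # oid -> Obj
        self.names = {}                # name -> oid
        self.index = defaultdict(set)  # (property, value) -> set of oids

    def add(self, obj, name=None):
        self.objects[obj.oid] = obj
        if name is not None:
            self.names[name] = obj.oid
        # The index is maintained automatically; the caller never builds it.
        for key, value in obj.properties.items():
            self.index[(key, value)].add(obj.oid)

    def link(self, source_oid, link_type, target_oid):
        self.objects[source_oid].links.append((link_type, target_oid))

    def lookup(self, key, value):
        """Return the oids whose property `key` equals `value`, using the index."""
        return set(self.index.get((key, value), set()))

    def by_name(self, name):
        return self.objects[self.names[name]]


box = Container()
box.add(Obj(1, b"slides", {"author": "Hagmann", "kind": "text"}), name="dealer-talk")
box.add(Obj(2, b"scan", {"author": "Hagmann", "kind": "image"}))
box.link(1, "illustrated-by", 2)
```

In this sketch the server never interprets `contents`; only properties, links, and names are visible to it.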

System Summary

Server architecture

Large number of documents of vastly varying sizes

Document ``types'' (extensible) - few interpreted at server

Hypertext: documents can be connected via links

Documents can be named

Documents can have attributes and keywords

Documents are grouped into contexts called containers

Keyword and other indices maintained per container

Versions and alternatives

Data compression and decompression

On-line archival storage

Alerters (send a message when an event occurs)

Page level access, access control, transactions, robust, performance, recovery, and availability

Hooks for multi-server and foreign server support
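As a rough illustration of the alerter item in the list above (send a message when an event occurs), here is a minimal sketch. The class `AlerterRegistry`, the event name `document-changed`, and the callback signature are assumptions made for the example; the slides do not specify how alerters are registered or delivered.

```python
class AlerterRegistry:
    """Keeps, per event name, the callbacks to notify when that event occurs."""

    def __init__(self):
        self._alerters = {}  # event name -> list of callbacks

    def register(self, event, callback):
        self._alerters.setdefault(event, []).append(callback)

    def signal(self, event, detail):
        """Deliver `detail` to every alerter registered for `event`.

        Returns the number of alerters that were notified.
        """
        callbacks = self._alerters.get(event, [])
        for callback in callbacks:
            callback(event, detail)
        return len(callbacks)


inbox = []  # stands in for a client's message queue
registry = AlerterRegistry()
registry.register("document-changed",
                  lambda event, detail: inbox.append((event, detail)))
delivered = registry.signal("document-changed", {"name": "dealer-talk"})
```

A real server would send the message over the network rather than call a local function; the registry shape is the only point being illustrated.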

Not an OODBMS

``Store the bits and get out of my way''

Major execution on the server not a requirement

Full blown execution is hard

Performance

Locking

Security

Execution model

Match of execution model to programming model

Query optimization

Looping

...

Simple execution is doable

Leave It To The Client

Find the set of objects of interest non-navigationally

Let the client further filter objects

Let the client build appropriate data structures for the current problem

Add hooks to the database

Alerters (e.g., be informed when something changes)

Type system
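The division of labor above might look like the following sketch: the server answers an index lookup with a set of objects, and the client does any further filtering and builds whatever structure suits its problem. The data, the function names `server_find`, `server_fetch`, and `client_query`, and the size threshold are all hypothetical.

```python
# Hypothetical server-side state: a keyword index and the stored documents.
KEYWORD_INDEX = {
    "hypertext": {1, 2, 3},
    "optical": {3, 4},
}
DOCUMENTS = {
    1: {"title": "Notecards notes", "bytes": 4_000},
    2: {"title": "Yggdrasil design", "bytes": 90_000},
    3: {"title": "Jukebox survey", "bytes": 250_000},
    4: {"title": "Disk prices", "bytes": 1_200},
}


def server_find(keyword):
    """Server side: a non-navigational index lookup returning document ids."""
    return set(KEYWORD_INDEX.get(keyword, set()))


def server_fetch(doc_id):
    """Server side: hand back the stored document without interpreting it."""
    return DOCUMENTS[doc_id]


def client_query(keyword, predicate):
    """Client side: fetch the candidate set, then filter and order it locally."""
    candidates = server_find(keyword)
    docs = {doc_id: server_fetch(doc_id) for doc_id in candidates}
    return sorted(doc_id for doc_id, doc in docs.items() if predicate(doc))


# The size test runs on the client; the server only did the keyword lookup.
large_hypertext = client_query("hypertext", lambda doc: doc["bytes"] > 50_000)
```

The server needs no execution model for the predicate, which is the point of leaving it to the client.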

Yggdrasil and IFS Comparison

topic                             IFS        Yggdrasil            ratio
size                              1 Gbyte    1 Tbyte              1000
CPU                               .2 MIPS    8 MIPS               40
memory                            128 KB     128 MB               1000
net read bandwidth (Kbytes/sec)   28         1000                 35
latency                           50 msec    3-30 msec - 1 min    16 - 20000
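The ratio column can be checked with a few lines of arithmetic. Decimal units are assumed, matching the slide's rounding. The reading of the latency range (50 msec against 3 msec at one end, 1 minute against 3 msec at the other) is an assumption inferred from the numbers, not something the slide states.

```python
KB, MB, GB, TB = 10**3, 10**6, 10**9, 10**12  # decimal units (assumption)

size_ratio = (1 * TB) / (1 * GB)          # 1 Tbyte vs 1 Gbyte
cpu_ratio = 8 / 0.2                       # 8 MIPS vs .2 MIPS
memory_ratio = (128 * MB) / (128 * KB)    # 128 MB vs 128 KB
bandwidth_ratio = 1000 / 28               # Kbytes/sec; the slide truncates to 35

# Assumed reading of the latency row, in milliseconds.
latency_low = 50 / 3                      # IFS 50 msec vs best case 3 msec
latency_high = 60_000 / 3                 # 1 minute vs best case 3 msec

for label, value in [("size", size_ratio), ("CPU", cpu_ratio),
                     ("memory", memory_ratio), ("bandwidth", bandwidth_ratio),
                     ("latency low", latency_low), ("latency high", latency_high)]:
    print(f"{label}: {value:.1f}")
```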

Yggdrasil Phases

build hypertext, naming, indexing, and containers (mostly)

use Camelot/Mach

skip systems issues: performance, recovery, availability, access control, alerters, archival storage, and data compression

postpone versions and alternatives

archival storage

versions and alternatives

alerters and better locking

availability, access control, and data compression

Yggdrasil Status

High level design done

Wildly coding version 0

BJ is minding the Mach/Camelot store

Brian Oki shows up in a few weeks

RR says we still have a slot

Optical disk jukebox next year

Recruiting