IFLANET home - International Federation of Library Associations and InstitutionsActivities and ServicesSearchContacts


IN THIS DOCUMENT:

UDT Office Personnel

Preliminary Plans for Istanbul

National Digital Library Federation Agreement Signed

Confederation Project of National Library of Canada

Campus Watch

What's New on IFLANET?

Agora: Retrieving IFLANET WWW Documents through Electronic Mail

Standard Generalized Markup Language (SGML): Overview and Developments




UDT Newsletter

Universal Dataflow and Telecommunications

Archive - Historical Material

Summer 1995
Issue #28

UDT Office Personnel

Leigh Swain, Director of the IFLA International Office for UDT, would like to announce the following personnel changes. In March 1995, Paula Tallim, Programme Officer, began working on assignment elsewhere in the National Library of Canada. We wish her well in her new endeavours! Louise Lantaigne, who is replacing Ms. Tallim, is responsible for publication sales, newsletter distribution, general queries, and other administrative activities. Ms. Lantaigne has previously worked in several key areas of the National Library of Canada, and we welcome her formidable organizational skills. Terry Kuny, Global Village Research, an Ottawa-based Internet and electronic communications consulting company, has been contracted by the UDT Core Programme to develop and maintain the IFLANET World Wide Web service. He is also on contract to the National Library of Canada for consulting services with regard to information policy and technological developments. Also involved with IFLA electronic communications is Gary Cleveland of the National Library, author of several IFLA publications, who will attend the Istanbul conference with Leigh Swain.

Members of the UDT Core Programme Office can be reached via e-mail at the following addresses:

Preliminary Plans for Istanbul

At the 1995 IFLA Conference in Istanbul (August 19-25), the IFLA Section on Information Technology (IT) is planning to join with the Section on Serial Publications to hold an Open Session on the control of electronic publications, particularly journals. One important focus will be on standards issues. This is a timely topic in an area that is moving rapidly out of control as the electronic media proliferate.

In addition, the Section will sponsor a half-day morning “Workshop on Internet Basics” for colleagues who are just beginning to use the Internet and explore its many aspects. Since the conference in Havana, the number of World Wide Web sites has exploded and the subject of Internet discovery tools has become more complex. At least one other IFLA group plans to present a complementary afternoon program on a specific Internet topic.

(From "IT Review", the newsletter of the IFLA Section on Information Technology, March 1995, No. 24)

National Digital Library Federation Agreement Signed

On May 1, leaders of fifteen of the largest U.S. research libraries and archives and the Commission on Preservation and Access signed an agreement that pledges collaboration toward the establishment of a National Digital Library Federation. At the signing held at Harvard University, the founding members of the Federation — collectively responsible for hundreds of millions of cultural, scholarly and historical resources — agreed to cooperate on defining what must be done to bring together, from across the nation and beyond, digitized materials that will be made accessible to students, scholars, and citizens everywhere, and that document the building and dynamics of United States heritage and cultures.

A primary goal of the Federation is the implementation of a distributed, open digital library accessible across the global Internet. The library will consist of collections — expanding over time in number and scope — to be created from the conversion to digital form of documents contained in founding member institutions and other libraries and archives, and from the incorporation of holdings already in electronic form. In support of this goal, the Federation will establish a collaborative management structure, develop a coordinated funding strategy, and formulate selection guidelines to ensure conformance to the general theme of U.S. heritage and culture. The Federation also will adopt common standards and best practices to ensure full informational capture and guarantee universal accessibility.

The agreement recognizes and acknowledges the important leadership role that the Library of Congress has played in raising as a national issue the need for such a digital library.

The first phase of the Federation's work will be completed in six months. During that time, a task force coordinated by the Commission on Preservation and Access and composed of senior members of the staffs of the founding institutions will develop an action plan to address, among other issues, funding strategies and the involvement of additional institutions. The Commission will report regularly on developments.

The primary mission of the Commission on Preservation and Access is to foster, develop and support collaboration among libraries and allied organizations to ensure the preservation of the published and documentary record in all formats and to provide enduring access to scholarly information. It operates as a private, non-profit corporation supported by foundation grants and the sponsorship of colleges, universities, associations, libraries, and publishers.

Information:

Maxine K. Sitts,
Program Officer,
Commission on Preservation and Access,
1400 16th St., NW, Suite 740, Washington, DC 20036
Internet: mksitts@cpa.org

The mission statement and list of parties to the agreement are available in the CPA area of Conservation OnLine:

http://palimpsest.stanford.edu/

(Information from a Commission on Preservation and Access news release)

Confederation Project of National Library of Canada

On April 5, National Librarian Marianne Scott announced at a press conference in Sutherland's River, Nova Scotia, that the National Library of Canada is playing an active role in the creation of an "electronic library" by initiating efforts to digitize Canadian material and make it available on the Information Highway.

The announcement marked the launch of a demonstration project involving both the National Library of Canada and the Library of Congress. Two schools, East Pictou Rural High School in Sutherland's River, Nova Scotia and Hammond Middle School in Alexandria, Virginia, participated in a week-long demonstration of the project by accessing information on Canada's Confederation and the U.S. Civil War through Internet links to both libraries.

The project is based on the idea of using the Internet to provide wider and more immediate accessibility of information. As Marianne Scott explained, "This project allows us to share the resources of the National Library with a greater audience and to promote the study and appreciation of Canada's published heritage." The National Library's long-term objective is to provide electronic access to a much greater volume of Canadian material, both in Canada and around the world.

The Stentor Alliance, a consortium of Canada's telephone carriers, represented by Mr. Colin Latham, President and CEO, Maritime Telephone & Telegraph (MT&T), made a commitment of $450 000 to the Friends of the National Library. Accepting on their behalf, the National Librarian said: "We are grateful to the members of the Stentor Alliance for their contribution. We hope that this is the first of many corporate contributions that will support the National Library's efforts to promote and increase Canadian content on the Information Highway."

Marianne Scott was joined at the launch by the Honourable John Savage, Premier of Nova Scotia; Mr. Colin Latham, President and CEO, MT&T; and Mr. Alex Burney, Manager, Canadian Business Development. Dr. James Billington, Librarian of Congress; Ms. Linda Roberts, US Secretary of Education; Mr. Derek Burney, Chairman, Bell Canada International; Mr. Glenn Jones, CEO, Jones Intercable, and Mr. Decker Anstrom, President, American Cable Association also participated in the launch in Alexandria, Virginia, Toronto and New York via satellite links.

(From a National Library of Canada news release)

Campus Watch

Campus Watch is an electronic newsletter published by CAUSE, a Colorado association, to share news concerning effective management and use of higher education information resources, including information, technology, and services. Here are two items of interest from the April 28, 1995 issue:

University of Michigan Forms Coalition to Reinvent Information Education

The University of Michigan's School of Information and Library Studies (SILS), with the W.K. Kellogg Foundation, has formed the Kellogg Coalition on Reinventing Information Science, Technology, and Library Education (CRISTAL- ED). Under the five-year project, SILS hopes to radically change its instructional program, using an international, multidisciplinary consortium of schools to define new professional specializations focussing on information access and use. SILS has started a moderated listserv on the subject; subscribe by sending an e-mail to majordomo @sils.umich.edu with the message: subscribe cristal-ed. Web home page: http://sils.umich.edu/publications/ CRISTALED/Kellogg-HomePage.html

Harvard Initiates Policy Project on Digital Libraries

The Science, Technology and Public Policy Program at Harvard's John F. Kennedy School of Government has announced a four-year project on policy development for the "digital library". The project, funded by the W.K. Kellogg Foundation, will investigate financial, legal, and institutional issues, focussing on life-cycle management of information resources. Contact: Brian Kahin, director, Information Infrastructure Project, kahin@harvard.edu

What's New on IFLANET?

The IFLANET World Wide Web (WWW) home page is located at:

URL: http://www.ifla.org/index.htm

We are extremely pleased to report that within the first six weeks of operation, the IFLANET WWW home page was visited by over 2000 people. Users from more than 30 different countries accessed the server in March. So far, users have been from Australia, Chile, South Africa, Thailand, China, Israel, and almost every country of Europe and North America. Initial feedback indicates that the IFLANET WWW service is becoming regarded as one of the premier Internet sites for information pertaining to libraries!

A number of new electronic resources have been added to the WWW service since February. Among the most popular pages have been:

Digital Libraries - Resources and Project.

URL: http://www.ifla.org/II/diglib.htm

G-7 Information Society Conference Information.

A meeting was held in Brussels on February 25-26, 1995 to encourage the development of a worldwide information society.

URL: http://www.ifla.org/II/g7.htm

Copyright and Intellectual Property Resources.

URL: http://www.ifla.org/II/cpyright.htm

Information Technology Standards and Organizations.

URL: http://www.ifla.org/II/standard.htm

The Unesco Public Library Manifesto (1994).

URL: http://www.ifla.org/documents/libraries/policies/unesco.htm

UDT Occasional Papers

The first is a set of three papers exploring information and networking technologies in Russian libraries.

URL: http://www.ifla.org/VI/5/op/index.htm

Interlibrary Loan, Document Delivery and Resource Sharing Information.

URL: http://www.ifla.org/II/ill.htm

Cataloguing and Indexing of Electronic Resources.

URL: http://www.ifla.org/II/catalog.htm

World Guide to Doctoral Dissertations in Science and Technology.

URL: http://www.ifla.org/V/wgddst/wgddst.htm

IFLA Grants Information.

URL: http://www.ifla.org/III/members/grants.htm

Libraries and Related Information Policy Statements.

URL: http://www.ifla.org/II/libpol.htm

Electronic Text Archives.

URL: http://www.ifla.org/II/etext.htm

61st IFLA General Conference Information.

URL: http://www.ifla.org/IV/ifla61/61intro.htm

Watch for future IFLANET additions! Upcoming services will include:

  • Internet and library software archive (June 1995)
  • The Gutenberg Electronic Text archive (July 1995)
  • An electronic icon and clipart archive for libraries (late summer/fall 1995)

Additions and changes to the IFLANET WWW service are announced regularly to the PACS-L and IFLA-L listservers. Please check the IFLANET WWW home page "What's New" section for the latest developments.

If you have any comments, corrections, or suggestions pertaining to the IFLANET WWW service, please send a message to:

IFLANET@ifla.org

Agora: Retrieving IFLANET WWW Documents through Electronic Mail

Agora is a service to allow users who have access only to electronic mail on the Internet to retrieve content from World Wide Web (WWW) servers. This service is being provided by the W3 Consortium http://www.w3.org/Consortium/, the international organization that is promoting WWW standards, applications and development.

IFLANET administration will be examining the possibility of mirroring Agora on IFLANET in the future to alleviate the load on the W3 Consortium computers.

The following information about Agora has been adapted from text prepared by Arthur Secret (agora-request@mail. w3.org) of the W3 Consortium.

The following information has been prepared by Arthur Secret at the W3 Consortium.


To retrieve a WWW document through Agora, you just have to specify its "address", called a Uniform Resource Locator (URL). For example, the URL of this help document is:

    http://www.w3.org/hypertext/WWW/Agora/Help.txt

This means that to get it, you just have to send a mail to listserv@mail.w3.org, with whatever subject you like, the body of the mail being:

SEND http://www.w3.org/hypertext/WWW/Agora/Help.txt

If this is the first time you enter this new medium, we suggest you look at the following documents:

In this new medium, most documents are hypertext. In such a case you will notice numbers in square brackets such as [12] next to certain words. This means that you can access another document, hopefully related to the word(s) preceding the [12]. If you want to see this document, you will find at the bottom of the document containing the [12] a list of URLs, next to numbers. Copy the URL next to [12], and paste it to the body of the request you sent to listserv@mail.w3.org.

There is another way to retrieve W3 documents through email. Reply to listserv@mail.w3.org, and specify in the body the number(s) you are interested in. This program will figure out which document you are interested in by looking at the subject header, which you then must save.

Example 1

You'd like to know more about the World Wide Web Initiative? The number within brackets is 1. At the bottom of the page, 1 corresponds to

    http://www.w3.org/hypertext/WWW/TheProject.html

So you have to send a mail to listserv@mail.w3.org with the body:

    SEND http://www.w3.org/hypertext/WWW/TheProject.html

Example 2

You still have some questions on World-Wide Web ? All right, the appropriate documents are "World-Wide Web Developpment" and "Frequently Asked Questions". They have the numbers 2 and 3 in brackets next to them. So you reply to the mail from listserv@mail.w3.org that you are currently reading, and write in the body:

    2 3

Note: If your mail tool truncates subject lines, it may be useful for you to know that this robot needs only the part (URL: ...) to determine what the numbers refer to.

Commands related to the retrieval of W3 documents

Everything appearing in [] below is optional; everything appearing in is mandatory; all arguments are case insensitive. Only the first 10 lines of requests will be processed.

    send
    www

this will send you back the document you requested, with all its hrefs, so that you may ask further requests. (if the document is too large, you will get only its first 5 000 lines). The url sent may contain the following characters:

    abcdefghijklmnopqrstuvwxyzABCDEFGHIJK
    LMNOPQRSTUVWXYZ0123456789:/._-+@%*()?$#&

Example:

    SEND http://www.w3.org/
    WWW http://www.cern.ch/

rsend

Same as "send", but you can specify a different return-path

Example:

    rsend zorro@horsemen.holywood.com http://www.w3.org/
deep

Same as "send", but it will also send you the documents refered to in the URL you mentioned. (If the documents are too large, you will get only the first 5 000 lines of what "deep" should provide). Be cautious when using "deep"! Agora might mail to you several dozens of documents!

Example:

    deep http://www.w3.org/

source

Same as "send", but allows you to see the source of the document, so that you may use a nicer HTML browser to read it

Example:

    source http://www.w3.org/

rsource

Same as "source", but you can specify a different return-path

Example:

    rssource zorro@horsemen.holywood.com http://www.w3.org/

help

Shows you this document.

References from this document

Standard Generalized Markup Language (SGML): Overview and Developments

by George Charlebois, National Library of Canada, National Library of Canada

Introduction

Standard Generalized Markup Language (SGML) is an International Organization for Standardization Standard (ISO 8879) that describes a language for text encoding. It provides a set of conventions for marking up, or describing, a full text document so that individual parts of the document can be identified and manipulated by processing applications. These conventions specify what codes are allowed, which are mandatory, how to distinguish markup from text, and what each code means.

Overview and Description

SGML is a product of the publishing industry, having had its origins in the automation of proofreaders markup in the 1970s. The concept of a generic markup language was first discussed at a meeting between the Graphics Communication Association and the Canadian Government Printing Office in 1967. Work resulting from this meeting culminated in the publication of a standard for the first generic markup language, GML, in the early 1970s. By 1978 this had been developed into the first version of Standard Generalized Markup Language (SGML) published as a Graphics Communication Association Standard. By 1986, this had been revised and published as International Standard, ISO 8879 which is the definitive version still in use today.

Unlike most other markup languages, SGML is non-proprietary, that is, it is not owned by any one vendor and does not have to be licensed for use. Anyone can study the SGML specification and implement a version of it. It is also not machine or application dependent. SGML tagging does not embed specific formatting codes into the text of a document, it only identifies the various parts of the document for later processing. This unique ability to separate format, structure and content and to process each separately affords SGML encoded documents a degree of hardware and software independence not available by any other method.

An SGML document consists of :

Document Declaration which specifies basic facts about the "dialect" of SGML being used. This is where one sets defaults, specifies options and delimiters, declares which character set is being used and similar functions. The Document Declaration may be held as a table in the processing applications rather than in the actual document , and so may be invisible to the user.

Document Type Definition (DTD) which specifies the document's structure. One DTD may be used for a whole class of similar documents if an organization has a large publishing program. For example, all the administrative manuals may share the same DTD, all the catalogs may share another. Sharing DTDs in this way is not only economical, it also provides a uniform "family" look and feel to the documentation produced by an organization. The main function of the DTD is to specify the tagset used to encode the text of a document. This includes specifying the actual tag names, the relationship between the tags, the order in which they appear in the document, and any qualifying attributes which apply to individual tags. Another important function of the DTD is to define the format for linkages to other documents.

Document Instance is the actual text of the document with the SGML tags embedded into it to identify the various parts of the text. Most of these texts are plain ASCII files created on a word processor or SGML author/editor. Although a Document Instance can share a DTD with several other documents as mentioned above, it can conform to only one DTD itself, and cannot draw on tagsets, defaults or definitions from several DTDs.

Output Specification which carries information about the formatting of specific text elements, such as type face, indention and font size. It is especially useful in cases where you must preserve the exact format of a document, such as in forms transmission. There are two types currently available, Formatting Output Specification Instance (FOSI), used for printed material, and Document Style Semantic and Specification Language (DSSSL), used for other types of media as well as printed. The DSSSL is an ISO standard currently in its second draft, and is still undergoing extensive revisions.

There are two types of SGML systems in use today. Those that operate directly on SGML encoded files are referred to as native SGML systems, and those that convert SGML encoded files to some proprietary internal format for processing are called quasi-SGML systems. Both major word processing packages, MS WORD and WordPerfect, are of this later quasi-SGML type. As could be expected, the conversions between SGML tags and word processing styles, are not perfect, and some of the SGML functionality is lost in the process. This is the first generation of word processors to have this capability however, and the companies concerned are committed to full support of SGML in future releases of their products.

Applications

Many of the most successful SGML applications are in the area of publishing and information handling. One of the earliest implementations was the Computer Aided Logistics Support (CALS) Project of the U.S. Department of Defense. The DOD and its defense contractors used SGML encoded documents to exchange data, bids and technical specifications for weapons systems contracts. The Text Encoding Initiative (TEI), a consortium of academic institutions, is using SGML to make available electronic versions of texts in the humanities for scholars to study. In conjunction with other similar projects, they have so far made available all known works in classical Greek, all the works of Shakespeare, all English poetry before 1900, and a substantial body of English prose works as well. Statistics Canada uses SGML to produce the Canada Yearbook, and is transferring most of its statistical publications to SGML format. Beginning in Sept. 1994, they also offer on-line access via the Internet to their SGML-based statistical databases. In addition, there are applications of SGML used to create Braille and large print books for the handicapped, to provide access to flight information and technical data in the aviation industry, and to monitor automobile emissions for conformance to EPA standards in the automotive industry. The increasing number of SGML-based projects and systems all highlight the growing acceptance of generic data encoding with its long term advantages of platform and application independence for valuable corporate data resources.

New Developments: SGML and World Wide Web (WWW)

A milestone in Internet history was announced at the SGML '94 Conference in Washington, D.C held in early November. Yuri Rubinski, President of SoftQuad a major SGML software supplier, and Joseph Hardin from the National Centre for Supercomputing Applications (NCSA), announced that as of January 1995 it will be possible to create, view and distribute fully conformant SGML documents on the Internet. This will be possible thanks to SoftQuad's decision to distribute its SGML viewer, PANORAMA, free of charge as part of NCSA's MOSAIC software package.

NCSA has been a leader in the development and distribution of freeware on the Internet. Their World Wide Web (WWW) client, MOSAIC, is probably the most popular Web client in use today, and is freely available from NCSA. Applications on the Web currently use Hypertext Markup Language (HTML), a reduced tagset version of SGML, to encode their documents, and an HTML viewer supplied by NCSA. The advent of a full capability SGML viewer like PANORAMA will provide a considerable impetus for the use of SGML conformant applications on the Internet.

In the short term this means that any existing SGML document can be put up and viewed on the Internet in its native format. In the longer term, it probably means that SGML will consolidate its position as the document encoding standard of choice for use on the Web. SGML will also provide much richer and more functional documents than HTML is capable of handling, and over time, it will develop in ways as yet unforeseen. It also provides impetus to document registry and repository projects, which will be necessary to control and manage the increasing number of SGML documents on the Web.

The HTML DTD itself will not disappear. NCSA, the creators of HTML, indicated that they are going to develop the HTML DTD to become a fully conformant SGML DTD in order to support the additional functionality offered by PANORAMA. This will mean a significant expansion of the current HTML tagset, and the incorporation of many new features of the SGML standard into the HTML DTD. The new HTML will probably be scaleable, that is it will support several levels of conformance to the SGML standard. Currently existing HTML encoded documents will not become obsolete. You will still be able to view and update them, but new documents conforming to full SGML will be handled with equal ease.

SGML information on the Internet

Usenet

    COMP.TEXT.SGML - news:comp.text.sgml

FTP archive for the above newsgroup

Frequently asked questions (FAQ)

    URL: http://www.osf.org:8001/osfdtd/sgml-faq.html

An Introduction to SGML

Discussion Lists

SGML-L Discussion List

TEI-L Discussion List

*    

Latest Revision: March 6, 1996 Copyright © 1995-2000
International Federation of Library Associations and Institutions
www.ifla.org