Preservation and persistance is an important property of scholarly information, but which is commonly ignored for information stored on computers. For example: to a researcher, the bibliography can be the most useful part of a scientific paper, since it leads to more relevant or complete works; but the references in that bibliography must be valid after years or decades. Using HTTP URLs on the Web, this is not likely, since a URL is a specific pointer to a computer and file, and it becomes obselete as soon as the file is moved or the computer is renamed.
The CNRI Handle System was created to solve this problem. It offers a single unchanging, "persistent" name for a network resource, and resolves it upon request to the actual URL. It is both a layer of indirection and an archival-quality naming system. When a document references a "handle", the reference may still be in the form of a URL (to a proxy server), but the contract of the Handle System states that URL will never change. Whenever the resource referenced by the handle moves, its author has one place to update -- the Handle System entry -- and all of the links made to it will work.
MIT is already an online publisher of scholarly resources, both formally (via the Libraries and the MIT Press) and informally. The commercial publishing community is starting to adopt Handle System technology in the form of Digital Object Identifiers. It makes sense for MIT to enjoy the benefits of persistent reference names as well, and to encourage their use throughout the world academic community to produce more useful online documents.
The NCSTRL project and the Digital Library of MIT Theses are already using handles on an experimental basis, and would benefit for a formal committment to some persistent resource naming system. The MIT Libraries online catalog (Barton) needs persistent names to be able to include references to online resources, since URLs are too volatile to keep in the catalog. The Electronic Media Creation Center (EMCC) has shown interest in using persistent names for the content they are creating.
To succeed, a persistent naming service must be pervasive and permanent. Every publisher of a document worth preserving indefinitely at MIT and its associated laboratories is a potential customer, but they must be willing to adopt the persistent name as the canonical reference to their work. They must be convinced that it will always be available.
The MIT Libraries can help by lending their reputation and expertise in preservation to the project, and thus legitimizing the service. They can even help attract "customers" by using persistent names to list the online version of a document as a holding in Barton, and providing a live link to it in the Web catalog interface.
There is little technological risk, since there is already working, production-quality technology to implement a persistent naming service. The main risk is that it will be deployed but not used widely enough and thus we will be stuck maintaining the service indefinitely without all of the benefits it can offer.
One of the biggest challenges of this project is to decide how to integrate it throughout the Institute. The proposed technology offers a hierarchical namespace with decentralized administration; should it be allocated to bodies within the Institute and given over to their control? Guidelines for allocating names and namespaces must be established.
If an Institute-wide mailing is needed to spread the word about the service, that also has to be funded.
The chief executive of namespace administration may have more extensive duties, calling for a larger fraction of an FTE -- but it is still probably nothing close to a full-time occupation.
Since the entire value of this service is that is persistent and unchanging, by providing it the Institute is promising to support it indefinitely into the future. The Handle System was carefully designed to be independent of specific network protocols and services so it can be ported to future incarnations of the Internet; this means MIT must commit to support it for as long as it is relevant.
As a Discovery project, the most important goal of this effort is to decide whether MIT is to address this problem now and with this technology.