Perseus · Tufts
Perseus Research Preprints
Collections: Classics · Papyri · Renaissance · London · California · Upper Midwest · Tufts History
Configure display · Help · Tools · Copyright · FAQ · Publications · Collaborations · Support Perseus

Publications: Collection contents
About the collection

The Management of XML Documents in an Integrated Digital Library

David A. Smith
Anne Mahoney
Jeffrey A. Rydberg-Cox

Paper presented at Extreme Markup Languages 2000: The Expanding XML/SGML Universe, Montréal, 15-18 August 2000.

The final version of this article will be published in Markup Languages, vol. 2, issue 3 (2001), published by The MIT Press.

Abstract: We describe a generalized toolset developed by the Perseus Project to manage XML documents in the context of a large, heterogeneous digital library. The system manages multiple DTDs through mappings from elements in the DTD to abstract document structures. The abstraction of document metadata, both structural and descriptive, facilitates the development of application-level tools for knowledge management and document presentation. We discuss the implementation of the XML back end and describe applications for cross citation retrieval, toponym extraction and plotting, automatic hypertext generation, morphology, and word co-occurrence.

Full text: pdf