Cover Pages: SGML/XML DTD Transduction and Generation

The Cover Pages [画像:The OASIS Cover Pages: The Online Resource for Markup Language Technologies]
SEARCH | ABOUT | INDEX | NEWS | CORE STANDARDS | TECHNOLOGY REPORTS | EVENTS | LIBRARY
SGML/XML DTD Transduction and Generation

[August 29, 2000] A provisional reference list. See the bibliographies in the individual articles.


References:


[CR: 19970523]

Ahonen, Helena . "Automatic Generation of SGML Content Models." Pages 195-206 (with 12 references) in EP '96. Proceedings of the Sixth International Conference on Electronic Publishing, Document Manipulation and Typography. [ = Journal Special Issue: Electronic Publishing - Origination, Dissemination and Design (EPODD), June & September 1995, Volume 8, Issues 2-3. Sixth International Conference on Electronic Publishing, Document Manipulation and Typography, Palo Alto, California. September 24-26, 1996. Sponsored by Adobe Systems Incorporated; School of Information Management and Systems, University of California at Berkeley; Xerox Corporation. [Proceedings Volume] Edited by Allen Brown, Anne Brüggemann-Klein, and An Feng; [Journal] Editors David F. Brailsford and Richard K. Furuta. Chichester/ New York: John Wiley & Sons, 1996. ISSN: 0894-3982. Author's affiliation: Department of Computer Science, P. O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, Finland. Phone: +358 0 708 44218; Fax: +358 0 708 44441; Email: helena.ahonen@helsinki.fi. WWW: Helena Ahonen Home Page .

Abstract: "We study the problem of automatic generation of a document type definition (DTD) for a set of Standard Generalized Markup Language (SGML) documents. We present various situations where we have tagged documents but no DTD, and discuss the requirements various applications may have with respect to the generation process. We also present an automatic DTD generation tool that can be adjusted for several tasks necessary in the applications. The method is also demonstrated with some experimental cases."

Keywords: SGML, document type definition, generation, TEKES.

For other conference information, see the main conference entry for EP '96, or the brief history of the conference as sixth in a series since 1986. See the volume main bibliographic entry for a linked list of other EP '96 titles relevant to SGML and structured documents.

The document is available in Postscript format: http://www.cs.helsinki.fi/~hahonen/helena_ep96.ps [mirror copy].



[CR: 19960728]

Ahonen, Helena . Automatic Generation of SGML Content Models. Paper Submitted and accepted for presentation at Electronic Publishing '96 . Helsinki, Finland: Department of Computer Science, University of Helsinki, Finland, 1996. Extent: 10 pages. Author's affiliation: Department of Computer Science, P. O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, Finland. Phone: +358 0 708 44218; Fax: +358 0 708 44441; Email: helena.ahonen@helsinki.fi. WWW: Helena Ahonen Home Page .

Abstract: "We study the problem of automatic generation of a document type definition (DTD) for a set of Standard Generalized Markup Language (SGML) documents. We present various situations where we have tagged documents but no DTD, and discuss the requirements various applications may have with respect to the generation process. We also present an automatic DTD generation tool that can be adjusted for several tasks necessary in the applications. The method is also demonstrated with some experimental cases."

The document is available on the Internet: http://www.cs.helsinki.fi/~hahonen/helena_ep96.ps; [mirror copy]



[CR: 19951220]

Ahonen, Helena ; Nikunen, Erja. "Forming Grammars for Structured Documents: An Application of Grammatical Inference." Pages 153-167 in Grammatical Inference and Applications. Papers Presented During the Second International Colloquium. Second International Colloquium on Grammatical Inference - ICGI-94. Alicante, Spain, September 21-23, 1994. Edited by Rafael C. Carrasco and Jose Oncina. Lecture notes in computer science, number 862. Berlin/New York: Springer-Verlag, 1994. ISBN: 3540584730 (Berlin); 0387584730 (New York). ISSN: 0302-9743. Authors' affiliation: Department of Computer Science, P. O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, Finland. Phone: +358 0 708 44218; Fax: +358 0 708 44441; Email: helena.ahonen@helsinki.fi. WWW: Helena Ahonen Home Page .

"Abstract: We consider the problem of generating grammars for classes of structured documents -- dictionaries, encyclopedias, user manuals, and so on -- from examples. The examples consist of structures of individual documents, and they can be collected either by converting typographical tagging of documents prepared for printing into structural tags, or by using document recognition techniques. Our method forms first finite-state automata describing the examples completely . These automata are modified by considering certain context conditions; the modifications correspond to generalizing the underlying language. Finally, the automata are converted into regular expressions, and they are used to construct the grammar. In addition to automata, an alternative representation, characteristic k-grams, is in-troduced. Some interactive operations are also described that are necessary for generating a grammar for a large and complicated document."

Available on the Internet: http://www.cs.helsinki.fi/~hahonen/ahonen_icgi94.ps [mirror copy, December 1995].



[CR: 19951220]

Ahonen, Helena ; Mannila, H. ; Nikunen, Erja. "Generating Grammars for SGML Tagged Texts Lacking DTD." Pages [???-???] in Principles of Documents Processing, PODP '94. Principles of Documents Processing. Darmstadt. April 11-12, 1994. Sponsored by: Fuji Xerox Systems and Commnunications Lab, GMD-IPSI, Rank Xerox Research Centre, and Xerox Webster Research Center. Edited by Makoto Murata and Herve Gallaire. [pub-location: Darmstadt?]: [publisher: GMD-IPSI?], 1994. Authors' affiliation: [Ahonen, Mannila] Department of Computer Science, P. O. Box 26 (Teollisuuskatu 23), FIN-00014 University of Helsinki, Finland. Phone: +358 0 708 44218; Fax: +358 0 708 44441; Email: helena.ahonen@helsinki.fi. WWW: Helena Ahonen Home Page; [Nikunen] Research Centre for Domestic Languages.

"Abstract: We describe a technique for forming a context free grammar for a document that has some kind of tagging -- structural or typographical -- but no concise description of the structure is available. The technique is based on ideas from machine learning. It forms first a set of finite-state automata describing the document completely. These automata are modified by considering certain context conditions; the modifications correspond to generalizing the underlying languages. Finally, the automata are converted into regular expressions, which are then used to construct the grammar. An alternative representation, characteristic k-grams, is also introduced. Additionally, the paper describes some interactive operations necessary for generating a grammar for a large and complicated document."

Available online: http://www.cs.helsinki.fi/~hahonen/ahonen_podp94.ps [mirror copy, December 1995]. The paper is also to appear in Mathematical and Computer Modelling. See the first author's home page for more up-to-date bibliographic details and other SGML-related research.


SEARCH
Advanced Search
ABOUT
Site Map
CP RSS Channel
Contact Us
Sponsoring CP
About Our Sponsors

NEWS
Cover Stories
Articles & Papers
Press Releases

CORE STANDARDS
XML
SGML
Schemas
XSL/XSLT/XPath
XLink
XML Query
CSS
SVG

TECHNOLOGY REPORTS
XML Applications
General Apps
Government Apps
Academic Apps

EVENTS
LIBRARY
Introductions
FAQs
Bibliography
Technology and Society
Semantics
Tech Topics
Software
Related Standards
Historic
Last modified: August 29, 2000

Hosted By
OASIS - Organization for the Advancement of Structured Information Standards

Sponsored By

IBM Corporation
ISIS Papyrus
Microsoft Corporation
Oracle Corporation

Primeton

XML Daily Newslink
Receive daily news updates from Managing Editor, Robin Cover.

Newsletter Subscription
Newsletter Archives
[画像:Globe Image]

Document URI: http://xml.coverpages.org/grammarTransduction.htmlLegal stuff
Robin Cover, Editor: robin@oasis-open.org


AltStyle によって変換されたページ (->オリジナル) /