Metadata Design for Search and Retrieval in State of Illinois Electronic Archives

GSLIS technical report number: ISRN UIUCLIS--2001/9+EARCH

Dated: 27 December 2001

Author: Larry S. Jackson < lsjackso@uiuc.edu >, http://www.isrl.uiuc.edu/~jackson

This report available on the web at http://www.isrl.uiuc.edu/pep/papers/UIUCLIS_2001_9_EARCH.html


PROJECT OVERVIEW & BACKGROUND

Background and Introduction

The metadata and schema design work addressed herein are part of a larger effort involving the State Libraries of Illinois [1] and Ohio [2]. Under a sub-award of the grant, "Preserving Electronic Publications", from the Institute of Museum and Library Services [3] (IMLS) to the Illinois State Library (ISL), GSLIS is performing the design and creation of a readily deployable, low-cost facility capable of acquiring and retaining electronic copies of an arbitrary web site [4] [5]. Such a facility would enable either centralized electronic archival (e.g., operated by GSLIS or ISL), or decentralized archival wherein each agency operates its own archive. The facility will be used to archive official documents published on the web by agencies and activities of the State of Illinois. The system will ultimately be made available to other states and organizations.

The Illinois system will be built upon a number of off-the-shelf computing solutions, planned to include;

The system design, and all software components used therein are intended to subsequently be available for download, either under no-cost license, or as public domain material. The Illinois system is being tested, using the complete contents of the web sites of;

The Ohio system is to be based on prior work by the Ohio State Library on the Joint Electronic Records Repository Initiative (JERRI), and on "beta test" use of the "Digital Vault" software from OCLC.

Recognition of the problems of state government document archival in Illinois dates to "... 1967 when legislation and rules were adopted that named the Illinois State Library as the mandated depository for all State of Illinois publications and the administrator of the statewide Illinois Document Depository Program. The State Library Act was amended in 1995 making the State Librarian also responsible for making electronic state records available via the World Wide Web." [19]

In Illinois, the Secretary of State ("the Secretary") is, in addition to numerous other administrative and operational duties of the State, also the titular State Librarian, and therefore presides over the Illinois State Library (ISL). The collection of activities overseen by the Secretary is relatively large, with 15 major entries in the public description of the office [20]. Many of these are, to some extent, involved in information and records management work for the state. Even within the Secretary's office, there is a strong need for systematic metadata usage to make a number of web sites searchable as one unified body.

Within ISL, a history of grants, and the associated staffs, coming and going has created an inventory of information resources (e.g., the "Find-It! Illinois" search facility [21]). These resources may, or may not, be fully up to date and interoperable. For example, the "Find-It! Illinois" search engine does not find the Metadata Generator [22] that was also created by ISL staff. A number of partly related web sites have been created and registered, often with different names, and name selections that emphasize what in the commercial sector would be called "brand recognition" rather than making clear the exact governmental affiliation. These domain names include;

Organizationally, the webmasters of the numerous other Illinois state agencies are not under the de jure supervision of the Secretary. Thus, direction from the ISL concerning practices for information construction and metadata tagging cannot be "enforced", but must rather be "cajoled". The ISL staff has taken the approach of facilitating "taking the better path". For example, they provide web-accessible fill-in-the-blanks forms that generate the appropriate metadata tags for use by authors of State of Illinois web pages, an enumerated classification system embodied in the metadata generator, and explanatory, rationale, and motivational material concerning metadata usage in general [19] [23].

Definitions and Scope of the Archival Problem

The Illinois State Archives [24] considers the difference between a "document" or "publication" and a "record" to be that the retention of the record is somehow formally required, and that a record may not be disposed of without some for of approval or directive [25]. Further, while publication of a document is generally specifically mandated, its retention period is often unspecified. Until or unless information to the contrary can be obtained, a document is presumed to be retained "forever".

The language of legislation and the Illinois Administrative Code has not caught up with the emergence of the web and other Internet technologies, but the matter is under discussion, and guidelines have been issued concerning scanned versions of physical documents [26]. Archivist concerns address issues of media lifetime and commercial viability, availability of suitable software to view electronic contents, the ease of inadvertent deletion, the maelstrom of competing and evolving file format standards. Archivists have formerly been concerned primarily with "primary sources", and not so much with materials derived therefrom [25]. Thus, while a master copy of a high-resolution state highway map would be of concern, a simplified copy to provide motorists directions on some web site has not been viewed as appropriately archived. E-mail and associated informational lists operated by the state are not considered suitable for archiving.

In the interest of long-term survivability, the Illinois Archives [26] recommends "... that digital records with a retention value of more than ten years be backed up on microfilm or paper", and "... that every textual record an agency creates is reducible to the ASCII format". In the interest of surviving disastrous events, notes "For the past two thousand years, nothing has helped information survive natural and man-made disasters more than redundancy...". The copies of electronic documents made by the system being constructed under this grant will duplicate those materials off-site from their original location, and will also support copying onto any electronic media type that should emerge in the future.

The Role of Metadata in Facilitating Search

As promoted in the Find-It! Illinois Agency Guide to Indexing Websites [23], consistent use of a consistent set of metadata would support better searching across multiple State of Illinois web sites by the citizenry. User queries can potentially be cross-cutting (e.g., a query about a certain park should perhaps trigger retrieval of web pages from both the Department of Natural Resources and the Department of Transportation, if, say, the main access roads to that park are undergoing repair).

Also, searchers would not need to know a priori which agency happens to have administrative custody of the literature on a particular topic at a given moment in time. This is particularly desirable in that organizational responsibilities frequently change. The electronic archive will strongly benefit from consistent use of metadata in cases where a search is made for documents that are part of programs for which administrative custody has changed over the life of the archive.

More complex cross-domain queries may not be supportable in the absence of appropriate metadata, as keyword occurrence measures are generally unable to determine themes and topicality of documents.

As part of the larger grant effort, a schema suitable for retention of the web sites' metadata will be created. The initial reviews and intentions for that schema is presented herein. Metadata fields retained are planned to be;

Other Potential Applications for the Archive System

In addition to formal government applications, the system may be very useful to arbitrary projects that are continually evolving, and that need to capture their history as it plays out across their web pages over time. Accordingly, spidering experiments already have processed the web sites of;

in addition to the much more extensive testing involving the selected web sites of those agencies of the State of Illinois.

DESIGN INTENT

The design of the Illinois system reflects an overall commitment to achieving a solution that is compatible with typically limited budgets of state and municipal agencies and of other small groups. As such, the use of very large and expensive search engine and data acquisition facilities, as are being tried elsewhere, are precluded here. The system is intended to be constructed entirely of freely obtainable software. Under terms of the various licenses (e.g., GNU public license), this material will be obtainable from either the original providers of long standing, or from the web site for this project [4]. Material constructed as part of this grant will be made freely available, on our web site.

The principal freeware software modules mentioned above will be interconnected much like Figure 1. Interconnection scripts to date have been written in BASH shell commands [30], PERL [31] [32], AWK [33], and EX editor macros [34] -- all standard, free, UNIX components. It is intended to port all locally scripts to PERL to minimize the number of UNIX components upon which the project software is dependent.

The operation of the wget web spider is controlled by many parameters that must be supplied on the command line when the spider is started. As command-line syntax is not particularly user-friendly, and as administrators of the archive system should not be presumed to even have access to the host machine, web-based access to the control parameters is particularly desirable. By encapsulating the parameter set for each web site in a specified Extensible Markup Language (XML) [35] file, the parameter sets can be made much more maintainable. The XML files themselves will be editable using any text editor, and a simple web server script will be created to provide editing capability remotely, and under password control.

Figure 1. Overall archive system architecture.

The current version of the XML Document Type Descriptor (DTD) is provided in Appendix A.1. Variable values in this application should have some form of default behavior. The default values are also describable by filling in an XML document, compliant with the same DTD. Storing default values in XML format will allow a new archive site (e.g., a download, done by some unknown party) to have default behaviors that differ from the "factory version" of the archive toolkit, without necessitating they re-specify these behaviors for every web site they wish to archive. The current version of the default values is provided in Appendix A.2. The values for a specific web site should then override any of those defaults. An example using the current version of the parameters for the Illinois Department of Transportation's web site is provided in Appendix A.3.

Web pages for administrator operation of the archive, and for user entry into the archive are planned to be minimalist HTML. It appears the FORMs provisions of HTML 2.0 will suffice. This streamlined design will support the widest range of users and user hardware facilities, specifically including (1) handicapped users, as required by [36], (2) low-bandwidth users such as Prairienet users with the Lynx text-only web browser, and (3) users with hardware necessitating content format adaptation, including wireless access devices, and handicapped access devices. Of course, the retrieved content itself will be as its author made it. But, access, control, and management features of the archive need not be inaccessible. Archive operation, if not its content, will also be browser-independent, allowing users their choice between the two currently most popular web browsers.

The archive must support acquiring the wide variety of file formats being encountered on the web. Appendix B lists the formats the archive is currently configured to support. Unfortunately, new formats are continually being introduced. Changes to the archive to support a new format are relatively straightforward, requiring only that a file type be determined to be either ASCII or non-ASCII. ASCII file archival is more storage efficient, more portable, and more amenable to subsequent data mining and search operations.

CONSTRUCTION & RESULTS TO DATE

Operational Prototype Sites

A complete working copy [37] of the original Department of Transportation website [14] has been created by the manual sequence of;

The reader is encouraged to try these copied web sites, and to compare their behavior with the originals. Very many of the features work to support browsing, including the use of hyperlinks, embedded images, and some JavaScripts. A number of relatively isolated problems remain, many of these deriving from seemingly erroneous use of the underlying HTML in a webpage.

Sites that query scripts on the parent server generally do not work, although they may work at the moment as the original site still exists in a form that exactly matches the needs of the archived page. That is, the archived page may be calling the "live" site.

Notable among the failures is that files retrieved by JavaScripts or the execution of Java applets or Active-X controls are not retrieved by the wget spider. Thus, those files will not reside on the archive server, and the active code that attempts to reference those files will fail. A notable example of this is the use of the "mouseover" features in JavaScript to toggle to a new image when a user passes the mouse cursor over a button or region on the webpage. The alternate image will fail to load from the archive, giving an empty rectangular frame in its place.

Site Statistics

The sizes of the test sites, and the number of META tags encountered are listed in Table 1. The META tags encapsulated in those files are one of the three defining sources of what the metadata schema must support, and, correspondingly, what sort of searches will be available beyond simple keyword matching.

IMLS-PEP Test Web Site Total Web Site:
Size in Bytes,
# Documents
# of Markup-Formatted Documents # of META Tags Remarks
Department of Transportation
@200111302101
620949504
7286
1312 1461  
Secretary of State
@200111301614
154144768
1525
841 1531  
Department of Natural Resources
@200112010004
508919808
9305
5107 8374 Other unusual file names (e.g., containing punctuation or space characters) suggest the spider encountered syntax errors or became confused in light of hardware platform dissimilarities as discussed below.
Department of Revenue
@200111301312
84639744
3379
1678 3308  
Illinois Pollution Control Board
@200111301754
364077056
2597
202 762 The website contains 2 files or directories with UNIX metacharacters in their names, which interfere with the statistics gathering programs.
Total for IMLS-PEP Test Sites 1732730880
24092
9140 15436  

Table 1. Observed sizes of test sites, and extent of metadata use.

Configuration Problems Encountered

Wget must be set to "make links relative" in order to support relocation of sites to an archive server. Otherwise, embedded anchors will continue to point back to the parent site, and all embedded links relative to "docroot" (i.e., the top-most document directory reachable by the web server in its operation) on the parent site will break.

Vendor Incompatibility Problems Encountered

Sites use multiple authoring tools and multiple types of host computers (i.e., Windows, Mac, UNIX) for content development. This causes all possible combinations of "newline" (i.e., end-of-line) characters to be used. Wget does not standardize the files it downloads, so an additional step was added to the processing to ensure line-breaks are sensible.

Web software vendors have not followed suit with the UNIX convention of case-sensitive names. In particular, Microsoft's web servers are case-insensitive. For example, the use of case in the user's input of a URL to a browser, or the use of case in an HTML tag referencing a URL need not match the use of case in the directory and file names on a Microsoft web server. Unless special measures are taken (e.g., the mod_speling module is compiled in to Apache servers [10]) use of case must be entirely consistent on UNIX web servers (the original web servers), or file retrieval will fail. The Department of Natural Resources site exhibits this problem. Each unique spelling, considered on a case-sensitive basis, of a file path and name cause a separate file retrieval by wget (e.g., "A.htm" and "a.htm" would be stored twice in the archive, and would be considered to be distinct). Thus, the following file groups show examples of files that are stored more than once, under file or path names that differ in use the of case.

./dnr.state.il.us/orep/inrin/ctap/CTAP 2summary.htm
./dnr.state.il.us/orep/inrin/CTAP/CTAP 2summary.htm
./dnr.state.il.us/orep/INRIN/ctap/CTAP 2summary.htm

./dnr.state.il.us/lands/education/interprt/Caverns Permit.htm
./dnr.state.il.us/lands/education/INTERPRT/giant_city/Fall Programs, 2002.htm
./dnr.state.il.us/lands/education/INTERPRT/giant_city/Summer Programs, 2002.htm
./dnr.state.il.us/lands/Education/INTERPRT/Junior Naturalist Program.htm
./dnr.state.il.us/lands/Education/interprt/Junior Naturalist Program.htm
./dnr.state.il.us/lands/Education/INTERPRT/Junior Naturalist Activity Page.htm
./dnr.state.il.us/lands/Education/interprt/Junior Naturalist Activity Page.htm
./dnr.state.il.us/lands/Education/INTERPRT/Seasonal Programs.htm
./dnr.state.il.us/lands/Education/interprt/Seasonal Programs.htm

./dnr.state.il.us/ocd/ACE/Bios/Dwight Hanson Bio.htm
./dnr.state.il.us/OCD/ACE/Bios/Dwight Hanson Bio.htm

./dnr.state.il.us/ocd/ACE/Bios/Bender, Andrew.htm
./dnr.state.il.us/OCD/ACE/Bios/Bender, Andrew.htm

./dnr.state.il.us/ocd/ACE/Bios/Roseboom, Don.htm
./dnr.state.il.us/OCD/ACE/Bios/Roseboom, Don.htm

The Department of Transportation web site contains 54 files or directories with spaces in their names, including those listed just above. Such naming can be supported directly by a web server, depending upon its configuration. If not so configured, a web server requires the URL arrive with a "%20" in place of any blanks in the file name in order to correctly serve the file. Thus, to retrieve the file

http://dnr.state.il.us/ocd/ACE/Bios/Roseboom, Don.htm

from a UNIX server, the user must type the URL

http://dnr.state.il.us/ocd/ACE/Bios/Roseboom,%20Don.htm

in to the web browser. Two files at the Pollution Control Board web site also exhibit this problem.

The wget spider uses URLs it encounters in HTML documents to identify subsequent files to retrieve. The spider treats all URLs as case-sensitive. As noted above, some servers are not case sensitive. Thus, if wget encounters a second URL pointing to the same target web page, but that is spelled using a variation on the case used in the first URL, wget will again ask the server for the file identified by the spelling of the second URL. The case-insensitive server will then respond with the exact same file as formerly transmitted. The spider will then store this duplicate file under the case-sensitive spelling provided in the second URL. The spider then winds up with multiple files, having the same content, but differing in the case used in their filenames or in the names of directories along their path. To prevent this from happening, additional programming must be done to change all URLs and filenames into a consistent case (e.g., use uppercase throughout), for all files that originated on case-insensitive servers. It may be possible to make these modifications to the wget spider and to contribute them back to the wget development community for incorporation into subsequent versions of the freeware.

The Department of Transportation web site contains 211 files with an ampersand in their names. The ampersand is a UNIX "metacharacter" (i.e., a character with special functional meaning), and must appear in "escaped form" (i.e., as the string "\&"), if it appears at all. Commonly used UNIX analytical tools often cannot process a file with such a name.

Use of these sometimes-special characters, and vendor-specific practices interfere with the analytical and archival tools on other platforms. And, they would interfere with the administrative process of re-hosting a web site to another server type (which, essentially, the archive project is trying to do, automatically).

THE ROLE OF METADATA IN THE PROJECT

The Metadata Generator, and Prior Classification Work by ISL

The state government web page classification enumeration was devised by ISL for Illinois use. It is listed in Appendix D. It is based heavily on a predecessor, "WaGILS", by the Washington State Library [38] [39]. The hierarchy can be traversed using the category tree provided on the Illinois Government Information web page [40]. These values are stored in the CLASSIFICATION variant of the META tag.

Other measures such as authority control, while essential for effective search, are not yet in place statewide. For example, the Metadata Generator suggests a date format of "MMDDYYYY", but the script does not enforce that. (Date values are stored in the "createDate" and "dateofLastModification" variants of the META tag.) There are no controls on the various lines of a mailing address, nor for the formatting of telephone numbers.

The Metadata Generator web page [22] also suggests the use of unambiguous web page titles, containing time specifications so that one web page can be distinguished from another very much like it. (These values are stored in the "siteTitle" variant of the META tag.) Examples are provided, for this field and for others, but the software does not enforce any conventions.

Possible Improvements to the Metadata Generator

A possible improvement to the Metadata Generator would be to prompt the user for sub-fields, and then to agglomerate these into an authority-controlled format. For example, dates could be prompted for via pull-down entries for date, month, and year, and then a date string could be written to the appropriate META tag. The present system simply moves the user's typing directly into the META tags without performing any legality checks.

Another possible tool would relocate user prompting and input processing to an active component such as a Java applet, a JavaScript, or an Active-X control. The active component could be the place where consistency checks and standardization processing occur, thus shortening the path to accomplish user feedback. Such an implementation may not be desirable on the grounds of platform portability, on due to lack of compliance with the Americans with Disabilities Act. Alternatively, to stay with the web FORM processing mechanism and its desirable universality, the server script could be expanded to perform consistency checks and standardization processing. That is, instead of simply using the user's input verbatim, the script might warn or advise the user upon detection of spelling errors, omissions, or use of deprecated terminology. As web site practices are not controlled by the ISL, the user would expect to be able to override any warnings.

Limitations of the META Tag, in Theory

Having only the HTML META tag available to convey metadata information imposes two significant limitations:

  1. The metadata values supplied by the authors are ultimately arbitrary character strings. While state libraries and standards bodies may encourage the use of authority control measures, there is no mechanism within HTML itself to ensure authors comply with such measures. Additionally, data values may not be specified as to type (e.g., a character string may be entered in a field where an integer number was expected, such as when a user enters "N/A" instead of "0").
  2. There is no place to put META tags inside files containing other than SGML-influenced markup languages. For example, none of these metadata provisions may be used verbatim in image files or most word processor files (e.g., Microsoft Word's *.doc, and Adobe Acrobat's *.pdf). And, although some other formats contain places for some metadata fields (e.g., Microsoft's "document properties"), extracting them from files in a proprietary format would require additional software development.

Limitations of the META Tag, in Practice

The count of META tags in Table 1 is generally discouraging. Three (i.e., Department of Transportation, Department of Natural Resources, and Secretary of State) of the five agencies in the test baseline exhibit substantive progress in metadata implementation. Of those three sites, an average of 1.8 META tags per markup-language document reflects how very little the encouragement of ISL has penetrated to date. The situation is even worse than it might appear in that several META tags (see examples following) are effectively character string constants for all State of Illinois web pages, and may have wound up in live web pages simply because they were in an HTML template file.

So, for every use of a constant template a handful of META tags will be put in a document, and into the statistics of Table 1. That would mean the number of META tags that someone deliberately thought about including would be far below the average of 1.8 per web page.

Web page authoring tools also generally inject at least their identity as a META tag, as in

<META NAME="GENERATOR" CONTENT="Internet Assistant for Word ">

This information is basically useless for indexing purposes. It may help in constructing parsers that defend against know-deficient HTML practices by the authoring tool. The presence of this tag is possibly an effective form of advertising for the tool vendor, in that it could be construed as some form of endorsement of the tool by the state organization using the tool to do its web work. If 1.0 of the 1.9 META tags per document is just advertising injected by the authoring tool used, the usage of META tags by authors would fall to the average of 0.8 per document.

The frequency of use of the most widely used META tags is listed in Table 2. Note that the strictly pro forma "http-equiv", "prog-id", "VI60_DTCScriptingPlatform", "TEMPLATE", and "generator" tags vastly outnumber the usage of all other tags, with the exception of the "subject" variant used extensively in the Department of Natural Resources web site.

Metadata Term Dept. of Transportation Secretary of State Dept. of Revenue Dept. of Natural Resources Pollution Control Board
Tags from the ISL Metadata Generator
http-equiv 70 783 0 1734 170
CLASSIFICATION 114 105 0 277 0
DISTRIBUTION 48 11 2 104 0
RATING 49 11 0 52 0
contactState 48 12 0 86 0
siteTitle 48 11 0 79 0
KEYWORDS 56 195 0 272 136
DESCRIPTION 55 22 0 224 136
createDate 2 11 0 18 0
dateofLastModification 1 11 0 18 0
timePeriodTextual 1 10 0 16 0
MEDIUM 48 11 0 16 0
PermanentPublicAccess 48 11 0 16 0
originalControlIdentifier 1 10 0 14 0
agencyProgram 1 10 0 14 0
LINKAGE 1 11 0 16 0
LANGUAGE 48 11 0 23 0
govType 48 10 0 23 0
contactName 1 10 0 18 0
contactOrganization 1 10 0 23 0
contactStreetAddress1 1 10 0 21 0
contactStreetAddress2 1 10 0 20 0
contactCity 1 10 0 28 0
contactZipCode 1 10 0 23 0
contactNetworkAddress 1 11 0 23 0
contactPhoneNumber 1 10 0 23 0
contactFaxNumber 1 10 0 16 0
originatorJurisdiction 1 11 0 23 0
originatorOffice 1 11 0 22 0
originatorDepartment 1 10 0 23 0
originatorDivision 1 10 0 18 0
originatorSection 1 10 0 14 0

Tags observed to be in use

GENERATOR 735 74 1676 2544 124
BUILD 0 0 0 659 0
DATE 0 0 0 90 0
AUTHOR 0 0 0 396 0
ProgID 18 7 0 437 119
VERSION 0 0 0 260 0
CreaTim 0 0 0 151 0
DOCCOMM 0 0 0 207 0
CreatedBy 0 0 0 164 0
TEMPLATE 7 0 0 10 0
THESAURUS 0 41 0 15 0
COPYRIGHT 0 0 0 4 0
DC.Title 0 0 0 6 0
DC.Creator 0 0 0 2 0
Microsoft Theme 0 0 0 56 0
Resource-Type 0 0 0 12 0
ROBOT 0 0 0 8 0
SUBJECT 1 0 1632 0 0
VI60_DTCScriptingPlatform 0 0 0 0 57
All other tags 0 7 0 83 0

Table 2. Frequency of use of specific META tags at the test sites.

In preparing Table 2, a number of metadata terms were found that did not match those in use at other sites. For examples, note;

No Mechanism to Label Time-Variant Projects

Upon inspection of the Department of Natural Resources web site [18] in view of its 83 META tags not matching those most frequently used elsewhere, it was discovered that the site uses several tags that seem related to division of work along project or regional lines.

There is no mechanism currently declared in the Metadata Generator for exactly such metadata, although the "agencyProgram" addresses one of the upper levels of abstraction along these lines.

The Metadata Generator does offer a hierarchical decomposition of an organization (see the terms beginning with "organization" in Table 2), but in matrix organizations and ad hoc teaming arrangements, projects may involve employees drawn from many parts of the organization. A plausible set of META names to indicate projects or regions might be;

  1. "agencyProject", and
  2. "agencyProjectPhase".

For purposes of this paper, approval of those proposed variants of the META tag will be assumed. This assumption will result in the creation of data storage slots in the schema, which may at worst go unfilled. If changes to this assumption are necessary, an updating memorandum will be issued and placed in the software documentation.

Existing Use of Unstructured Subject Classifications

The Department of Revenue web site [17] uses an unstructured SUBJECT variant of the META tag in lieu of the CLASSIFICATION tag developed and recommended by ISL. Further examination of SUBJECT variant use shows it to contain information that is a mixture of what the Metadata Generator [22] advocates be used in the "siteTitle" and "timePeriodTextual" variants, and as various enumerated choices in the CLASSIFICATION variant such as those beginning with "Government finance and taxes" (see Appendixes C and D).

In the interest of optimal support for the Department of Revenue web site, the design of the Metadata Generator should be revisited, considering the list of SUBJECT variants already in use at the Department of Revenue. It is not known if the requisite cooperation of the site maintenance staff for the Department of Revenue web site, or any site, will be readily obtainable to support any revisions within their documents.

  • Values such as the following were encountered in the SUBJECT variants.

    Metadata Supplied by or Discernable from Spider Operations and Version Control Systems

    The wget spider will provide date and time information on all files it retrieves. If the server provides a date and time of last modification of the file, that information will be used in the file wget creates under the UNIX file system. Otherwise, the local (spider) time as of the downloading of the file will be used on the file. The date and time on the file are therefore not necessarily analogous to a date of publication, but it is the only information available. An author may consider publication to occur when a document leaves his or her desk, or when it is physically printed (e.g., in a brochure or letter), so dates associated with web versions of documents should always remain somewhat suspect.

    Further, the way some servers are configured, the date and time returned with a file are always the time of file retrieval. Thus, file date and time cannot be reliably used to indicate that a freshly downloaded file is in fact different from the previous copy of that file held in the archive. The contents of the freshly downloaded file must be compared against the contents of the file in the archive to determine if the file in fact differs. Only detection of a difference should trigger posting the new version of the file to the archive.

    An interpretation of the value of the file date and time, for files stored in the archive, would then be "the date and time of file creation or change (if known), or of the moment when that creation or change was first detected by the archival system." A metadata field is thus needed to store this date and time information, which may well differ from any official date of publication information provided, or not, in the META tags internal to the document.

    The UNIX file system also provides the sizes of each file. This information is sometimes, but less reliably, included within the HTTP header used to transmit the file from the server to the spider. Size information may be useful in some searches, and is not provided in any of the other META tags previously mentioned.

    A number of metadata fields are optionally transmitted by a web server [43]. A change in web site operations simply to facilitate archive operations is not expected. But, while these fields cannot be relied upon to arrive, the possibility of their arrival should be supported. Extensive spider program code modifications to capture these fields is not possible due to budgetary limitations. The HTTP header fields which appear useful as metadata for the archive, and which provide information not available directly from the UNIX file system are;

    Other HTTP header fields such as "cost", "link", "URI", and "Message-Id" are either currently unspecified, or critically dependent upon the existence of nomenclature, lookup, and broker services that do not now exist.

    The spider in use (wget) does not pass any HTTP header information along with the files themselves. However, the verbose version of the spider's execution log does contain some of the header information, in a manner that should easily be extractable via parsing, and therefore useable in the archive. The log identifies;

    Metadata Integration and Schema Definition

    The metadata fields derived from these three sources are not organized and standardized into compound structures to indicate relatedness, groupings, or hierarchies. Rather, the relative immaturity and lack of enforcement of metadata usage standards and conventions currently allows authors to inject much interpretation into how, and in what combinations, they use the various fields. Accordingly, a "flat file" organization (i.e., where all the fields are siblings within a single record) of the metadata is all that is indicated at the moment. When field groupings become more standardized, compound records can be created along the lines of database schema design principles [44].

    Dealing with the large number of site-unique META variants evidenced in Table 2 raises the problems of whether such variants should be retained, and if so, how are they to be retained so as to support subsequent search? The mechanism to be used in the archive will be to provide a META variant essentially meaning "none of the above" -- that a META tag was encountered that does not match the tags officially defined or supported. That archived tag will have a NAME field that contains the same contents as the tag encountered in the HTML document, with a prefixed "LocalTag--" (e.g., "ROBOT" would become "LocalTag--ROBOT"). The data contents of the archived tag will be the same as the data contents of the tag originally encountered in the HTML document. This will at least pass the metadata activities to date at the various agencies in to the search engine so that they may be used. Improving tag standardization will, when that occurs, improve search efficiency.

    The vast majority of retained metadata fields must be considered optional, in that they may not be available from source materials. It would be possible, say, under a grant, to retrofit values into these fields of the database in the archive. In such a way, explorations of more demanding forms of searching could be supported without requiring extensive, coordinated administrative assistance from the multiple agencies that operate State of Illinois web sites.

    The fields derived by this paper are as specified in Table 3. Fields from the Metadata Generator [22] are retained verbatim. Fields obtained from spider operation (excluding those of the HTTP header) are prefixed "Spider--". Fields obtained from the HTTP header (here, by parsing the execution log of the spider, are prefixed "HTTP--". Fields added as a result of this examination are prefixed "eArchives--". Fields from other sources are explained.

    Metadata Term Retain Existing Version Discard Map Onto Other Term(s) Remarks Resulting Field Name

    Tags from the ISL Metadata Generator

    http-equiv   x     N/A
    CLASSIFICATION x       CLASSIFICATION
    DISTRIBUTION x     Can be inferred for Illinois sites. DISTRIBUTION
    RATING x     Can be inferred for Illinois sites. RATING
    contactState x     Can be inferred for Illinois sites. contactState
    siteTitle x       siteTitle
    KEYWORDS x       KEYWORDS
    DESCRIPTION x       DESCRIPTION
    createDate x       createDate
    dateofLastModification x       dateofLastModification
    timePeriodTextual x       timePeriodTextual
    MEDIUM x     Can be inferred, as all documents arrive here as a web page. MEDIUM
    PermanentPublicAccess x       PermanentPublicAccess
    originalControlIdentifier x       originalControlIdentifier
    agencyProgram x       agencyProgram
    LINKAGE x       LINKAGE
    LANGUAGE x       LANGUAGE
    govType x     Can be inferred for Illinois sites. govType
    contactName x       contactName
    contactOrganization x       contactOrganization
    contactStreetAddress1 x       contactStreetAddress1
    contactStreetAddress2 x       contactStreetAddress2
    contactCity x       contactCity
    contactZipCode x       contactZipCode
    contactNetworkAddress x       contactNetworkAddress
    contactPhoneNumber x       contactPhoneNumber
    contactFaxNumber x       contactFaxNumber
    originatorJurisdiction x       originatorJurisdiction
    originatorOffice x       originatorOffice
    originatorDepartment x       originatorDepartment
    originatorDivision x       originatorDivision
    originatorSection x       originatorSection

    Tags observed to be in use

    GENERATOR   x     N/A
    BUILD     originalControlIdentifier,

    dateofLastModification
      as mapped, or with use of "Local--" prefix.
    DATE     createDate   as mapped, or with use of "Local--" prefix.
    AUTHOR     contactName,

    contactOrganization
      as mapped, or with use of "Local--" prefix.
    ProgID   x     N/A
    VERSION     originalControlIdentifier,

    dateofLastModification
      as mapped, or with use of "Local--" prefix.
    CreaTim     createDate   as mapped, or with use of "Local--" prefix.
    CreatedBy     contactName   as mapped, or with use of "Local--" prefix.
    TEMPLATE   x     N/A
    Microsoft Theme   x     N/A
    SUBJECT         Local--SUBJECT, until time can be spent to re-catalog as per [22].
    VI60_DTCScriptingPlatform   x     N/A
    All other tags encountered       Retain, as a general rule. Use the "Local--" prefix.

    And, see discussion concerning the agencyInitiative series of tags.

    Tags obtainable from spider operation

    file timestamp       Use as a secondary source of information to fill in the ISL field dateofLastModification as this derives entirely from server-provided data. dateofLastModification
    file size         Spider--FileSize

    Tags obtainable from HTTP headers

    Expires       Not implemented in this project. HTTP--Expires
    Content-Type       MIME HTTP--Content-Type
    Last-Modified       Use as a secondary source of information to fill in the ISL field dateofLastModification as the server only knows of modification of the server-resident copy of the document, not the master document. HTTP--FileDate
    Derived-From       Not implemented under current budget limits. HTTP--DerivedFrom
    server type       Server type designation indicates if prohibited characters may appear in names. HTTP--ServerType
    Content-Length       Information in this field is less dependable than the size as provided by the UNIX file system. Spider--FileSize
    all other HTTP header fields   x     N/A

    Tags added by this technical report

    agencyProject       After ISL's agencyProgram agencyProject or

    eArchives--agencyProject
    agencyProjectPhase       After ISL's agencyProgram agencyProjectPhase or

    eArchives--agencyProjectPhase

    Table 3. Metadata fields to be retained in the archive.

    CAPSTONE EFFORTS: THE SEARCH FACILITY

    The design, collection, and retention of document metadata is to support the eventual construction of a search and retrieval facility for the archive. This search facility is proposed to incorporate keyword search, metadata search, and retrieval of all applicable versions of a target document. That is, if a search indicates versions 2 through 4 of a given document are all relevant, the search engine should identify this to the user, and should offer to retrieve any of those versions of the document.

    It is planned to begin the evolution of this search facility using the Isearch freeware keyword search engine. Isearch is SGML-markup-aware, that is, it can constrain its searches on the basis of angle-bracket-delineated wrapper tags such as those used in SGML, XML, and HTML.

    One open design issue is how to identify to the user that a retrieved document is an archived version and not the current information from the State of Illinois. A number of options are possible, such as wrapping the retrieved site in possibly another HTML FRAME, or of injecting a block of text and imagery at the top of a retrieved HTML page, as is done by Google [45]. But, multiply nested FRAMEs would be undesirable from an understandability standpoint. The implementation selected will have to be in compliance with the American's with Disabilities Act, which is problematic when FRAMEs are used.

    The larger open problem is how to convince state agencies that are unfamiliar with basic principles of library-like cataloging that use of descriptive, and authority-controlled metadata is of considerable value to web site users. The system designed here would support transforming a site into a "showpiece" demonstration vehicle through the retroactive addition of metadata fields the original authors did not supply. Suitably trained ISL staff could examine the pages of the demonstration site, and add the missing metadata to the archives database. Then, suitable demonstration queries could be constructed that show how use of the ISL metadata improves the support for searching of the site and its archives.

    REFERENCES

    [1] The Illinois State Library homepage
    http://www.cyberdriveillinois.com/library/isl/isl.html

    [2] State Library of Ohio web page
    http://winslo.state.oh.us/

    [3] Institute of Museum and Library Service homepage
    http://www.imls.gov/

    [4] Larry S. Jackson, 2001. Preserving Electronic Publications project, software development public webpage.
    http://realfun.isrl.uiuc.edu/eArchives/

    [5] Larry S. Jackson, 2001. Preserving Electronic Publications project, private status report and demonstrations webpage
    http://realfun.isrl.uiuc.edu/eArchives/private/
    User ID and password provided separately -- contact the author at lsjackso@uiuc.edu.

    [6] Linux homepage
    http://www.linux.org/

    [7] Free Software Foundation. GNU wget homepage
    http://www.gnu.org/software/wget/wget.html

    [8] Concurrent Versions System (CVS) homepage
    http://www.cvshome.org/

    [9] Apache Software Foundation. Apache homepage
    http://www.apache.org/

    [10] Mohammed J. Kabir. Apache Server Administrator's Handbook. IDG Books Worldwide, Inc., Foster City, CA. 1999.

    [11] mySQL homepage
    http://www.mysql.com/

    [12] Coalition for Networked Information Discovery and Retrieval. Isearch Text Search Engine web page.
    http://www.cnidr.org/isearch.html

    [13] Milori ISearch web page
    http://www.milori.com/products/isearch/

    [14] Illinois Department of Transportation homepage
    http://www.dot.state.il.us/

    [15] Illinois Secretary of State homepage
    http://www.sos.state.il.us/
    and
    http://www.cyberdriveillinois.com/

    [16] Illinois Pollution Control Board homepage
    http://www.ipcb.state.il.us/

    [17] Illinois Department of Revenue homepage
    http://www.revenue.state.il.us/

    [18] Illinois Department of Natural Resources homepage
    http://dnr.state.il.us/

    [19] Illinois State Library. "Resources for Illinois State Agency Webmasters" web page
    http://www.finditillinois.org/metadata/webmasters.htm

    [20] Illinois Secretary of State's Office. "About the Office" webpage
    http://www.cyberdriveillinois.com/about/about.html

    [21] "Find-It! Illinois" homepage
    http://finditillinois.org/

    [22] Illinois State Library. "Metadata Generator" web page
    http://www.finditillinois.org/metadata/index.html

    [23] Find-It! Illinois, Agency Guide to Indexing Websites
    http://www.finditillinois.org/metadata/indexingsites.htm

    [24] Illinois State Archives webpage
    http://www.cyberdriveillinois.com/departments/archives/archives.html

    [25] Mark Sorrenson, Assistant Director, Illinois State Archives, December 3, 2001. Explanations during PEP project meeting at the Illinois State Library in Springfield, IL.

    [26] Illinois State Archives, April, 2001. Electronic Records and the Illinois Local Records Act: Guidelines for Using Electronic Records webpage
    http://www.cyberdriveillinois.com/departments/archives/records_management/electrecs.html

    [27] American Library Association. Rare Books and Manuscripts Section (RBMS) web page
    http://www.rbms.nd.edu

    [28] The Inquiry Page project homepage
    http://www.inquiry.uiuc.edu/

    [29] The National Science Foundation's Digital Libraries Initiative ("DLI-2") project coordination homepage
    http://www.dli2.nsf.gov/

    [30] Bill Rosenblatt. Learning the Korn Shell. O'Reilly & Associates, Inc., Sebastopol, CA. 1993.

    [31] PERL homepage
    http://www.perl.com/

    [32] Comprehensive PERL Archive Network homepage
    http://www.cpan.org/

    [33] Alfred V. Aho, Brian W. Kernighan, Peter J. Weinberger. The AWK Programming Language. Addison-Wesley Publishing Co., Reading, MA. 1988.

    [34] Linda Lamb. Learning the vi Editor. O'Reilly & Associates, Inc., Sebastopol, CA. 1994.

    [35] World Wide Web Consortium. "Extensible Markup Language (XML)" web page
    http://www.w3.org/XML/

    [36] United States Code. Section 508 of the Rehabilitation Act (29 U.S.C. § 794(d))
    http://www.access-board.gov/about/Rehab%20Act%20Amend-508.htm

    [37] Larry S. Jackson, 2001. Preserving Electronic Publications project, private complete working copy of the State of Illinois Department of Transportation website
    http://www.canis.uiuc.edu/~lsjackso/eArchives/DeptTransportation/www.dot.state.il.us/
    user ID and password provided separately.

    [38] Nancy Zussy. The Birth and Development of Find-It!: Washington State's Government Information Locator Service", in First Monday, volume 5, number 6 (June 2000),
    http://www.firstmonday.dk/issues/issue5_6/zussy/index.html

    [39] Washington State Library homepage
    http://www.statelib.wa.gov/

    [40] Illinois Government Information webpage
    http://findit.sos.state.il.us:2000/compass

    [41] Dublin Core Metadata Initiative web page
    http://dublincore.org/

    [42] Arlene G. Taylor. The Organization of Information. Libraries Unlimited, Inc., Englewood, CO. 1999.

    [43] World Wide Web Consortium. "Object MetaInforamtion" web page.
    http://www.w3.org/Protocols/HTTP/Object_Headers.html

    [44] Jeffrey D. Ullman. Principles of Database Systems. Computer Science Press, Rockville, MD. 1980.

    [45] Google search engine web page
    http://www.google.com

    APPENDIXES

    Appendix A: XML Formatting of System Control Parameters

    A.1 -- Document Type Definition

    <?xml version="1.0" encoding="UTF-8"?>
    <!--This structure defines the control records for the Illinois State Library's electronic document archive project. The current assumption is that the GNU-licensed wget spider will be used to perform the web file retrievals, and these fields match its parameter choices. These parameters are supplied for each site that is to be archived to enable performance tuning to the characteristics of the site. Additionally, a default record named -TBD- can globally override the hard-coded defaults listed here. Any provided contents of the default record take precedence over the hard-coded defaults listed here. Contents of a site-specific record take precedence over both the hard-coded default behaviors and the instructions of the default record. If it is desired to not override a default, do not include the associated (optional) field, as zero or other numbers may have meaning to wget and may cause undesired behavior modifications. The present design addresses only HTTP file transfer. Copyright UIUC GS-LIS 2001, all rights reserved. (The intention is for this material to be placed under widely available no-cost license when completed.) This version dated 20 November 2001 @ 1421, by Larry S. Jackson lsjackso@uiuc.edu.-->
    <!ELEMENT SiteParameters (Host, SlangName?, TopURL?, PollPeriodicity?, NextPolling, FileOfURLs?, FileIsHtml?, Delay?, Debug?, Retries?, Timeout?, UserID?, Password?, IgnoreLength?, DepthLimit?, RelativizeLinks, DomainsAccepted?, DomainsExcluded?, SpanHosts?, IncludedDirectories?, ExcludedDirectories?, ParentDirectories?)>
    <!ELEMENT Host (#PCDATA)>
    <!--Short name, usually the host machine's dot name. Used for filing download results. Defaults to localhost.-->
    <!ELEMENT SlangName (#PCDATA)>
    <!--An English description of the site, usually the name of the responsible agency or individual. Defaults to be the same as the Host field.-->
    <!ELEMENT TopURL (#PCDATA)>
    <!--URL of topmost node, provided to the spider on the command line (the preferred mode of spider invocation). If this site is all one tree/subtree. If this field is provided, the field FileOfURLs is not ever referenced. Either TopURL or FileOfURLs must be provided. Default is the value of the Host field, prefixed by 'http://'.-->
    <!ELEMENT PollPeriodicity (#PCDATA)>
    <!--Nominal time, in minutes, between successive initiations of the spider on this whole site. Default is 1 week. Notes: 24 hours = 1440 minutes. 1 week = 10080 minutes. 30 days = 43200 minutes. 1 solar year = 525960 minutes.-->
    <!ELEMENT NextPolling (#PCDATA)>
    <!--Nominal time of the next scheduled spider initiation on this whole site. If that time has past, initiate the spider now, and revise this field to be a time determined by now+PollPeriodicity.-->
    <!ELEMENT FileOfURLs (#PCDATA)>
    <!ELEMENT FileIsHtml (#PCDATA)>
    <!--"yes" or "no", depending upon if FileOfURLs is to be interpreted as HTML or not. Default is no. Used with the \-\-force-html switch.-->
    <!ELEMENT Delay (#PCDATA)>
    <!--Number of seconds between subsequent retrievals. Default is 9. Used with the \-\-wait switch.-->
    <!ELEMENT Debug (#PCDATA)>
    <!--"yes" or "no", to indicate if debug output is desired. Default is no. Used with the \-\-debug switch.-->
    <!ELEMENT Retries (#PCDATA)>
    <!--Number of retries if failed or incomplete file transfer happens. Default is 7. Used with the \-\-tries switch.-->
    <!ELEMENT Timeout (#PCDATA)>
    <!--Used on file read with the \-\-timeout switch. Value interpreted as a number of seconds. The wget vendor very much recommends their default of 900 not be tampered with. Default is 900. A value of 0 is interpreted to imply infinity, and is particularly risky.-->
    <!ELEMENT UserID (#PCDATA)>
    <!--Only used if site uses HTTP basic authentication. Default is not to use. Paired with Password field. Used with the \-\-httpd-user switch.-->
    <!ELEMENT Password (#PCDATA)>
    <!--Only used if site uses HTTP basic authentication. Default is not to use. Paired with UserID field. Used with the \-\-http-passwd switch.-->
    <!ELEMENT IgnoreLength (#PCDATA)>
    <!--Used only on HTTPd sites sending unreliable Content-Length headers. "yes" or "no", default is "no". Used with the \-\-ignore-length switch.-->
    <!ELEMENT DepthLimit (#PCDATA)>
    <!--Number of levels allowed in recursion. Default is 10. Used with the \-\-level switch.-->
    <!ELEMENT RelativizeLinks (#PCDATA)>
    <!--Make embedded links to same site all relative. "yes" or "no", default is "yes". Without this, rehosting into the archive subdirectory will almost certainly break any links to images or other parts of the document, as the host name will have changed, and will not equate to a docroot someplace.-->
    <!ELEMENT DomainsAccepted (#PCDATA)>
    <!--Comma-separated list, used with the \-\-domains switch. Default behavior is to only accept files from the machine named in the URL, thus generally obviating use of this field.-->
    <!ELEMENT DomainsExcluded (#PCDATA)>
    <!--Comma-separated list, used with the \-\-exclude-domains switch.-->
    <!ELEMENT SpanHosts (#PCDATA)>
    <!--"yes" or "no", default "no". Choosing yes could lead to attempting to retrieve the whole web, unless otherwise limited. Used with the \-\-span-hosts switch.-->
    <!ELEMENT IncludedDirectories (#PCDATA)>
    <!--A comma-separated list of directories to be included. Default is all directories. Used with the \-\-include-directories switch.-->
    <!ELEMENT ExcludedDirectories (#PCDATA)>
    <!--A comma-separated list of directories to be excluded. Default is none. Used with the \-\-exclude-directories switch.-->
    <!ELEMENT ParentDirectories (#PCDATA)>
    <!--Are files stored in parent directories to be retrieved? "yes" or "no", default "yes". Used with the \-\-no-parent switch.-->
    <!--Remarks on other wget switches: If a switch is not listed above, or below, it is not used in this project. Switches that are hard-wired are; \-\-output-file (writes to a site-specific filename), \-\-timestamping (always on), \-\-server-response (always on), \-\-force-directories (writes to a site-specific subtree), \-\-cache (always off), \-\-recursive (always on)-->
    <!--Open design questions: Would \-\-header=additional-header be useful to delineate Accept-Language? Would the \-\-convert-links switch be useful in developing a site-full of apparently old content (i.e., by removing links to the 'real' web)?-->
    <!--Name of file containing the multiple URLs that define the archived portion of this site. Used only if the archived portion does not form a complete tree/subtree. Either TopURL or FileOfURLs must be provided. Used with the \-\-input-file switch.-->

    A.2 -- System-Wide Default Values

    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE SiteParameters SYSTEM "F:\Docs\Docs\- eArchive -- IMLS and IL State Library\Specs\wget.dtd">
    <SiteParameters>
    <Host/>
    <SlangName/>
    <TopURL/>
    <PollPeriodicity>10080</PollPeriodicity>
    <NextPolling/>
    <FileOfURLs/>
    <FileIsHtml>no</FileIsHtml>
    <Delay>9</Delay>
    <Debug>no</Debug>
    <Retries>7</Retries>
    <Timeout>900</Timeout>
    <UserID/>
    <Password/>
    <IgnoreLength>no</IgnoreLength>
    <DepthLimit>10</DepthLimit>
    <RelativizeLinks>yes</RelativizeLinks>
    <DomainsAccepted/>
    <DomainsExcluded/>
    <SpanHosts>no</SpanHosts>
    <IncludedDirectories/>
    <ExcludedDirectories/>
    <ParentDirectories>yes</ParentDirectories>
    </SiteParameters>

    A.3 -- Department of Transportation Website Parameters

    <?xml version="1.0" encoding="UTF-8"?>
    <!DOCTYPE SiteParameters SYSTEM "F:\Docs\Docs\- eArchive -- IMLS and IL State Library\Specs\wget.dtd">
    <SiteParameters>
    <Host>www.dot.state.il.us</Host>
    <SlangName>DeptTransportation</SlangName>
    <TopURL/>
    <PollPeriodicity/>
    <NextPolling/>
    <FileOfURLs/>
    <FileIsHtml/>
    <Delay/>
    <Debug/>
    <Retries/>
    <Timeout/>
    <UserID/>
    <Password/>
    <IgnoreLength/>
    <DepthLimit/>
    <RelativizeLinks/>
    <DomainsAccepted/>
    <DomainsExcluded/>
    <SpanHosts/>
    <IncludedDirectories/>
    <ExcludedDirectories/>
    <ParentDirectories/>
    </SiteParameters>

    Appendix B: File Types Currently Supported by the Archive

    Binary file types:

    Image formats:
    *.png *.jpg *.jpe *.jpeg *.bmp *.gif *.tif *.tiff
    Document formats:
    *.doc *.dot *.pdf *.akf *.apf *.pdx *.ps *.eps *.rtf *.wri *.ppt *.mpp *.mpt *.xls *.pps *.pot
    Audio and music formats:
    *.mp3 *.au *.aa *.aif *.aifc *.aiff *.323 *.ra *.ram *.mid *.midi
    Movie formats:
    *.mpeg *.mpg *.rm *.fla *.cam *.avi *.rf *.rv
    Archived and/or compressed files:
    *.zip *.jar *.arc *.arj *.b64 *.binhex *.gz *.tar *.Z
    Executable code:
    *.exe *.class *.vbe *.vbs
    Fonts:
    *.fon
    Databases:
    *.pdb
    Other binary, files:
    *.swf *.wmf *.sit *.mso

    ASCII-based formats that should not be a problem for CVS:

    *.htm *.html *.php3 *.css *.js *.mso *.mht *.s *.txt *.xml *.sgm *.sgml *.log *.dtd *.java *.vrm *.vrml *.xhtml *.xsd *.shtml *.asp

    Mixed-mode material that is an open problem for CVS:

    Anything served by a script, which may or may not have "cgi" somewhere in the URL. File type cannot necessarily be recognized, although use of the server-supplied MIME type may help.

    Appendix C: Sample Metadata from the Illinois State Library's Metadata Generator Web Page

    <link rel="WAGILS" href="http://www.statelib.wa.gov/info_rscrs/dbs_tools/find-it/metadesc.htm">
    <meta http-equiv="PICS-Label" content='(PICS-1.1 "http://www.weburbia.com/safe/ratings.htm" l r (s 0))'>
    <meta name="classification" content="Government">
    <meta name="distribution" content="Global">
    <meta name="rating" content="General">
    <meta name="contactState" content="IL">
    <meta name="siteTitle" content="Metadata Sample Appendix, 2001">
    <meta name="keywords" content="metadata; GSLIS; electronic archives; web spider; web crawler">
    <meta name="description" content="This site describes work performed under a sub-award of the IMLS "Preserving Electronic Publications" project to GSLIS.">
    <meta name="createDate" content="12062001">
    <meta name="dateofLastModification" content="12102001">
    <meta name="timePeriodTextual" content="2001-2002, US GFY 2002.">
    <meta name="medium" content="Web Site">
    <meta name="PermanentPublicAccess" content="yes">
    <meta name="originalControlIdentifier" content="LSJ-eArch-1.0">
    <meta name="agencyProgram" content="IMLS PEP">
    <meta name="linkage" content="http://www.isrl.uiuc.edu/">
    <meta name="language" content="EN">
    <meta name="govType" content="State">
    <meta name="contactName" content="Larry S. Jackson">
    <meta name="contactOrganization" content="GSLIS">
    <meta name="contactStreetAddress1" content="501 East Daniel">
    <meta name="contactStreetAddress2" content="">
    <meta name="contactCity" content="Champaign">
    <meta name="contactZipcode" content="61821">
    <meta name="contactNetworkAddress" content="lsjackso@uiuc.edu">
    <meta name="contactPhoneNumber" content="+1-217-333-0949">
    <meta name="contactFaxNumber" content="+1-217-244-3302">
    <meta name="originatorJurisdiction" content="State of Illinois">
    <meta name="originatorOffice" content="University of Illinois">
    <meta name="originatorDepartment" content="Graduate School of Library and Information Science">
    <meta name="originatorDivision" content="Information Systems Research Laboratory">
    <meta name="originatorSection" content="Electronic Archives Project">
    <meta name="Classification" content="Information management and resources">
    <meta name="Classification" content="Information management and resources--Information resources">
    <meta name="Classification" content="Information management and resources--Information resources--Government information">
    <meta name="Classification" content="Information management and resources--Information resources--Libraries and archives">

    Appendix D: Illinois State Library Subject Headings Used in the Metadata Generator for the CLASSIFICATION Variant of the META Tag


    Agriculture and food production
    Agriculture and food production--Agricultural finance
    Agriculture and food production--Agricultural statistics
    Agriculture and food production--Agriculture pages for kids
    Agriculture and food production--Aquaculture
    Agriculture and food production--Crops
    Agriculture and food production--Farmlands
    Agriculture and food production--Fisheries
    Agriculture and food production--Irrigation
    Agriculture and food production--Livestock
    Agriculture and food production--Pesticides
    Agriculture and food production--Soil erosion
    Business and industry
    Business and industry--Banking
    Business and industry--Business and professional licenses
    Business and industry--Business and professional licenses--Business licenses
    Business and industry--Business and professional licenses--Professional licenses
    Business and industry--Business pages for kids
    Business and industry--Doing business with government
    Business and industry--Doing business with government--Defense contracts
    Business and industry--Doing business with government--Government contracts
    Business and industry--Doing business with government--Government purchasing
    Business and industry--Doing business with government--Government surplus property
    Business and industry--Doing business with government--Vendor forms
    Business and industry--Economic development
    Business and industry--Economic forecasts
    Business and industry--Employment--Employment benefits
    Business and industry--Employment--Job searching
    Business and industry--Employment--Job training
    Business and industry--Employment--Labor laws
    Business and industry--Employment--Labor laws--Affirmative action
    Business and industry--Employment--Labor laws--Equal employment opportunity
    Business and industry--Employment--Labor laws--Workers' rights
    Business and industry--Employment--Occupations
    Business and industry--Employment--Professional licenses
    Business and industry--Employment--Retirement
    Business and industry--Employment--State employment forms
    Business and industry--Employment--Unemployment
    Business and industry--Employment--Unions
    Business and industry--Employment--Wages and salaries
    Business and industry--Employment--Workers' compensation
    Business and industry--Employment--Workfare
    Business and industry--High tech industry
    Business and industry--Insurance
    Business and industry--Insurance--Automobile insurance
    Business and industry--Insurance--Health insurance and health care coverage
    Business and industry--Insurance--Health insurance and health care coverage--Disability insurance
    Business and industry--Insurance--Health insurance and health care coverage--HMOs
    Business and industry--Insurance--Health insurance and health care coverage--Managed care
    Business and industry--Insurance--Health insurance and health care coverage--Medicaid
    Business and industry--Insurance--Health insurance and health care coverage--Medicare
    Business and industry--Insurance--Health insurance and health care coverage--Uninsured people
    Business and industry--Insurance--Health insurance--Uninsured people--Workers' compensation
    Business and industry--Insurance--Insurance forms
    Business and industry--Insurance--Life insurance
    Business and industry--Investment
    Business and industry--Labor laws
    Business and industry--Labor laws--Affirmative action
    Business and industry--Labor laws--Equal employment opportunity
    Business and industry--Labor laws--Workers' rights
    Business and industry--Manufacturing
    Business and industry--Mining
    Business and industry--Small business
    Business and industry--Special libraries
    Business and industry--Trade and commerce
    Business and industry--Unclaimed property
    City and town government
    City and town government--Elected municipal officials
    City and town government--Elected municipal officials--Mayors
    City and town government--Local government relations
    City and town government--Mayors
    City and town government--Municipal agencies
    City and town government--Municipal employees
    City and town government--Municipal employees--Elected municipal officials--Mayors
    City and town government--Municipal employees--Local police
    City and town government--Municipal facilities
    City and town government--Municipal finance and audits
    City and town government--Municipal finance and audits--Grants made by states
    City and town government--Municipal finance and audits--Grants made by states--Grants application forms
    Consumer protection and public safety
    City and town government--Municipal government purchasing
    City and town government--Municipal ordinances
    County government
    County government--County agencies
    County government--County codes
    County government--County employees
    County government--County employees--Elected county officials
    County government--County employees--Local police
    County government--County facilities
    County government--County finance and audits--Grants made by states
    County government--County finance and audits--Grants made by states--Grant application forms
    County government--County government purchasing
    County government--Elected county officials
    County government--Local government relations
    Education
    Education--Adult and continuing education
    Education--Adult and continuing education--Adult education
    Education--Adult and continuing education--Continuing education
    Education--Adult and continuing education--Job training
    Education--Athletic associations
    Education--Directories of educational institutions
    Education--Distance learning
    Education--Early childhood education
    Education--Education pages for kids
    Education--Education programs
    Education--Educational finance
    Education--Educational finance--Federal education grants
    Education--Educational finance--State aid to higher education
    Education--Educational finance--State aid to schools
    Education--Educational finance--Student financial aid
    Education--Educational finance--Tuition tax credits and school vouchers
    Education--Educational policies
    Education--Educational policies--School waivers
    Education--Educational technology
    Education--English as a second language
    Education--Extracurricular activites
    Education--Higher education
    Education--Higher education--Academic libraries
    Education--Higher education--Affirmative action
    Education--Higher education--Community and technical colleges
    Education--Higher education--Graduate schools
    Education--Higher education--Private four-year colleges and universities
    Education--Higher education--Public four-year colleges and universities
    Education--Higher education--State aid to higher education
    Education--Higher education--Student financial aid
    Education--Home schooling
    Education--Libraries and archives
    Education--Libraries and archives--Academic libraries
    Education--Libraries and archives--Library systems
    Education--Libraries and archives--Public libraries
    Education--Libraries and archives--School media centers
    Education--Libraries and archives--Special libraries
    Education--Literacy
    Education--Private schools
    Education--Public schools
    Education--Public schools--Alternative learning and charter schools
    Education and--Public schools--Desegregation and diversity
    Education--Public schools--Elementary schools
    Education--Public schools--High schools
    Education--Public schools--Kindergartens
    Education--Public schools--Middle schools
    Education--Public schools--School boards
    Education--Public schools--School districts
    Education--Public schools--School media centers
    Education--Public schools--School waivers
    Education--Public schools--State aid to schools
    Education--School personnel
    Education--School personnel--Certification of school personnel
    Education--School personnel--Educational support staff
    Education--School personnel--School administrators
    Education--School personnel--Teachers
    Education--School personnel--Teachers--Certification of school personnel
    Education--School personnel--Teachers--Teachers' retirement system
    Education--School personnel--Teachers--Teachers' retirement system--Teachers' retirement forms
    Education--Special education
    Education--Students
    Education--Students--College students
    Education--Students--College students--Student financial aid
    Education--Students--Elementary school students
    Education--Students--High school students
    Education--Students--Kindergarten students
    Education--Students--Middle school students
    Education and--Test scores
    Forms
    Federal government
    Federal government--Federal agencies
    Federal government--Federal aid
    Federal government--Federal aid--Federal grants
    Federal government--Federal aid--Federal grants--Federal education grants
    Federal government--Federal government purchasing
    Federal government--Federal laws
    Federal government--Federal relations
    Federal government--Military
    Federal government--Military--Defense contracts
    Federal government--Military--Military bases and forts
    Federal government--Military--National Guard
    Federal government--Military--Reserves (military)
    Federal government--Military--Veterans' affairs
    Federal government--Military--Veterans' affairs--Veterans' benefit forms
    Federal government--U.S. Representatives
    Federal government--U.S. Senators
    Forms
    Forms--Accident report forms
    Forms--Complaint forms
    Forms--Grant application forms
    Forms--I-Pass forms
    Forms--Insurance forms
    Forms--Legal forms
    Forms--License application forms
    Forms--Retirement forms
    Forms--Retirement forms--Teachers' retirement forms
    Forms--State employment forms
    Forms--Student assistance forms
    Forms--Tax forms
    Forms--Vehicle registration forms
    Forms--Vendor forms
    Forms--Veterans' benefits forms
    Government finance and taxes
    Government finance and taxes--Bond issues
    Government finance and taxes--County finance and audits
    Government finance and taxes--County finance and audits--Grants made by states
    Government finance and taxes--County finance and audits--Grants made by states--Grant application forms
    Government finance and taxes--Educational finance
    Government finance and taxes--Educational finance--Federal education grants
    Government finance and taxes--Educational finance--State aid to higher education
    Government finance and taxes--Educational finance--State aid to schools
    Government finance and taxes--Educational finance--Student financial aid
    Government finance and taxes--Educational finance--Tuition tax credits and school vouchers
    Government finance and taxes--Federal grants
    Government finance and taxes--Federal grants--Federal education grants
    Government finance and taxes--Government revenues
    Government finance and taxes--Government revenues--Block grants
    Government finance and taxes--Government revenues--Federal aid
    Government finance and taxes--Government revenues--Federal aid--Federal grants
    Government finance and taxes--Government revenues--Federal aid--Federal grants--Federal education grants
    Government finance and taxes--Government revenues--Gambling
    Government finance and taxes--Government revenues--Gambling--Casinos
    Government finance and taxes--Government revenues--Gambling--Lottery
    Government finance and taxes--Government revenues--License fees
    Government finance and taxes--Government revenues--Other non-tax revenues
    Government finance and taxes--Government revenues--Taxes
    Government finance and taxes--Government revenues--Taxes--Business taxes
    Government finance and taxes--Government revenues--Taxes--Income taxes
    Government finance and taxes--Government revenues--Taxes--Liquor taxes
    Government finance and taxes--Government revenues--Taxes--Other taxes
    Government finance and taxes--Government revenues--Taxes--Property taxes
    Government finance and taxes--Government revenues--Taxes--Sales taxes
    Government finance and taxes--Government revenues--Taxes--Tax forms
    Government finance and taxes--Government revenues--Taxes--Tax laws
    Government finance and taxes--Government revenues--Taxes--Tobacco taxes
    Government finance and taxes--Government revenues--Taxes--Tuition tax credits and school vouchers
    Government finance and taxes--Municipal finance and audits
    Government finance and taxes--Municipal finance and audits--Grants made by states
    Government finance and taxes--Municipal finance and audits--Grants made by states--Grant application forms
    Government finance and taxes--Regional government finance
    Government finance and taxes--State finance
    Government finance and taxes--State finance--Grants made by states
    Government finance and taxes--State finance--Grants made by states--Grant application forms
    Government finance and taxes--State finance--Grants to states
    Government finance and taxes--State finance--Grants to states--Federal grants
    Government finance and taxes--State finance--Grants to states--Federal grants--Federal education grants
    Government finance and taxes--State finance--State aid to higher education
    Government finance and taxes--State finance--State aid to schools
    Government finance and taxes--State finance--State employee retirement system
    Government finance and taxes--State finance--State employee retirement system--Retirement forms
    Government finance and taxes--State finance--State employee retirement system--Retirement forms--Teachers' retirement forms
    Government finance and taxes--State finance--Teachers' retirement system
    Government finance and taxes--State finance--Teachers' retirement system--Teachers' retirement forms
    Health and medicine
    Health and medicine--Disease
    Health and medicine--Diseases--AIDS and HIV
    Health and medicine--Diseases--Alcohol and drug abuse
    Health and medicine--Diseases--Alzheimer's disease
    Health and medicine--Diseases--Cancer
    Health and medicine--Diseases--Communicable diseases
    Health and medicine--Diseases--Eating disorders
    Health and medicine--Diseases--Lyme disease
    Health and medicine--Diseases--Sexually transmitted diseases
    Health and medicine--Drugs and pharmacology
    Health and medicine--Family planning
    Health and medicine--Health care
    Health and medicine--Health care--Alternative medicine
    Health and medicine--Health care--Dental care
    Health and medicine--Health care--Emergency care
    Health and medicine--Health care--Hospice care
    Health and medicine--Health care--Hospitals and clinics
    Health and medicine--Health care--Hygiene and personal cleanliness
    Health and medicine--Health care--Medical equipment
    Health and medicine--Health care--Medical laboratories and testing
    Health and medicine--Health care--Medical laboratories and testing--Drug testing
    Health and medicine--Health care--Medical personnel
    Health and medicine--Health care--Medical studies and experimental treatments
    Health and medicine--Health care--Mental health
    Health and medicine--Health care--Nursing homes and assisted living facilities
    Health and medicine--Health care--Occupational therapy
    Health and medicine--Health care--Organ and tissue donations
    Health and medicine--Health insurance and health care coverage
    Health and medicine--Health insurance and health care coverage--Disability insurance
    Health and medicine--Health insurance and health care coverage--HMOs
    Health and medicine--Health insurance and health care coverage--Managed care
    Health and medicine--Health insurance and health care coverage--Medicaid
    Health and medicine--Health insurance and health care coverage--Medicare
    Health and medicine--Health insurance and health care coverage--Uninsured people
    Health and medicine--Health insurance and health care coverage--Workers' compensation
    Health and medicine--Health pages for kids
    Health and medicine--Medical malpractice
    Health and medicine--Nutrition
    Health and medicine--Physical fitness
    Health and medicine--Public health
    Health and medicine--Public health--Epidemiology
    Health and medicine--Public health--Immunizations
    Health and medicine--Public health--Product safety
    History and culture
    History and culture--Archaeology
    History and culture--Genealogy
    History and culture--Historic preservation
    History and culture--History and culture pages for kids
    History and culture--History of Illinois
    History and culture--History of Illinois--Abraham Lincoln
    History and culture--History of Illinois--Al Capone
    History and culture--History of Illinois--Carl Sandburg
    History and culture--History of Illinois--Edgar Lee Masters
    History and culture--History of Illinois--Vachel Lindsay
    History and culture--Landmarks
    History and culture--Literature
    History and culture--Literature--Carl Sandburg
    History and culture--Literature--Edgar Lee Masters
    History and culture--Literature--Vachel Lindsay
    History and culture--Local history
    History and culture--Museums and historical societies
    History and culture--Performing arts
    History and culture--Performing arts--Ethnic and folk art
    History and culture--State symbols and flags
    History and culture--State symbols and flags--State flags
    History and culture--Visual arts
    History and culture--Visual arts--Ethnic and folk art
    Information management and resources
    Information management and resources--Educational technology
    Information management and resources--Information management and resources pages for kids
    Information management and resources--Information policy
    Information management and resources--Information policy--Intellectual freedom and censorship
    Information management and resources--Information policy--Privacy
    Information management and resources--Information resources
    Information management and resources--Information resources--Directories
    Information management and resources--Information resources--Directories--Directories of educational institutions
    Information management and resources--Information resources--Directories--Government directories
    Information management and resources--Information resources--Government information
    Information management and resources--Information resources--Government information--Government directories
    Information management and resources--Information resources--Government information--Government records management
    Information management and resources--Information resources--Government statistics
    Information management and resources--Information resources--Government statistics--Agricultural statistics
    Information management and resources--Information resources--Crime statistics
    Information management and resources--Information resources--Demographic statistics
    Information management and resources--Information resources--Libraries and archives
    Information management and resources--Information resources--Libraries and archives--Academic libraries
    Information managment and resources--Information resources--Libraries and archives--Library systems
    Information management and resources--Information resources--Libraries and archives--Public libraries
    Information management and resources--Information resources--Libraries and archives--School media centers
    Information management and resources--Information resources--Libraries and archives--Special libraries
    Information management and resources--Information resources--Maps and gazetteers
    Information management and resources--Information resources--Vital records
    Information management and resources--Information resources--Vital records--Birth records
    Information management and resources--Information resources--Vital records--Census
    Information management and resources--Information resources--Vital records--Death records
    Information management and resources--Information resources--Vital records--Demographic statistics
    Information management and resources--Information resources--Vital records--Marriage and divorce records
    Information management and resources--Information technology
    Information management and resources--Information technology--Ecommerce
    Information management and resources--Information technology--Y2K issue
    Kids' pages
    Kids' pages--Agriculture pages for kids
    Kids' pages--Business pages for kids
    Kids' pages--Consumer protection pages for kids
    Kids' pages--Education pages for kids
    Kids' pages--Government pages for kids
    Kids' pages--Health pages for kids
    Kids' pages--History and culture pages for kids
    Kids' pages--Information management and resources pages for kids
    Kids' pages--Land use pages for kids
    Kids' pages--Law enforcement pages for kids
    Kids' pages--Law pages for kids
    Kids' pages--Natural resources pages for kids
    Kids' pages--Public safety pages for kids
    Kids' pages--Recreation pages for kids
    Kids' pages--Utilities and communications pages for kids
    Kids' pages--Voting and elections pages for kids
    Land use, development, and construction
    Land use, development, and construction--Architecture
    Land use, development, and construction--Building codes
    Land use, development, and construction--Cemeteries
    Land use, development, and construction--Construction
    Land use, development and construction--Farmlands
    Land use, development, and construction--Growth management
    Land use, development, and construction--Housing
    Land use, development, and construction--Housing--Affordable housing
    Land use, development, and construction--Housing--Group homes
    Land use, development, and construction--Housing--Homeless people
    Land use, development, and construction--Housing--Housing finance
    Land use, development and construction--Housing--Public housing
    Land use, development, and construction--Land development
    Land use, development, and construction--Land use pages for kids
    Land use, development, and construction--Land use planning
    Land use, development, and construction--Land use planning--Zoning
    Land use, development, and construction--Property laws
    Land use, development, and construction--Real estate
    Land use, development, and construction--Real estate--Property laws
    Land use and construction--Real estate--Zoning
    Law enforcement and courts
    Law enforcement--Attorneys
    Law enforcement and courts--Attorneys--Illinois Attorney General
    Law enforcement and courts--Attorneys--Public defenders
    Law enforcement and courts--Attorneys--Public prosecutors
    Law enforcement and courts--Attorneys--State bar association
    Law enforcement and courts--Corrections
    Law enforcement and courts--Corrections--Alternatives to incarceration
    Law enforcement and courts--Corrections--Alternatives to incarceration--Community based corrections
    Law enforcement and courts--Corrections--Alternatives to incarceration--Work release programs
    Law enforcement and courts--Corrections--Halfway houses
    Law enforcement and courts--Corrections--Prisons
    Law enforcement and courts--Corrections--Probation and parole
    Law enforcement--Courts
    Law enforcement and courts--Courts--Appellate courts
    Law enforcement and courts--Courts--Civil courts
    Law enforcement and courts--Courts--Civil courts--Medical malpractice
    Law enforcement and courts--Courts--Civil courts--Small claims courts
    Law enforcement and courts--Courts--Criminal courts
    Law enforcement and courts--Courts--Illinois Supreme Court
    Law enforcement and courts--Courts--Judges
    Law enforcement and courts--Courts--Small claims courts
    Law enforcement and courts--Courts--Superior courts
    Law enforcement and courts--Crime
    Law enforcement and courts--Courts--Al Capone
    Law enforcement and courts--Crime--Computer crime
    Law enforcement and courts--Crime--Crime statistics
    Law enforcement--Crime--Domestic violence
    Law enforcement and courts--Crime--Sex offenders
    Law enforcement and courts--Death penalty
    Law enforcement and courts--Juvenile justice and rehabilitation
    Law enforcement and courts--Law enforcement pages for kids
    Law enforcement and courts--Local police
    Law enforcement and courts--Sentencing
    Law enforcement--Sheriffs
    Law enforcement and courts--State police
    Law enforcement and courts--Victim assistance
    Laws and regulations
    Laws and regulations--Administrative rules
    Laws and regulations--County codes
    Laws and regulations--Election laws
    Laws and regulations--Environmental laws and regulations
    Laws and regulations--Federal laws
    Laws and regulations--Labor laws
    Laws and regulations--Labor laws--Affirmative action
    Laws and regulations--Labor laws--Equal employment opportunity
    Laws and regulations--Labor laws--Workers' rights
    Laws and regulations--Law pages for kids
    Laws and regulations--Legal forms
    Laws and regulations--Municipal ordinances
    Laws and regulations--Property laws
    Laws and regulations--State statutes
    Laws and regulations--Tax laws
    Laws and regulations--Telecommunications regulations
    Natural resources and environment
    Natural resources and environment--Air quality
    Natural resources and environment--Earth sciences
    Natural resources and environment--Ecology
    Natural resources and environment--Ecology--Animals
    Natural resources and environment--Ecology--Animals--Endangered animals
    Natural resources and environment--Ecology--Animals--Fish
    Natural resources and environment--Ecology--Animals--Endangered fish
    Natural resources and environment--Ecology--Endangered species
    Natural resources and environment--Ecology--Endangered species--Endangered animals
    Natural resources and environment--Ecology--Endangered species--Endangered fish
    Natural resources and environment--Ecology--Endangered species--Endangered plants
    Natural resources and environment--Ecology--Plants
    Natural resources and environment--Ecology--Plants--Endangered plants
    Natural resources and environment--Ecology--Types of environments
    Natural resources and environment--Ecology--Types of environments--Farmlands
    Natural resources and environment--Ecology--Types of environments--Forests
    Natural resources and environment--Ecology--Types of environments--Prairies
    Natural resources and environment--Ecology--Types of environments--Rural environments
    Natural resources and environment--Ecology--Types of environments--Urban environments
    Natural resources and environment--Energy resources
    Natural resources and environment--Energy resources--Alternative energy resources
    Natural resources and environment--Energy resources--Biomass energy
    Natural resources and environment--Energy resources--Coal
    Natural resources and environment--Energy resources--Hydroelectric resources
    Natural resources and environment--Energy resources--Petroleum and natural gas
    Natural resources and environment--Energy resources--Petroleum and natural gas--Urban environments--Gas utilities
    Natural resources and environment--Energy resources--Solar energy
    Natural resources and environment--Energy resources--Waste to energy
    Natural resources and environment--Energy resources--Wind energy
    Natural resources and environment--Environmental assessments
    Natural resources and environment--Environmental laws and regulations
    Natural resources and environment--Land use planning
    Natural resources and environment--Land use planning--Zoning
    Natural resources and environment--Maps and gazetteers
    Natural resources and environment--Natural resources pages for kids
    Natural resources and environment--Property laws
    Natural resources and environment--Public lands
    Natural resources and environment--Public lands--National parks
    Natural resources and environment--Public lands--State parks and beaches
    Natural resources and environment--Soil erosion
    Natural resources and environment--Waste disposal and hazardous sites
    Natural resources and environment--Waste disposal and hazardous sites--Cleanups of hazardous sites
    Natural resources and environment--Waste disposal and hazardous sites--Hazardous materials
    Natural resources and environment--Waste disposal and hazardous sites--Hazardous sites
    Natural resources and environment--Waste disposal and hazardous sites--Landfills
    Natural resources and environment--Waste disposal and hazardous sites--Landfills--Transfer centers
    Natural resources and environment--Waste disposal and hazardous sites--Recycling
    Natural resources and environment--Waste disposal and hazardous sites--Sewage treatment and disposal
    Natural resources and environment--Water resources
    Natural resources and environment--Water resources--Canals
    Natural resources and environment--Water resources--Dams
    Natural resources and environment--Water resources--Drinking water
    Natural resources and environment--Water resources--Groundwater
    Natural resources and environment--Water resources--Lakes
    Natural resources and environment--Water resources--Lakes--Lake Michigan
    Natural resources and environment--Water resources--Rivers and streams
    Natural resources and environment--Water resources--Rivers and streams--Chicago River
    Natural resources and environment--Water resources--Rivers and streams--Illinois River
    Natural resources and environment--Water resources--Rivers and streams--Mississippi River
    Natural resources and environment--Water resources--Rivers and streams--Ohio River
    Natural resources and environment--Water resources--Watersheds
    Natural resources and environment--Water resources--Watersheds
    Natural resources and environment--Water resources--Wetlands
    Natural resources and environment--Weather
    Natural resources and environment--Weather--Drought
    Natural resources and environment--Weather--Floods
    Natural resources and environment--Weather--Snow emergencies
    Natural resources and environment--Weather--Tornadoes
    Public safety and consumer protection
    Public safety and consumer protection--Complaint forms
    Public safety and consumer protection--Consumer protection pages for kids
    Public safety and consumer protection--Food handling and safety
    Public safety and consumer protection--Product safety
    Public safety and consumer protection--Public safety
    Public safety and consumer protection--Public safety--Animal licenses and animal control
    Public safety and consumer protection--Public safety--Emergency management
    Public safety and consumer protection--Public safety--Emergency management--Disasters
    Public safety and consumer protection--Public safety--Emergency management--Disasters--Floods
    Public safety and consumer protection--Public safety--Emergency management--Disasters--Tornadoes
    Public safety and consumer protection--Public safety--Emergency management--Snow emergencies
    Public safety and consumer protection--Public safety--Fire fighting and fire prevention
    Public safety and consumer protection--Public safety--Food handling and safety
    Public safety and consumer protection--Public safety--Gun control
    Public safety and consumer protection--Public safety--Gun control--Firearms licenses
    Public safety and consumer protection--Public safety--Hazardous materials
    Public safety and consumer protection--Public safety--Occupational and industrial safety
    Public safety and consumer protection--Public safety--Occupational and industrial safety--Workers' compensation
    Public safety and consumer protection--Public safety--Public safety pages for kids
    Public safety and consumer protection--Public safety--Water safety
    Recreation and tourism
    Recreation and tourism--Conference centers and resorts
    Recreation and tourism--Cultural resources
    Recreation and tourism--Cultural resources--Movies
    Recreation and tourism--Cultural resources--Museums and historical societies
    Recreation and tourism--Cultural resources--Performing arts
    Recreation and tourism--Cultural resources--Performing arts--Ethnic and folk art
    Recreation and tourism--Cultural resources--Visual arts
    Recreation and tourism--Cultural resources--Visual arts--Ethnic and folk art
    Recreation and tourism--Fairs and festivals
    Recreation and tourism--Fairs and festivals--DuQuoin State Fair
    Recreation and tourism--Fairs and festivals--Illinois State Fair
    Recreation and tourism--Gambling
    Recreation and tourism--Gambling--Casinos
    Recreation and tourism--Gambling--Lottery
    Recreation and tourism--Public lands
    Recreation and tourism--Public lands--National parks
    Recreation and tourism--Public lands--State parks and beaches
    Recreation and tourism--Racing
    Recreation and tourism--Recreation pages for kids
    Recreation and tourism--Sports
    Recreation and tourism--Sports--Bicycling
    Recreation and tourism--Sports--Fishing and hunting
    Recreation and tourism--Sport--Fishing and hunting--Fishing and hunting licenses
    Recreation and tourism--Sports--Fishing and hunting--Fishing and hunting licenses--Archery licenses
    Recreation and tourism--Sports--Golf
    Recreation and tourism--Sports--Hiking, climbing, and camping
    Recreation and tourism--Sports--Off road vehicles
    Recreation and tourism--Sports--Professional sports
    Recreation and tourism--Sports--Water sports
    Recreation and tourism--Sports--Water sports--Boating
    Recreation and tourism--Sports--Water sports--Boating--Water craft licenses
    Recreation and tourism--Sports--Water sports--Boating--Water safety
    Recreation and tourism--Sports--Water sports--Water safety
    Recreation and tourism--Sports--Winter sports
    Recreation and tourism--Tourism
    Regional government
    Regional government--Regional government agencies
    Regional government--Regional government finance
    Regional government--Regional government purchasing
    Social issues and programs
    Social issues and programs--Aging
    Social issues and programs--Aging--Medicare
    Social issues and programs--Aging--Nursing homes and assisted living facilities
    Social issues and programs--Children and youth
    Social issues and programs--Children and youth--Adoption and foster care
    Social issues and programs--Children and youth--Child abuse
    Social issues and programs--Children and youth--Child abuse--Child abuse and neglect forms
    Social issues and programs--Children and youth--Child care
    Social issues and programs--Children and youth--Child custody
    Social issues and programs--Children and youth--Child support
    Social issues and programs--Children and youth--Runaway
    Social issues and programs--Civil and human rights
    Social issues and programs--Civil and human rights--Desegregation and diversity
    Social issues and programs--Civil and human rights--Discrimination
    Social issues and programs--Civil and human rights--Discrimination--Affirmative action
    Social issues and programs--Civil and human rights--Equal employment opportunity
    Social issues and programs--Civil and human rights--Intellectual freedom and censorship
    Social issues and programs--Civil and human rights--Privacy
    Social issues and programs--Disabilities
    Social issues and programs--Disabilities--Developmental disabilities
    Social issues and programs--Disabilities--Hearing impairments
    Social issues and programs--Disabilities--Learning disabilities
    Social issues and programs--Disabilities--Visual disabilities
    Social issues and programs--Ethics
    Social issues and programs--Ethnic groups and minorities
    Social issues and programs--Ethnic groups and minorities--African Americans
    Social issues and programs--Ethnic groups and minorities--American Indians
    Social issues and programs--Ethnic groups and minorities--Asian Americans
    Social issues and programs--Ethnic groups and minorities--Hispanic Americans
    Social issues and programs--Family
    Social issues and programs--Family--Adoption and foster care
    Social issues and programs--Family--Child care
    Social issues and programs--Family--Child custody
    Social issues and programs--Family--Child support
    Social issues and programs--Family--Divorce
    Social issues and programs--Family--Domestic violence
    Social issues and programs--Family--Family planning
    Social issues and programs--Family--Genealogy
    Social issues and programs--Homeless people
    Social issues and programs--Immigration
    Social issues and programs--Welfare
    Social issues and programs--Welfare--Medicaid
    State government
    State government--Elected state officials
    State government--Elected state officials--Illinois Attorney General
    State government--Elected state officials--Illinois Auditor General
    State government--Elected state officials--Illinois Comptroller
    State government--Elected state officials--Illinois Governor
    State government--Elected state officials--Illinois Lieutenant Governor
    State government--Elected state officials--Illinois Secretary of State
    State government--Elected state officials--Illinois state legislators
    State government--Elected state officials--Illinois Treasurer
    State government--Illinois General Assembly
    State government--Illinois General Assembly--Bills (proposed legislation
    State government--Illinois General Assembly--Illinois House of Representatives
    State government--Illinois General Assembly--Illinois Senate
    State government--Illinois General Assembly--Illinois state legislators
    State government--Licenses and permits
    State government--Licenses and permits--Animal licenses and animal control
    State government--Licenses and permits--Business and professional licenses
    State government--Licenses and permits--Business and professional licenses--Business licenses
    State government--Licenses and permits--Professional licenses
    State government--Licenses and permits--Certification of school personnel
    State government--Licenses and permits--Drivers licenses
    State government--Licenses and permits--Drivers licenses--License application forms
    State government--Licenses and permits--Firearms licenses
    State government--Licenses and permits--Fishing and hunting licenses
    State government--Licenses and permits--Fishing and hunting licenses--Archery licenses
    State government--Licenses and permits--License application forms
    State government--Licenses and permits--License fees
    State government--Licenses and permits--Water craft licenses
    State government--Lobbying
    State government--Relations with other governments
    State government--Relations with other governments--Federal relations
    State government--Relations with other governments--Intergovernmental agreements
    State government--Relations with other governments--Intergovernmental relations
    State government--Relations with other governments--Local government relations
    State government--Relations with other governments--Regional government agencies
    State government--Relations with other governments--Relations with other states
    State government--State agencies
    State government--State agencies--Administrative rules
    State government--State audits and studies
    State government--State boards and commissions
    State government--State constitution
    State government--State employees
    State government--State employees--Elected state officials
    State government--State employees--Elected state officials--Illinois Attorney General
    State government--State employees--Elected state officials--Illinois Auditor General
    State government--State employees--Elected state officials--Illinois Comptroller
    State government--State employees--Elected state officials--Illinois Governor
    State government--State employees--Elected state officials--Illinois Lieutenant Governor
    State government--State employees--Elected state officials--Illinois Secretary of State
    State government--State employees--Elected state officials--Illinois state legislators
    State government--State employees--Elected state officials--Illinois Treasurer
    State government--State employees--State employment retirement system
    State government--State employees--State employment retirement system--Retirement forms
    State government--State employees--State employment retirement system--Retirement forms--Teachers' retirement forms
    State government--State employees--State employment forms
    State government--State finance
    State government--State finance--Grants made by states
    State government--State finance--Grants made by states--Grant application forms
    State government--State finance--Grants to states
    State government--State finance--Grants to states--Federal grants
    State government--State finance--Grants to states--Federal grants--Federal education grants
    State government--State finance--State aid to higher education
    State government--State finance--State aid to schools
    State government--State finance--State employee retirement system
    State government--State finance--State employee retirement system--Retirement forms
    State government--State finance--State employee retirement system--Retirement forms--Teachers' retirement forms
    State government--State finance--Teachers' retirement system
    State government--State finance--Teachers' retirement system--Teachers' retirement forms
    State government--State government facilities
    State government--State government purchasing
    State government--State symbols and flags
    State government--State symbols and flags--State flags
    Transportation
    Transportation--Air transportation
    Transportation--Airports
    Transportation--Automobiles
    Transportation--Automobiles--Automobile insurance
    Transportation--Automobiles--Drivers licenses
    Transportation--Automobiles--Drivers licenses--License application forms
    Transportation--Automobiles--Vehicle registration forms
    Transportation--Bicycling
    Transportation--Bridges and tunnels
    Transportation--Buses
    Transportation--Buses--Drivers licenses
    Transportation--Buses--Drivers licenses--License application forms
    Transportation--Buses--Vehicle registration form
    Transportation--Mass transit
    Transportation--Motor vehicles
    Transportation--Motor vehicles--Automobiles
    Transportation--Motor vehicles--Automobiles--Automobile insurance
    Transportation--Motor vehicles--Automobiles--Drivers licenses
    Transportation--Motor vehicles--Automobiles--Drivers licenses--License application forms
    Transportation--Motor vehicles--Automobiles--Vehicle registration forms
    Transportation--Motor vehicles--Buses
    Transportation--Motor vehicles--Buses--Drivers licenses
    Transportation--Motor vehicles--Buses--Drivers licenses--License application forms
    Transportation--Motor vehicles--Buses--Vehicle registration forms
    Transportation--Motor vehicles--Motocyles
    Transportation--Motor vehicles--Motocyles--Drivers licenses
    Transportation--Motor vehicles--Motocyles--Drivers licenses--License application forms
    Transportation--Motor vehicles--Motocyles--Vehicle registration forms
    Transportation--Motor vehicles--Off road vehicles
    Transportation--Motor vehicles--Trucks
    Transportation--Motor vehicles--Trucks--Drivers licenses
    Transportation--Motor vehicles--Trucks--Drivers licenses--License application forms
    Transportation--Motor vehicles--Trucks--Drivers licenses--Vehicle registration forms
    Transportation--Motor vehicles--Vehicle registration and titles
    Transportation--Motor vehicles--Vehicle registration and titles--Vehicle registration forms
    Transportation--Railroads
    Transportation--Roads and highways
    Transportation--Roads and highways--I-Pass forms
    Transportation--Traffic safety
    Transportation--Traffic safety--Accident report forms
    Transportation--Trucks
    Transportation--Trucks--Drivers licenses
    Transportation--Trucks--Drivers licenses--License application forms
    Transportation--Trucks--Vehicle registration forms
    Transportation--Vehicle registration and titles
    Transportation--Vehicle registration and titles--Vehicle registration forms
    Transportation--Water transportation
    Transportation--Water transportation--Boating
    Transportation--Water transportation--Boating--Water craft licenses
    Transportation--Water transportation--Boating--Water safety
    Transportation--Water transportation--Canals
    Transportation--Water transportation--Ferries
    Transportation--Water transportation--Ports and harbors
    Utilities and communications
    Utilities and communications--Drinking water
    Utilities and communications--Electric utilities
    Utilities and communications--Electric utilities--Cogeneration
    Utilities and communications--Electric utilities--Hydroelectric power plants
    Utilities and communications--Electric utilities--Nuclear power plants
    Utilities and communications--Energy resources
    Utilities and communications--Energy resources--Alternative energy resources
    Utilities and communications--Energy resources--Biomass energy
    Utilities and communications--Energy resources--Coal
    Utilities and communications--Energy resources--Hydroelectric resources
    Utilities and communications--Energy resources--Petroleum and natural gas
    Utilities and communications--Energy resources--Petroleum and natural gas--Gas utilities
    Utilities and communications--Energy resources--Solar energy
    Utilities and communications--Energy resources--Waste to energy
    Utilities and communications--Energy resources--Wind energy
    Utilities and communications--Gas utilities
    Utilities and communications--Sewage treatment and disposal
    Utilities and communications--Telecommunications
    Utilities and communications--Telecommunications--Cable communications
    Utilities and communications--Telecommunications--Internet
    Utilities and communications--Telecommunications--Radio
    Utilities and communications--Telecommunications--Satellite communications
    Utilities and communications--Telecommunications--Telecommunications regulations
    Utilities and communications--Telecommunications--Television and video
    Utilities and communications--Telecommunications--Television and video--Cable communications
    Utilities and communications--Telecommunications--Television and video--Satellite communications
    Utilities and communications--Utilities and communications pages for kids
    Voting and elections
    Voting and elections--Bond issues
    Voting and elections--Candidate petition forms
    Voting and elections--Candidates
    Voting and elections--Candidates--Candidate petition forms
    Voting and elections--Election laws
    Voting and elections--Election manuals
    Voting and elections--Election results
    Voting and elections--Initiatives (electoral)
    Voting and elections--Referenda
    Voting and elections--Voter registration
    Voting and elections--Voting and elections pages for kids