Indiana State and Local Government Archive
Collection Overview
Citing Web Sites in the Archive
Please cite individual seeds or web pages as follows:
“Title of web page.” Title of Collection. Archived by the
e.g.,
“
Selection Criteria
Scope: Currently we are collecting the web sites of any state government agency. We have also identified 4 local government sites to monitor and will be adding additional during 2007.
Volume: Active seeds will be crawled quarterly; County government web sites will be manually archived annually.
Crawl Parameters:
Ø Collection Dates: Start Date: July 1, 2006
Ø How often captured: Monthly for state agencies; annually for city/counties.
Acquisition Parameters:
Ø Depth: Complete
Ø Breadth: Links are followed out to one external level.
Searching
Archive-It provides full text search capability for all public collections. Alternately, if you know the site you are looking for, enter the URL into the search box, and Archive-It will search for instances of that archived URL.
Archive-It enables searching of both the full text of web sites and the metadata that has been assigned to the seeds, or individual URL’s.
The search tool used to provide full-text access to the Library's Web archive collections is powered by the open-source search engine, Nutch.
Some hints on searching:
Ø Generally, search results are ranked by relevance according to several factors:
o how often the query terms appear in the page relative to how often they appear throughout the collection
o how often the query terms appear in the page compared to the length of the page
o whether the query terms appear in the URL
o whether the query terms appear in the hostname
Ø The Boolean search default is AND.
Ø If you know that what you're looking for is in a specific type of file, you can limit your search to just that format by adding type:[file type] to your search terms.
o e.g., A PDF document about French Lick might be found using the following string: French Lick type:pdf.
Ø If you want to find out about a topic discussed specifically on an archived web site, you can limit your search by adding site:[URL of archived site] to your search terms.
o e.g., French Lick site:http://www.in.gov/ will find instances of the term French Lick on the Governor of Indiana’s web site.
Ø You can refine search results in the following ways:
o The link to other versions will take you to a list of archived versions that were captured on different dates.
o The more from... link will take you to other hits from that host.
Since the Indiana University Libraries have been archiving web sites only since spring 2006, you may wish to look for earlier versions of many of the sites in the Library's collections through the Internet Archive's general Wayback Machine. The Wayback Machine, however, is not text searchable; you must know the URL of the site that you would like to view.
Other Related Sites
Indiana Government: http://www.state.in.us/
Portal for all Indiana State Government agency web sites.
Indiana Commission on Public Records: http://www.state.in.us/icpr/
Agency responsible for ensuring “lawful, efficient retention of historically and legally significant public records, regardless of format, and coordinate destruction or permanent preservation when records are no longer actively needed by state agencies.
Contact Information:
Lou Malcomb
Head, Government Information, Microforms and Statistical Services
For historical Indiana State Documents, refer to Lou's Guide: Finding Historic INDIANA Documents http://www.libraries.iub.edu/index.php?pageId=3306
