Description: In this document, the authors present aspects of normalizing the decomposed records to improve the results of analysis. The data normalization processes use pattern-matching techniques to eliminate or generalize anomalous characters and terms. Because the unit of analysis in preparing the test dataset of 400,000 MARC 21 records is a "word," data normalization was needed to make the subsequent analysis reliable.
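
As a rough illustration of word-level normalization by pattern matching, the sketch below removes punctuation from each word and generalizes four-digit numbers to a placeholder. The two rules, the `<year>` placeholder, and the function name `normalize_words` are assumptions made for this example; they are not the patterns the authors actually used.

```python
import re

# Hypothetical rules for illustration only; not the project's actual patterns.
PUNCTUATION = re.compile(r"[^\w\s]")  # eliminate: anything that is not a word character or whitespace
YEAR = re.compile(r"\d{4}")           # generalize: a token made of exactly four digits


def normalize_words(text):
    """Split a field value into lowercase words and normalize each one."""
    words = []
    for token in text.lower().split():
        token = PUNCTUATION.sub("", token)  # strip anomalous characters
        if not token:
            continue  # the token was punctuation only
        if YEAR.fullmatch(token):
            token = "<year>"  # replace the specific value with a general term
        words.append(token)
    return words


print(normalize_words("Z39.50 testbed, 2001."))
```

Eliminating a character discards it entirely, while generalizing maps many distinct values to one term so that they count as the same "word" in later analysis.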

Description: This document provides a status report on the Z39.50 Interoperability Testbed Project (Z-Interop) covering the period of December 1, 2000 through April 30, 2001. The authors highlight activities and accomplishments to report the project's progress to IMLS. This period can be considered the project's startup phase.

Description: This document describes the data analysis procedures developed to create the Aggregate and Candidate Record Groups using SQL statements. This is the preliminary version of these procedures, tested and validated on a sample of decomposed MARC records. (For a description of how the MARC records were decomposed, see the Z-Interop document Decomposing MARC 21 Records for Analysis.) A subsequent version may be necessary as the authors move to the procedures for the entire file of decomposed records.