RCRG logo   Recordkeeping Metadata




Monash University Logo

Search

About RCRG - Contact Details, Map, Personnel, 
Research Areas: Metadata Control, Continuum Modelling & Research, Recordkeeping & Legal Systems          

Research - SPIRT Recordkeeping Metadata Project, TIF, InterPARES Project          

Publications - Author List, Title List          

Consulting - Recordkeeping Metadata, Recordkeeping Regimes, Personnel, Clients          

Links - Related Research Groups, Useful Resources          

SiteMap          


© 1998 Records Continuum Research Group, Monash University. All Rights Reserved.

 

Technical Introduction

 


This Section provides a roadmap for understanding the recordkeeping metadata element definitions.

(PDF Version (479 Kb) )





Technical Introduction

This Section provides a roadmap for understanding the recordkeeping metadata element definitions.

Conceptual Framework

  • Entity Types and Aggregation Levels
  • Relationships and Mandates
Metadata Model
  • Structure
  • Identifying Entities
  • Redundancy and Robustness of Descriptions
  • Metadata Syntax
  • Extensibility
Layout of Definitions
  • Summary of RKMS Elements and Qualifiers
  • RKMS Summary Register
  • RKMS Register
  • RKMS Schemes

Conceptual Framework
The Recordkeeping Metadata Element Schema (RKMS) is built on a conceptual framework, shown in the diagram below. The framework is concerned with four main classes of entities - Business entities, Agents (people) entities, Records entities, and Business-Recordkeeping entities - as well as the Relationships between entities and the Mandates that govern the entities and their relationships. The framework envisions description of entities at various layers of aggregation. The following diagram represents the entities as boxes, relationships as arrows, and aggregation layers as cascading boxes.

Each of the Business, Agents, Records, and Business-Recordkeeping classes has a separate metadata element set. Each set shares ten common metadata elements, and includes other elements specific to the description of each entity:


Entity Types and Aggregation Levels
A feature of RKMS is that it is possible to indicate the type of entity being described as well as the level of aggregation of the entity. The type of entity being described is indicated by including the name of the element set with the name of each element. That is, the "Title" element from the Agents and Records sets are identified as "Agents.Title" and "Records.Title" respectively.

An entity's level of aggregation is indicated using the Category Type element. For example, the description of a person would contain an "Agents.CategoryType" element with value "Person / Actor". The complete list of RKMS layers of aggregation are defined in the RKMS Category Type Scheme, described in the Schemes section below.

Relationships and Mandates
Another feature of RKMS is the ability to describe relationships between entities, and the mandates that govern those relationships. Although the conceptual framework is concerned with relations and mandates these are not considered primary entities in the RKMS conceptual model and so do not have separate metadata element sets. It was decided that modelling relationships and mandates as entities would distract from the task of defining metadata for the primary recordkeeping entities (Agents, Business, Records). They are described, instead, using the Relation and Mandate elements within the metadata element sets.

The decision to describe relations and mandates using element has many implications. In particular, the amount of information that can be recorded about relationships and mandates is limited, and only binary relationships can be depicted. Given the complexity and importance of relationships and mandates in defining records context and the possibility that depicting only binary relationships is too restrictive, this decision may be revisited in future revisions of the RKMS. These issues are discussed in greater depth in Ongoing Research, Re-modelling the RKMS Set.

Metadata Model
RKMS supports highly structured descriptions of entities. This structure is defined in a data model informed by the metadata modelling work of the W3C Resource Description Framework (RDF: http://www.w3.org/RDF) initiative and the Dublin Core Metadata Initiative (DC: http://purl.oclc.org/dc/).

Structure
Abstractly, the data model supports descriptions of entities. Descriptions are realised by aggregations of statements about entities. A statement is any characteristic of the entity, and the value of that characteristic. For example, an Agents entity might have a "Title" characteristic with value "Loans Officer".

The RKMS data model supports description of any Business, Agents, Records and Business-Recordkeeping entities that have identity.

Statements come in two forms: Basic and Qualified. A basic statement consists of a metadata element and a simple text value.

For example, a collection of two basic statements about title and date characteristics of an Agents entity might be:

Qualified statements consist of elements and values that are structured. Elements can be structured with element qualifiers that specialise or refine the meaning of an element. For example, "Established" is an element qualifier that can refine the meaning of the "Date" element for an Agents entity.

Values can be qualified in two ways:

Schemes These indicate either:

  • · An authority for the terms used in the value. For example, the AGIFT thesaurus could be used as an authority for the terms used in the value of the "Functional Classification" element.
  • · A notation or encoding syntax used to represent the value. For example, the ISO 8601 standard could be used as an encoding scheme for values of the "Date" element.
Value Components

These identify structure within the value. For example, the value of a "Mandate" element may have value components representing the "Title", "Date", "Description", and "Jurisdiction" of the mandate. These value components can be thought of as metadata about the value itself. In the above example, this is metadata for the mandate.

A qualified value can have a single text value (like a basic value), or multiple value components. Value components consist of value component labels representing the name of the value component and the value itself, represented as text value.

A qualified value may have a scheme relating to the whole value. When the value is a simple text string, this scheme defines an authority or structure for that text. For example, the scheme may indicate that the simple text value "1999-01-01" conforms to the ISO 8601 format for dates. When the value is a set of value components, the scheme usually indicates a naming authority for the value component labels. For example, the scheme may indicate that the labels come from the vCard (http://www.imc.org/pdi/) system for describing people.

Additionally, if the value consists of value components, then each value component may have an individual scheme.

When the value consists of multiple value components, at least one of those components should represent the information that would be entered into the "unstructured value". For example, the "Business.Mandate" element is defined as "Information about the mandate or instrument that provides the legal or administrative basis of the business, ...". It has many value components: "Title", "ID", "Date", "Description", and "Jurisdiction". A Mandate description that only used the "Date" and "Jurisdiction" components is not a valid description because it does not convey enough information to effectively describe the Mandate. Such a description should at least use the "Title" or "ID" components.

The following table contains examples of four qualified statements about an Agents entity.

Element
Value
Element Element Qualifier Value Component Value Scheme
Agents.Title Unlimited Loans Officer
Agents.Functional Classification   Loans Establishment EastPAC Functional Thesaurus
Agents.Date Established 1998-01-07 ISO8601
Agents.Mandate   Title Loan Management Authority
ID LM23
Jurisdiction Victoria

Identifying Entities
The RKMS data model supports description of any Business, Agents, Records and Business-Recordkeeping entities that have identity. Entities are identified using the RKMS Identifier element. Every description must contain at least one instance of this element.

The value of the Identifier element is a text string that acts as a reference to the entity. In general, understanding this reference involves understanding the context of the reference. For example, the identifier "ACN 052 372 577" is probably understandable within an Australian context as an Australian Company Number, but would probably not be well understood outside Australia. For this reason, it is recommended that identifiers contain context information that allows understanding beyond the immediate domain of application. For instance, the previous example could be better written as "[Australian Securities and Investments Commission, Australian Company Number] ACN 052 372 577".

Some qualified metadata statements contain an "ID" value component, intended to identify the value being described. These identifiers should also contain context if they are to be understood beyond the immediate domain.

Redundancy and Robustness of Descriptions
A fundamental premise of RKMS is that it should support persistence and understanding of records through time and space. To achieve this, metadata descriptions must contain enough of their original context to be understood independent of that context.

Context is captured within RKMS descriptions in two ways: as text within a metadata description, and as relationships to other entities. For example, the title of a Records entity would be described in the "Records.Title" metadata element, but the creator of the record would be described within the "Records.Relation" metadata element as a reference to a separate Agents entity.

The viability of capturing context using references to other entities depends on the feasibility of achieving persistent associations between entity descriptions, and the persistence of the other entity descriptions themselves. Uncertainties about persistence may lead to implementation of recordkeeping metadata in records-centric ways - if other systems cannot be trusted to sustain the links over time, then metadata must be brought explicitly within the boundaries of the records system itself.

RKMS addresses the problem of maintaining persistent metadata over time in three ways:

  1. Allow the same context to be captured as a reference to another description, as an in-place description, or both.
  2. Capture context in-place, even if it could be more efficiently be captured using a reference to another description.
  3. Capture context as a reference, but annotate the reference itself with some of the context of the other entity.

The first approach is taken in the case of description of recordkeeping activities. These are captured either separately from the Records as Business-Recordkeeping entities referenced from a Records entity description, or within the description of the Records entity itself using the Appraisal, Control, Preservation, Retrieval, Access, and Use metadata elements. In this way, the RKMS has been designed to allow both the separated and record-centric description of recordkeeping business.

The second approach is taken in the case of mandate descriptions. Mandates are not separate entities in the RKMS conceptual framework. As a result, the description of a mandate that governs many entities will be repeated within the Mandate element of each of those entities. This ensures that Mandate descriptions are always accessible within the description of each entity. It does mean, however, that Mandate descriptions are less detailed than if they were separate entities that had own metadata sets.

The final approach is available to any context captured using a relationship. The RKMS Relation element is used to describe references to other entities. It has "Related To", "Type", "Definition", "Date", "Mandate", and "Business Rules" value components intended to capture some of the context of the referenced entity. This means that even if a description of the referenced entity is lost, some context is available within the referencing description.

Metadata Syntax
The element definition tables within the RKMS register contain examples of each element's use. The RKMS Simple Text Syntax used to express these examples is described below. This syntax is based on the proposed Dublin Core Structured Values scheme (http://www.agcrc.csiro.au/projects/3018CO/metadata/dcsv/). Future RKMS work will include definitions for HTML (http://www.w3.org/TR/html4/) and RDF (http://www.w3.org/RDF) representations of RKMS metadata.

An RKMS Simple Text Syntax metadata statement consists of an element definition and a value definition, separated by an equals sign (=):

element definition = value definition

Element definitions consist of a schema identifier(RKMS), entity name, element name, and an optional element qualifier, separated by dots (.). For example,

RKMS.Agents.Title = …
RKMS.Agents.Date.Established = …

Elements and element qualifiers with multiple word names are compacted by capitalising the first letter in each word and concatenating them together. For example,

RKMS.Business.FunctionalClassification = …

Qualified values are represented in the syntax as a series of value components, separated by semi-colons (;). For example, a value with three value components c1, c2, c3 would be written as:

… = c1; c2; c3

Value components consist of labels representing the name of the value component and values representing the data itself. Colons (:) separate a value component label from the value component value. For example, the mandate description given in the data model section above would be written:

RKMS.Agents.Mandate = ID: LM23; Description: Loan Management Authority; Jurisdiction: Victoria

Value component labels containing multiple words are compacted in the same way as multiple word elements:

RKMS.Business.Relation = RelatedTo: RECORD001; Definition: Document In

Schemes are represented in the syntax as names within square brackets ([name]) which precede values. The Functional Classification and Date examples given in the table above would be written as:

RKMS.Agents.FunctionalClassification = [EastPAC Functional Thesaurus] Loans Establishment
RKMS.Agents.Date.Established = [ISO 8601] 1998-01-07

Value component values can also have schemes. In the following example there is a scheme associated with the value of the "Date" value component:

RKMS.Agents.Mandate = Description: Loan Management Authority; Jurisdiction: Victoria; Date: [ISO 8601] 1998-01-10

Extensibility
The RKMS envisages use of metadata elements, element qualifiers, value components and schemes from other metadata schemas. These allow RKMS descriptions to be extended using the structure and semantics from other descriptive standards.

Use of other independently maintained metadata schemas requires a metadata syntax that can distinguish between metadata components from different schemas. RDF has an XML syntax designed to support this requirement. The RKMS Simple Text Syntax, however, is restricted in this sense. It can only identify elements and schemes from other schemas, but not element qualifiers or value components from other schemas.

Elements from other metadata schemas are identified in the RKMS Simple Text Syntax using an appropriate schema identifier. For example, a Records description could use a combination of elements from RKMS and the Australian Government Locator Service (AGLS):

RKMS.Records.Title = Youth Loans
RKMS.Records.Mandate = ID: LM23; Description: Loan Management Authority; Jurisdiction: Victoria
AGLS.Audience = [AGLS Agegroup] youth, U25

Schemes from other metadata schemas are identified in the RKMS Simple Text Syntax as names within square brackets. For example,

RKMS.Agents.CategoryType = [RKMS Category Type Scheme] Organisational Unit / Work Group
RKMS.Agents.FunctionalClassification = [EastPAC Functional Thesaurus] Loans Establishment
RKMS.Agents.Date.Established = [ISO 8601] 1998-01-07

Layout of Definitions
The definition of the Business, Agents, Records, and Business-Recordkeeping metadata sets is split into four documents, each with their own purpose and structure.

Summary of Elements and Qualifiers
The Summary of Elements and Qualifiers contains a quick reference to the structure of RKMS. It consists of a table indicating the element qualifiers and value components that apply within each of the RKMS elements.

RKMS Register Summary
The Register Summary is a quick reference for the structure of RKMS, and the schemes used within the metadata set. It consists of a table indicating the element qualifiers and value components that apply within each of the RKMS elements, and which schemes are applicable within this structure.

RKMS Register
The Register contains a highly detailed description of each RKMS metadata element. The elements are described using attributes from the ISO 111791 standard for the description of metadata elements . These attributes include:

  • Name - The label assigned to the data element.
  • Identifier - The unique identifier assigned to the data element.
  • Version - The version of the data element.
  • Registration Authority - The entity authorised to register the data element.
  • Language - The language in which the data element is specified.
  • Definition - A statement that clearly represents the concept and essential nature of the data element.
  • Obligation - Indicates if the data element is required to always or sometimes be present (contain a value).
  • Datatype - Indicates the type of data that can be represented in the value of the data element.
  • Maximum Occurrence - Indicates any limit to the repeatability of the data element.
  • Comment - A remark concerning the application of the data element.

Additionally, the following attributes are used to describe each element:

  • Element Qualifier - A list of possible element qualifiers for the element.
  • Value Components - A list of components that may be used to structure a value.
  • Scheme - List of mandatory and relevant schemes, plus an indication of the element qualifiers and value components they may qualify.
  • Description - An elaboration on or further explanation of the meaning of the element.
  • Examples - Some examples of the element's use. These examples are intentionally fictional to avoid misrepresenting the business processes of real organisations. Any resemblance to records, businesses, or agents, living or dead, is purely coincidental.

Fortunately, some of these attributes are common to all RKMS elements, and so are not shown in the element definitions. These are:

Version 1.0
Registration Authority Monash Records Continuum Research Group
Language en
Datatype Character String

Additionally, most elements have the same occurrence restrictions. That is, most elements have:

Obligation Optional
Maximum Ocurrence Unlimited

There are two exceptions to this. The "Category Type" elements must occur exactly once:

Category Type
Obligation Mandatory
Maximum Ocurrence 1

And "Identifier" elements must occur at least once:

Identifier
Obligation Mandatory
Maximum Ocurrence Unlimited

RKMS Schemes
The schemes section describes the schemes defined within RKMS. These are

  • RKMS Category Type Scheme - a vocabulary of terms used within the Category type element to indicate the level of aggregation of the entity being described, and within the Relation element to indicate the level of aggregation of the entities being related.
  • RKMS Entity Relationships Scheme - a vocabulary of terms used within the Relation element for indicating the nature of the relationship. This scheme is currently under development.
  • RKMS Extension for ISO 8601 - an encoding scheme for dates that extends ISO8601 to include open ended date ranges.
  • RKMS Business-Recordkeeping Functions and Activities Scheme - a vocabulary of terms used to classify recordkeeping functions and activities. This scheme is currently under development.

Endnotes
ISO 11179 - Specification and Standardization of Data Elements, Parts 1-6.


News TIF Project

 

About     Research     Publications     Consulting     Links     Sitemap


Authorised by Head, School of Information Management and Systems. Caution.
Maintained by Records Continuum Research Group.
AGLS Compliant
Last updated 23 June 2000.