Cultivate Interactive Home Page *
*

Search Disabled

  Home | Current Issue | Index of Back Issues
  Issue 6 Home | Editorial | Features | Regular Columns | News & Events | Misc.

Print All - Complete Issue

This page is intended for printing purposes.

-------------------------------------------------------------

Cultivate Interactive Issue 6: Editorial

Welcome to the sixth issue of Cultivate Interactive!

Red SquareWith the recent launch of Cultivate-Rusisia we thought it would be appropriate to give this issue of Cultivate Interactive an Eastern European feel. We have two articles on the Cultivate-Russia launch, one by David Fuegi outlining the project and an ‘At the Event article’ by myself revealing how the Launch meeting went. There is also a piece on recent events and funding at the Russian State Library by Monika Segbert, an article on Chodovec, the recently opened archive building located in the centre of Prague and the facilities it will offer for Czech archives by Borivoj Indra and Vladimíra Hradecká and a piece on the ARCCHIP project which uses workshops as a way of exchanging knowledge between EU and CEE countries. Keeping with the Eastern European theme, see if you can identify this issue’s ‘Spot the City’ photographs!

Other feature articles in the magazine cover a wide variety of Digital Cultural Heritage projects; some funded by DIGICULT such as Collaboratory for Annotation, Indexing and Retrieval of Digitized Historical Archive Material (COLLATE), Cultural Heritage in Regional Networks (REGNET) and BioNet. And others, such as Safeguarding European Photographic Images for Access (SEPIA), Children's Historical Literature Disseminated Throughout Europe (CHILDE), Cross-Language Evaluation Forum (CLEF), Electronic Information for Librariesn (eIFL), New Access And Services For Cultural Content (Activate) funded by a variety of other funding schemes.

Covering a more general subject area John Perkins, the Executive Director of the CIMI Consortium, has written an article discussing the potential the much talked about Open Archives Initiative may hold for museums. He also talks about how CIMI have been testing the OAI protocol by creating a generic OAI-compliant repository, and how they hope to conduct a more formal, large-scale test of the OAI for museums by harvesting museum metadata. As John explains “the purpose of the research is to explore how a specific community of users can use the OAI protocol and to begin the process of assessing the effectiveness and efficiency of OAI.

In the regular columns section we have the usual DIGICULT column, plus an extra column on DIGICULT's trial action Projects and their accompanying measure TRIS. There will be more news on the new projects in forthcoming issues of Cultivate Interactive. We unfortunately have no National Node column this issue but have a really interesting Praxis article in compensation. Paul Lewis and his colleagues from the University of Southampton discuss retrieval and navigation as ways of accessing multimedia information. In the past metadata has been used to retrieve multimedia data from multimedia information bases. Recent research has tried to address some of the limitations of using text metadata by making use of multimedia content as a basis for retrieval. The article asks whether content-based techniques are really making a useful contribution or whether we should restrict ourselves to the use of metadata. In the metadata column Frans Smit provides us with an English and Dutch version of his report on how we can adapt concepts from Information and Knowledge Management (IKM) and Information and Communication Technology (ICT) and create data Warehouses in which to organize and give access to metadata about historical archives and collections.

Finally we also have one extra Miscellaneous article by Michelle Fiander, Manager of the Web Reference Center, Maryland, America. Michelle has written a letter to us over here in Europe explaining what Virtual Reference is and how it came about.

It is also time for a farewell. I have been the editor of Cultivate Interactive since its launch in July 2000 but am now moving on to pastures new. I am staying at UKOLN but have recently been appointed as the QA Focus/NOF-digitise Advisor. Most of my work will revolve around advising and supporting projects within the DNER and NOF-digi programmes for Higher Education, Further Education and life long learning, here in the UK. Cultivate Interactive will continue to be published by new blood. So farewell, and thanks to all the readers and writers whom I have got to know so well over the last year and a half. Also a huge thanks to Shirley Keane who has helped me pull together this issue in a very limited amount of time. Without Shirley this issue would still be a collection of word files and scribbles in my note book!

Marieke Napier (Editor)

-------------------------------------------------------------

Cultivate Interactive Issue 6: Features

IST Projects:

Other Areas:

-------------------------------------------------------------

IST Projects

Russia Joins the Cultivate Family

By David Fuegi - February 2002

CULTIVATE-Russia is a new addition to the Cultivate Cultural Heritage Applications Network (CULTIVATE) which supports the cooperation of memory institutions (archives, libraries and museums) under the European Commission's Information Society Technologies Programme (IST). It brings Russia into the network and completes the network in its present form. CULTIVATE-Russia starts in January 2002 and runs for 18 months. More information will be available shortly from the Cultivate Web site [1].

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Image of Russia From 1st January 2002 the Cultivate family of projects will be completed as CULTIVATE-Russia comes on stream funded by DGINFOS. The project's kick-off meeting will be held in Moscow in mid-January [2] and the Russian partners plan to get up to speed rapidly. The project will run for 18 months so that it ends around the same time as the rest of the Cultivate network, which includes Cultivate EU and Cultivate CEE. Although it is formally a separate project with its own contract, budget and partners it will aim to work very closely with the other Cultivates. It will join their joint management meetings and workshops including those planned for Barcelona in May, contribute to Cultivate Interactive and to the policy monitoring and awareness raising activities of the network. But primarily it will expend its energies in Russia because Russia is very big and Russia is a bit different.

CULTIVATE-Russia will contribute to the overall strategic need to link Russia to Europe by building important human networks and helping to integrate the Russian cultural heritage network into the European network. Russia has a great wealth of cultural heritage and ICT expertise, which is still mostly inaccessible to EU citizens. CULTIVATE-Russia will play its part in ensuring that the European Commission has reliable information on trends and developments in the archives, libraries and museums in Russia, an informed international human network of participants in the IST Programme and informed political support for the Programme.

In Russia, with its enormous territory and low income population, it is impossible to hope that multimedia computers and access to Internet will be found in each family in the near future. This is why any broad access of the population to electronic resources, and the overcoming of the "digital divide" is first and foremost connected with public places of access in libraries, museums, and information centres. Libraries, museums and archives in Russia are entering the electronic age. Practically all the regional libraries, as well as a majority of city libraries in Russia have electronic reading rooms or are going to organise such rooms in the near future. Central libraries of the country, including the Russian State Library, the Science and Technology Library, the Library of Foreign Literature and other libraries of large cities have Internet connections.

A Russian node to represent the IST Programme to the cultural heritage domain has not previously existed and this will be remedied through CULTIVATE-Russia , which will aim to create a cadre of influential staff who are knowledgeable about the programme and keen to build on its outcomes. Language, distance and a comparatively poor IT and telecommunications infrastructure mean that achieving these objectives in Russia will be challenging. CULTIVATE-Russia expects to have to make a good deal of material available in Russian (including a Russian equivalent of Cultivate Interactive) and to use paper-based communication on a larger scale than in other countries. Because of the country's vast size, events will need to take place in major regional centres and not only in the capital.

CULTIVATE-Russia has 7 partners. The British Council (Moscow) is the principal contractor and responsible for the administrative/financial co-ordination function. The Council has enormous experience of running projects in Russia and will support the project in many ways, making full use of its extensive network of regional contacts and offices. The five Russian partners, representing the 3 main cultural heritage domains, have leadership positions and a wide geographical spread within Russia, which is, of course, a huge country. David Fuegi of MDR Partners, which is responsible for the management of CULTIVATE-CEE will also manage CULTIVATE-Russia. This will ensure continuity and close liaison between the projects. Monika Segbert will work closely with MDR and the principal Russian partners on key issues including publicity. To help achieve close personal and professional liaison within the Cultivate group of projects, many of the technical partners in Cultivate CEE and Cultivate EU will lead workshops in Moscow linked to the kick-off of Cultivate Russia.

A Few Words about the Russian Partners

The Russian partners are the Russian Cultural Heritage Network, Department for Archives of Khabarovsk Regional Administration, Chelyabinsk Regional Universal Library, Smolensk State Union of Museums and Centre PIC. Three of the Russian partners represent major players in the three main sectors represented within the cultural heritage domain - libraries, museums and archives. The fourth (Russian Cultural Heritage Network) is a cross-sectoral body based in the beautiful Darwin Museum in Mosocw. Centre PIC, which is also Moscow based has extensive experience in managing the prestigious EVA conference and will organise a major conference in Russia as parto f the CULTIVATE-Russia activity.

The Russian Cultural Heritage Network [3] [4] works closely with a wide range of cultural heritage institutions across Russia and brings together many regional and international partners. RCHN has substantial experience of information collection and dissemination and has built up large web-sites for the cultural heritage domains. Based in modern offices in the Darwin Museum, the Network has staff with high levels of technical and managerial expertise. Amongst the websites created by RCHN are All-Russian Museums Registration, Russian Museums and Galleries Online, Russian State Library, Russian Culture and Zoos of Russia.

The Department for Archives of Khabarovsk Regional Administration is an independent institution that is engaged in the process of archive administration under the statute of Khabarovsk region "Archive's stock, archives and the practices Act". Chelyabinsk Regional Universal Library is one of the oldest libraries in the Urals region founded in 1898. Nowadays the library is involved in many different activities and as a result it has significant human networks. The library ranks as one of the large regional libraries of Russia with about 2 million stock items and plays an important role in the cultural life of the Chelyabinsk region where it functions as the regional depository and methodology research centre for all libraries in the area. Smolensk State Union of Museums is one of the largest unions of museums in the Russian Federation and is involved in a wide range of activities including seminars and workshops on art and culture issues and has strong connections with museums and their associations and regional authorities and has been heavily involved with IT applications since 1994. Centre PIC - the Centre for Informatisation in the cultural sphere of the Ministry of Culture of the Russian Federation - was established in 1992. Its primary objective is to investigate new information technologies and to apply them in the cultural area.

Centre PIC is a state research and design body subordinate to the Ministry of Culture and works under contracts with this Ministry, with the Ministry of Science and Technological Policy, and with Russian museums and other cultural institutions. In 1995 the Ministry of Culture of Russia set up the Centre of Multimedia Technologies (as a subsidiary of Centre PIC) to co-ordinate the efforts of different bodies connected with multimedia publishing and distribution in culture and art. The Centre's services include evaluation of proposed projects and works-in-process. In addition, the Centre prepares drafts for laws in the area of rights and ownership of cultural properties. Centre PIC monitors and analyses new information and communication technologies in culture and art and presents annual reports on the topic. In 1998, 1999, 2000 and 2001Centre PIC organised EVA Moscow, a leading international event in Russia and Eastern Europe on new information technologies in the cultural area.

It is intended that more details about the project will appear shortly on the Cultivate Web site.

References

  1. Cultivate Web site
    URL: <http://www.cultivate-europe.org/> Link to external resource
  2. Cultivate-Russia Kick off Meeting, 14th – 16th January 2002, Marieke Napier, in Cultivate Interactive, issue 6, 11 February 2002
    URL: <http://www.cultivate-int.org/issue6/moscow/>
  3. Cultural Heritage Networking in Russia: Permanently Upcoming Perspectives, Dmitriy Luchkin, in Cultivate Interactive, issue 2, 16 October 2000
    URL: <http://www.cultivate-int.org/issue2/russian/>
  4. Developing Russian Museums Online, Dmitriy Luchkin, in Cultivate Interactive, issue 3, 29 January 2001
    URL: <http://www.cultivate-int.org/issue3/russian/>

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

David Fuegi

david.fuegi@mdrpartners.com Link to an email address
http://www.mdrpartners.com Link to external resource

David Fuegi manages Cultivate CEE. He is a partner in MDR Partners, a consultancy company established to engage with strategic IST developments in Europe and specialising in international work involving libraries. He is joint author of "Library Performance Indicators and Library Management Tools" (1995) and of "Study of Library Economics of Central and Eastern Europe" (1998) both published in Luxembourg by the Office for Official Publications of the European Communities.

He is currently project manager for LIBECON and is joint author of the project’s Millennium Study. Other major recent projects include business planning for the TACIS Russian State Library Project in Moscow and drafting public library standards for England for the Department of Culture, Media and Sport. He manages the IPF public library benchmarking clubs and is special adviser to the UK Committee on Public Library Statistics. Other European library projects in which he has worked include the Publica Project (DG13), ISTAR (DG5), PLDP (DG16), PULMAN, TACIS TELRUS etc. Formerly he was Library Advisor to UK government ministers responsible for Libraries and held senior positions in public libraries.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Fuegi, D. "Russia Joins the Cultivate Family", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/russia/>

-------------------------------------------------------------

Cultural Heritage in Regional Networks: The REGNET Project

By Silke Grossmann, Vic Haesaerts, Gerda Koch and Walter Koch - February 2002

Silke Grossmann, Vic Haesaerts, Gerda Koch and Walter Koch report on the Regnet Project which aims to set up a functional network of service centres in Europe, providing IT-services dedicated to Cultural Heritage organisations.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

The concept of a European Digital Library cannot only be based on a technical framework enabling access to digital goods – there is also a need to introduce new ways of cooperation between different stakeholders (Cultural Organisations, Industries, Administrations, etc.) as well as to “re-engineer” traditional business processes in the light of globalisation and world wide markets. The main activities within a support environment for I(nternet) Markets comprise of Content Engineering, Platform Engineering, and Enterprise Engineering.

The REGNET - Cultural Heritage in REGional NETworks project - targets all of above the three areas. The project was introduced under the Action Line ‘Access to digital collections of cultural and scientific content’ of the European Union IST-Information Society Technologies Programme and 23 partners, cultural organisations (museums, libraries and archives) and IT-industry representatives, from 10 European Union states as well as Bulgaria and Russia are participating. REGNET aims to set up a functional network of service centres in Europe, which provide IT-services dedicated to Cultural Heritage organisations and will be an enabler of e-Business activities for CH organizations. REGNET is a research and development project with a demonstration phase. It started April 1st 2001 and will last for two years. The whole project budget is above 5 Mio. EUR and 48,5% funded by the European Union. The Project Coordinator is AIT Applied Information Technique Research Ltd [1].

Envisaged Achievements

REGNET will set up a functional network of cultural service centres through Europe, which will provide IT-services dedicated to cultural heritage organisations. A technical and legal framework, the REGNET system, for such a service infrastructure will be developed. This will offer services like data entry, search and retrieval, and e-Business. It will be based mainly on integration work using state-of-the-art components. The network will integrate multi-media industries, content providers and service centre operators. Existing cultural infrastructures will be exploited and new infrastructures should be developed where necessary.

The technical infrastructure will allow the setting up of low cost service centres. The membership concept of the service centres will generate a critical mass of digital or physical goods contained in Content Provider's organisations.

The four players within the network are the content providers, the service centre operators, the system developers and end users. The content providers (museums, libraries, archives etc.) will provide access (via wired and wireless communication) to their digital contents, services and products and offer them to their clients (B2C). In return they can use the REGNET facilities for multimedia productions and database management, or cooperate with other REGNET partners during the creation of databases, generation of multimedia products or creation of a virtual exhibition (B2B). The service centre operators will generate income by providing the technical infrastructure (software/hardware) to content providers and other partners within the REGNET network. They offer additional IT-services and consultancies. And the system developers are selling the REGNET system to other cultural service centres and content providers. They implement additional components for the REGNET software system (additional ‘nodes’ like an ‘exhibition creator’, etc), and will generate income via licence fees for the REGNET system. For the end user the system will offer easy and wide access to cultural heritage (CH) data information and the purchase of CH related goods and services at one point, with stress on the production of personalized goods (e.g. CD-ROM) and services.

REGNET will also provide a unique facility for metadata definitions for both CH-related data and e-business related data via the search entry in the REGNET – Portal and the REGNET - Ontology subsystem. Here not only metadata related to objects is addressed, but also definitions of work flows, business processes, etc. are included which lead to the concept of Ontology within the Cultural Heritage domain. Within the project a data entry facility using Web-browsers and adaptable to different needs within different domains (Museums, Library, Archives) will be established. This facility will support existing standards (ICOM/CIDOC, UNIMARC, ISAD(G), etc) and is configurable by the REGNET System user. At the e-Business level REGNET provides the generation of a customised shopping cart systems within the business to consumer (B2C) framework.

REGNET will also examine how next generation mobile networks can be exploited to widen the potentiality of Web services in the field of Cultural Heritage. Appropriate gateway functions and interworking units will be envisaged to interface the REGNET architecture with the UMTS (Universal Mobile Telecommunication Systems) structure and ensure a seamless provisioning of REGNET services to mobile customers.

Already existing electronic catalogues (OPACS: Online Public Access Catalogues) referring to cultural and scientific objects contained in libraries, museums, archives, and galleries, as well as to goods and services will be used. Where there are no catalogues yet new catalogues will be developed in order to establish a 'virtual union' catalogue of all OPACS and product/service catalogues held locally.

In addition the work includes the outlining of necessary 'supply chains' and the connected business processes and functions to deliver digital and physical goods. ADevelopment ll the business transactions on the B2B and B2C level (containing payment features, copyright systems, authentication control, etc) will be guided by a legal framework that will be worked out accordingly. The reorganisation of existing processes and introduction of computerised functions which make transactions for low money goods worth while (e.g. small copyright fees for digital images) is an essential part of the project.

The REGNET System Architecture

The building blocks which constitute the REGNET-System are supporting access to cultural and scientific information as well as to product and service information offered by different organizations. The building blocks are outlined in the figure below. The REGNET-Portal enables entry of remote data, distributed search and e-business functions. The Cultural Heritage Data Management allows the search in distributed metadata repositories connected to data of cultural heritage content. The e-Business Data Management gives access to distributed goods and services catalogues via an e-business system. And the Ontology Checker contains specifications of metadata, terminologies etc. which are used in the e-business and cultural heritage field. Finally the Electronic Publisher allows the production of personalized digital products based on standardized metadata and work flows.

The REGNET building blocks

REGNET extends the traditional access facilities to distributed heterogeneous catalogues by integration of product and service catalogues managed by e-business systems. Because of the large REGNET consortium real world tests covering 6 European regions (a dozen states) are possible.

The project will observe and take into account the ongoing and recent work of CEN/ISSS (Electronic Commerce Workshop) but will also look into recent activities like RosettaNet, eCo-Framework, OBI, etc.

REGNET will evaluate a networked organisation model (comparable to the Art Museums Image Consortium/AMICO in the United States) and the integration possibilities of tools and systems for co-operative working.

Content Creation, Platform Management, Enterprise Engineering

Within the REGNET system there are three building blocks which can be considered as vertical functions:

1) Content creation and management is based on actual standards efforts in the field of the different organisations involved. The inclusion of so called 'Dublin Core' metadata is as well an actual topic as the Z39.50/XML-related standardisation work is, which is targeted to a harmonised search and retrieval facility across different domains. REGNET may influence the development of a so called Z39.50/XML-application profile: the Bath-Profile. This technique, to make Online Public Access Catalogues (OPACs) interoperable, is essential to REGNET, since by this way the catalogue of the REGNET shopping system is generated dynamically according to the request of a user.

2) The platform management is based on up-to-date Internet technology and is the basis for the middle ware being the agent between content and service supplier and the requester (user). It is expected that during the implementation of REGNET the first large trials of systems following the OBI reference architecture will deliver results (RosettaNet).

3) The enterprise engineering will focus on some selected business processes and functional units: access to distributed catalogues, a shopping cart system, creation of a personalised catalogue based on retrieved data from the 'virtual catalogue' (in printed and electronic form), an Internet auction system (e.g. offering duplicates of posters), and a delivery system for physical goods (e.g. goods from museum stores).

On a 'horizontal' basis the XML/XSL-technology is used to structure data semantically and physically. This affects the creation of metadata, describing real (primary) objects (artefacts, naturefacts,...), media objects (photos, videos, ...) or bibliographic type objects (literature in the broadest sense). On the other hand all information within business transactions is wrapped within XML structures: Order, Invoice, Despatch, Report, ...). The recent developments in the field of XML/EDI standardisation will be used (ebXML). Another topic will be the definition of information products by appropriate document type definitions and style sheets. This should enable the 'non-media-professional' end user to easily generate catalogues or even CD-ROMs on demand. This might be the first step into the direction to create virtual exhibitions on demand by users themselves.

User Scenarios

The scenarios outlined below can be considered as a first reference to e-Business processes only.

1. Business to Consumer (B2C):

A tourist wants to buy articles related to the cultural heritage of a region; he/she is interested in physical goods from one or more museum shops as well as in specific surrogate (images) of cultural objects located in museums, archives, or libraries in the region. He/she can use a terminal at a tourist office where he/she searches the REGNET virtual catalogue for relevant images and information, browses for articles located in museum stores and places orders.

Business to Consumer (B2C)

2. Business to Business (B2B):

A museum wants to produce a CDROM containing information about specific objects that he/she is interested in and which are related to a region.

The curator at the museum's side selects relevant information (text, images, films); he/she contacts a media producer, selects the basic layout/storyboard for the CDROM and works out together with the industrial partner - eventually with the help of other expert(s) - the final storyboard and the work flow necessary for the production process. He/she signs a contract for the production process and receives the master copy of the CDROM.

Business to Business (B2B)

3. Consumer to Consumer (C2C):

The owner of a private collection wants to sell some of his/her assets on a maximum prize. He/She thinks about putting the offers on the Internet.

He/She digitises with the help of a REGNET-Partner (e.g. Cultural Service Centre/CSC) surrogates of the goods which will be sold. Additional information and expertises are gained after consultation of the Virtual Catalogue. An dedicated Internet Auction System is set up at an Cultural Service Centres (CSC) site and offers bidding to interested user communities. The marketing of this auction is supported by the CSC. This business case can be considered as combination of B2C and B2B functions and demonstrates that a stakeholder can obtain different roles in different business-cases.

Consumer to Consumer (C2C)

Market Situation

Electronic commerce, notably business to business e-commerce, is booming and world-wide e-commerce sales are expected to grow 40 times between 1998 and 2003 by which time they will become over 15% of all sales. The Internet and e-commerce are also leading to a surge in new company creation. The market for REGNET systems and services can to some extent be reckoned by the number of museums, libraries and archives in Europe. A rough estimation would deliver a potential number of 3.000 (15% of 20.000 museums) museum shops throughout Europe being candidates to be partner of the REGNET network!

Content Creation and Content Development – Success Factors for the Internet

“Content” is the basis for the planned REGNET portal and key success factor of the whole project. In order to make the system able to adapt to and compete on the market and to provide high quality digital assets and services the analysis of the available content at partner sites is - with regard to an effective content creation and content management - one of the main tasks in the project. The important decision to take consists in defining what objects or surrogates might attract potential users or could be a basis for follow up added value processing. The nature of data will not only be of cultural & scientific type; goods available in museum shops have to be documented too to provide Internet access to the museum shop. It might be useful not only store “primary” goods immediately for use of further processing, but also “eye catcher” suited to invoke a on demand process eventually doing a painting according to a theme offered in REGNET (“painting on demand”).

Analysis of the State-of-the-Art

The analysis of collections and items located in libraries, museums, archives, research centres, etc. in a participating region was done by a multi-step approach (Figure 1).

Figure 1: Objectives of the audit concept of WP 1.1
Figure 1: Objectives of the audit concept of WP 1.1

The chosen methodology sought to describe not only the current practises (the status quo) at the side of the content providers in different domains but also to provide a guide on international best practise regarding the usage of the Internet for presentation of collections (search and browse), communication and interaction with potential users, e-Commerce-transactions etc. By examining and documenting the State-of-the-Art in the cultural sector an orientation on well established trends and methods should be assured and features be identified to be applied for the REGNET system. The overall goal was to derive and to define functional requirements for the REGNET system.

As part of the envisaged system the theme-based approach was developed - the idea and the aim behind was to deliver further added value in areas such as education, electronic publishing and virtual exhibitions. The themes should deliver supporting material in selling pieces of art or surrogates, offer new possibilities in the area of cultural tourism and realize innovative access methods to search for art objects (paints, sculptures, installations, net art, design, ceramics, jewellery etc.), museum objects, physical goods in the museum shops, photography and library collections (e. g. books, manuscripts and maps). The theme-based approach will be described in chapter xy ).

Development of Digitisation Plans

All participating content providers could provide normally more than one collection – the challenge was to select assets which fit to the envisaged clientele. Due to the fact that almost all content providers directly involved started digitisation activities but not finished them it was necessary to try to achieve some critical mass also by complementing the different offerings.

Once it is clarified what material is going into the REGNET repositories concrete guidelines based on best practice experiences have to be developed. The recommendations and guidelines to be worked out should be based on two basic principles: efficiency and cost effectiveness. Because collections differ widely in their types of material, audience, and institutional purpose, specific practices may vary from institution to institution as well as for different collections within a single institution. Therefore, the sets of recommendations must be broad enough to apply to most cases, and try to synthesize the differing recommendations previously made for specific target collections/audiences.

The "general" digitisation plan should serve as an guideline for all digitisation activities in the REGNET project. The general policy of digitisation in the project and selections with regard to content and technological issues are described. The digitisation plan lists all tasks (which can be tackled in sequence or in parallel) valid for digitisation projects in general. The "modules" of the general digitisation plan should allow to convert any individual collection according to the described procedures by considering the large number of variables, driven by the priorities, the institution, user needs, technical issues and available funding. It should help to consider all things needed to build up a digital collection: from the collection itself, the data that describes the objects and the user groups. Such a systematic planning should provide the basis for high quality content!

The Thematic Approach

The concept

The thematic approach of cultural heritage information within REGNET is based on a cultural heritage data triangle. This triangle consists of “themes”, “thematic texts” and “collection object images & descriptions”(see figure REGNET – Cultural heritage data triangle).

Whereas collection objects images and descriptions pertain in most cases strictly to the objects themselves (see the equivalence with the standard collection management systems fundamentals), thematic texts are delivering supplementary and contextual information related to an object or to a set of objects. Moreover, these thematic texts are set up in a way that they can be reused for different contextual views and situations.

In contrast with collection objects and thematic texts, which contain real basic descriptive information about something, themes reflect the scope, resource locations and relationships of all this descriptive information. By their nature, themes are part of the Ontology function of the REGNET system. On a more practical side, a theme can be considered as a collection of thematic texts and/or objects that have one or more characteristics (“keywords”) in common. One can easily understand that with this approach the theme concept is quite flexible and that a single thematic text can show up in different contexts. Themes can either refer to thematic texts only, or to collection objects only or, in most cases, to both, thematic texts and collection objects. The term “fragment” reflects best the “building block”-functionality, vis-à-vis the themes, of those thematic texts and collection objects.

Applying this theme-fragment approach on a search into a collection databases, a user can gain access in an indirect way to the objects via themes and thematic texts in addition to the standard direct search through the same objects. The opposite, accessing information contained in themes and thematic texts through the objects’ databases, is also possible.

REGNET - Cultural heritage data triangle
REGNET - Cultural heritage data triangle

A theme, in the concept of REGNET, should not be defined as a predefined hierarchy of a set of thematic texts and collection objects with fixed relationships. The constituting parts of themes should be build up of data containing a set of keywords that reflect the main content, the main idea of it. Those parts can be (automatically or manually) grouped together into a theme by the keywords that accompany them. One can easily imagine that a text that has been written as part of a range of texts about measuring time can also show up as part of a theme about famous scientists. A contribution about a certain saint can show up in a theme about things or persons of which people believe that they will give protection.

However, in order to achieve quickly a coherent and consistent entity of related information, the production of texts and the digitising of objects has to be done following a certain top down approach (« » the keyword bottom up approach described higher). Hereby each content provider has to choose a central theme with a set of related themes, within which she/he can develop a substantial number of relevant thematic texts and related objects.

Mapping of Themes to the Main REGNET-functions

The data entry part of themes has many facets and varies from scanning and OCR, over on line Z39.50 connections, to specific fragment data entry and topic map generation. The use of themes in outputting something interferes with a lot of functions incorporated in the REGNET system as can be seen in the figure “Theme mapping to functionalities”. Themes, together with the thematic texts and the objects, can as well deliver constituting parts for Search & Retrieval and Publishing as offer supporting material for the e-Business part.

REGNET - Theme mapping to functionalities
REGNET - Theme mapping to functionalities

References

  1. AIT Applied Information Technique Research Ltd
    URL: <http://www.regnet.org/> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Silke Grossmannn

Silke Grossmannn works for IMAC, Constance as a Project Manager / Consultant. She is the Project manager, responsible for the field of electronic markets, a Consultant in the field of information and knowledge management and offers Training: Themes: Information delivery in online services, Internet-Information systems, Intranets and in the use of new media. She is participating in the following projects: building of an online community in Switzerland (Arte 24), Consultation of museums on the application of new media, development of a software system for museums (Museums Access), creating a variety of projects in the area of Online-Services (information needs analyses, conception of information services and systems, evaluation of information services and systems). Realization of an empirical study in the field of regional and specialized electronic markets, conception and realization of different types of electronic markets (needs analysis, definition of a portfolio for an offer, creation of a business plan, project management in the realization and application of a marketing strategy). Participation in EU-Projects: NBE-BIS, ETTN, DeMes.

Prior to joining IMAC Silke worked at the University of Constance as a Lecturer in Information mediation and as a Scientific assistant to Prof. Rainer Kuhlen, Creating and updating Web sites about the online and information market. Before this she was a Librarian working in Information structure and classification, formal and technical cataloging

Vic Haesaerts

Vic Haesaerts has a degree in Electronic Engineering. He has considerable experience in the management of innovative projects in the telecommunications and banking sector. His research activities have mainly been focused on artificial intelligence techniques and electronic distribution channels. At the moment he is the managing director of Tarx and an expert-evaluator for European projects.

Walter Koch
kochw@ait.co.at Link to an email address

Professor Walter Koch received a PhD in Mathematics and Physics from University of Graz in 1970. He is shareholder and director for AIT Research Ltd. (Angewandte Informationstechnik Forschungsgesellschaft mbH.), and is an Associate Professor at the University of Graz and a Guest Professor at the University of Krems. Prof. Koch's relevant experience includes: heading different research institutes at JOANNEUM RESEARCH Ltd, working as a Consultant to different national and international organizations (eg UNESCO, DFG, EC, Austrian ministries, Graz municipality). He is a member of several international and international scientific associations (eg. ICOM, ÖGDI, ONORM, VÖB, UNESCO). Walter has a vast amount of project experience in bibliographic information, information systems, IT-management, EU Projects (TAP, Raphael, TenTelecom; IST), ESA, CIMI. He is the overall Project Co-ordinator and managert of the EU-project REGNET, Cultural Heritage in Regional Networks. He is also a personal consultant to the EU-Project DIGICULT. Walter has published more than 70 papers and technical reports, and presented papers in four continents at least at 100 conferences, seminars and workshops.

Gerda Koch
kochg@ait.co.at Link to an email address

Gerda Koch received a Master of Arts in English with a combination of studies in Economics and Law at the University of Graz, and also received an academic degree in a course of studies on Multimedia at the University of Graz. Gerda has been employed at AIT, Applied Information Technique Ltd since 1991. There she has worked on diverse informatics projects, for example a production automation system for a window-producing company, the database publishing production of the yearly published Austrian Statistics on Non-University Education. She also collaborated in the European Union TenTelecom project MOSAIC (Museums Over States And vIrtual Culture) for the Austrian partner of the project.

Since 1998 Gerda Koch has been managing director of AIT Ltd. She now focuses on cultural digital content provision whilst working on the national project MODOK (Modular Documentation System) for the Austrian Ministry of Science and Traffic and the EU-project COVAX (Contemporary Culture Virtual Archive in XML). Within the EU-project REGNET, Cultural Heritage in Regional Networks Gerda Koch is also responsible for co-ordination and project management.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Grossman, S., Haesaerts, V., Koch, G., and Koch, W. "Cultural Heritage in Regional Networks: The REGNET Project", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/regnet/>

-------------------------------------------------------------

Strategic Development of European Networks in Biomechanics

By Gordon Clapworthy - February 2002

International projects, such as the investigation of the Human Genome, have demonstrated what can be achieved when an international academic community organises itself in a concerted set of activities. Gordon Clapworthy introduces BioNet a project which has taken its inspiration from such far-sighted projects and seeks to emulate them within the relatively constrained area of biomechanics.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

The highly successful Decade of the Brain, 1990-2000, produced both significant scientific progress and a raised awareness of the impact of brain disorder. The achievements of this initiative led to the declaration of the Decade of the Bone and Joint, 2000-2010. This aims "to improve the health-related quality of life for people with musculo-skeletal disorders by working in partnership with all stakeholders to raise awareness, identify needs, empower patients, promote cost-effective prevention and treatment and advance understanding through research to improve prevention and treatment." It has been endorsed by figures of international stature, including Kofi Annan and Pope John Paul II, and is supported by organisations such as the United Nations, the World Health Organisation and the International Red Cross.

BioNet [1], IST-2000-28074, has taken its inspiration from such far-sighted projects and seeks to emulate them within the relatively constrained area of biomechanics. Biomechanics is a diverse and multidisciplinary field, which makes it difficult to create partnerships containing the range and blend of skills necessary to tackle the increasingly complex problems that are arising. It also makes it difficult to create a coherent strategy for future development, that is, to gain widespread agreement on what the fundamental problems of the moment are, and to ensure that they are addressed in a coherent and rigorous way.

Musculo-skeletal conditions are the most common cause of severe long-term pain and physical disability; they affect millions of people across the world. For example, joint diseases currently account for over half of all chronic conditions in persons over 65 years old.

Further, these conditions do not have to be chronic or disabling in order to have a sizeable impact on the lifestyle of the sufferer. Most people will have, at some time, pulled a back muscle or twisted a knee and will thus be able to testify to the discomfort and disruption to normal movement that even such minor complaints can cause. It then becomes easy to comprehend the substantial impact that a more severe, but not necessarily chronic, condition can have on an individual's life.

In addition to the individual cost, the demand for resources to treat and alleviate musculo-skeletal disorders continues to put an increasing strain on national health systems. This strain will be exacerbated by the general demographic trend towards an older population. Not only will a higher proportion of the population suffer from such conditions, but as the average lifespan continues to increase, treatment will be required for longer periods.

Although the medical aspects of biomechanics have the highest profile, it is important in many other areas - sports science is an obvious one - and there are many uses in industry, for example in car design, both in normal use and when involved in a crash. In fact, biomechanics is relevant to any activity in which the body is in motion or in which the body is subjected to physical stress and strain. And while the main focus is always on the human, biomechanics is equally relevant to studies involving animals.

Project Concept

BioNet is funded under the Working Groups Programme of IST. It began in September 2001 and will run until the end of November 2002. The main concept of the project is that biomechanics should confront its multidisciplinary nature and undertake to involve all of its participants in establishing a consensus regarding its major future directions. BioNet will provide a forum within which future priorities can be established, and the means by which responses can be harmonised, so that these priorities are addressed with greater directness and economy.

BioNet will create, within the European biomechanics community, one (or, more likely, several inter-related) Working Groups to allow the development of concerted approaches to issues of high contemporary relevance. Until now, collaboration, particularly across disciplines, has generally been the result of ad hoc meetings and chance acquaintanceships. BioNet will improve communication between the many disciplines involved in biomechanics and thus activate synergies that may, at present, be dormant.

The project also seeks to unite all aspects of the biomechanics community - industrial, educational, clinical, research, charity - into a coherent structure, and will provide an on-going resource to support and co-ordinate collaborative work in biomechanics. Once initiated, much of the future activity will be sustained by the international or European societies that are active in the area.

Project Structure

The central activity of the project is a one-off conference in April 2002, though as its nature is radically different from standard international conferences, we prefer to describe it as the Event. The Event will provide a forum for the discussion of contemporary issues, with a view to gaining widespread agreement on the areas in which concerted activity is likely to produce the greatest benefits.

The Event divides the project into three distinct phases. We are currently in the pre-Event phase, the programme for the Event is now complete, and we are launching a series of discussions to define the topics mostly suited for short and middle term activity. We are greatly assisted in this scene-setting by the existence of the BIOMCH-L mailing list, which is the major medium for communication within the biomechanics community worldwide.

The Event is described more fully below. Post-Event support will be provided to the emerging Working Groups. BIOMCH-L has promised to host mailing lists on specific topics so that the agendas of the different groups can be developed. The whole of this activity will be carefully monitored to ensure that all necessary decisions are taken before the end of the project in November 2002.

The project is very timely because another EC-funded project, VAKHUM, IST-1999-10954, will be completed by June 2002. This project is producing, for the first time, a comprehensive set of morphological and kinematic data, gathered according to published scientific protocols and will provide new opportunities for future work in research, education and industry.

The Event

The Event, "Biomechanics in the Decade of the Bone & Joint", will take place in Brussels on 27-29 April 2002. It has an inclusive policy, so registration fees are being kept to a minimum to encourage the widest participation. Fees for students are being kept particularly low to promote the involvement of the next generation of researchers, as are the fees for delegates from Central & Eastern Europe, in line with the initiatives of the European Society of Biomechanics to encourage the rapid development of biomechanics in these countries. The emphasis of the Event is on computational biomechanics.

The programme will start with keynote addresses on the historical development of biomechanics and a survey of some of the current "hot topics" by internationally renowned speakers, including some of the "founding fathers" of biomechanics. This will be followed by a poster session in which all current work in biomechanics will be displayed, arranged by subject, to provide a unique "snapshot" of European biomechanics.

The highlight of the first afternoon is a "Get It Off Your Chest" session in which participants will have the opportunity to make a controversial statement, with a maximum duration of 5 minutes, on a topic in biomechanics on which they feel strongly. There will be no discussion at this point (for reasons that are probably obvious) - an opportunity to reflect on these presentations is provided on the second day. There will also be a chance to hold discussions with like-minded professionals in Birds of a Feather meetings.

The second day begins with two pairs of parallel sessions presented by international speakers specifically selected for their expertise in the areas being considered. There will be an opportunity for a partner search before a panel session in which the previous presentations will be considered and the foundations laid for establishing the specific working groups. Concluding remarks will be made by distinguished speakers from both sides of the Atlantic.

We have been fortunate that so many highly respected figures in biomechanics have agreed to make presentations - it promises to be a landmark event, and one that will provide European biomechanics with a timely impetus in its future development.

References

  1. BioNet
    URL: <http://www.mk.dmu.ac.uk/bionet/> Link to external resource

Partners

Prof. G J Clapworthy
De Montfort University

Dr. Serge Van Sint Jan
Université Libre de Bruxelles, Belgium

Dr. Marco Viceconti
Istituti Ortopedici Rizzoli, Italy

Dr. Diane Dan
ESI Software, France

Author Details

Prof. G J Clapworthy
Dept of Computer & Information Sciences,
De Montfort University,
Milton Keynes MK7 6HP,
UK

gc@dmu.ac.uk Link to an email address

Gordon Clapworthy is Professor of Computer Graphics at De Montfort University. His main research interests are medical visualisation, biomechanics, computer graphics, figure animation, robot teleoperation, virtual reality and fundamental graphics algorithms.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Clapworthy, G J. "Bionet: Strategic Development of European Networks in Biomechanics", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/bionet/>

-------------------------------------------------------------

Going Beyond Traditional Digital Libraries for Cultural Heritage: The COLLATE Collaboratory

By Adelheit Stein, Ulrich Thiel and Jürgen Keiper - February 2002

Project COLLATE develops a new type of a WWW-based collaboratory for cultural heritage information. The implemented system provides access to a newly constructed digital library on rare sources of historical film documents. Professional domain experts use the COLLATE system to analyze, evaluate and collaborate on the interpretation and indexing/annotation of the digital repository documents. The hereby provided metadata are managed by an advanced XML-based content manager and an intelligent content and context based retrieval system.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

Collate logoIn September 2000 an international team of cultural content providers, film domain experts and technology developers - together with a designated evaluation partner - started out to develop and put into practice a new type of collaboratory in the domain of cultural heritage. The EU-funded project “COLLATE – Collaboratory for Annotation, Indexing and Retrieval of Digitized Historical Archive Material” (IST-1999-20882) is part of the EU DIGICULT programme, and runs for three years: see the project Web site [1].

During the last few years the idea of collaboratories emerged in the Natural Sciences area and was applied by several disciplines and research groups. As defined by Kouzes et al. [2] a “collaboratory” (merger of the terms collaboration and laboratory) is a virtual centre on the Web, where professionals and lay persons are provided with the means for interacting with colleagues, accessing instrumentation, sharing data and computational resources, and accessing information stored in digital libraries and archives.

Various collaboratories have been employed since the early 90s, mainly in Natural Sciences, but so far we have found – aside from some systems with very limited functionality – only a few similar efforts in Arts and Humanities. Whereas the organisation and preservation of historical knowledge in Arts and Humanities are still comparable, some of the work processes in the more interpreting sciences are different and need to be supported by appropriate system functionalities. Large collections of important historic and cultural sources are scattered in national archives with no electronic versions available, so that immediate access to and work with this material are severely impeded. On the other hand, there exist many – but mostly informal and non-institutional – contacts between cultural archives constituting specific professional communities. However, effective and efficient technological support for collaborative knowledge working is still missing. Technologically, the World Wide Web can serve both as a standard communication platform for such communities and as a gateway for document-centered digital library applications.

The COLLATE system not only provides the functions of a traditional digital library, but on top of that it employs a generic approach to collaborative knowledge working with cultural sources, i.e. supporting users in their analysis, interpretation, evaluation of the sources and the creation of new knowledge as a result of this work with the material. Therefore, the system and its components are adaptable to heterogeneous cultural domains. As an example application COLLATE focuses on the film domain in general, and in particular on specific questions of historic film documentation like comparative film censorship analysis in several countries. Three major European film archives are providing several thousand digitized multi-format documents on European early 20th century films for COLLATE’s data repository. But in principle the tools and user interfaces developed can easily be adapted to other content domains, types of applications and users.

The COLLATE Approach

Designed as a content and context based knowledge working environment for distributed user groups, the COLLATE system supports both individual work and collaboration of domain experts who are analyzing, evaluating, indexing and annotating the material in the multimedia data repository. It continuously integrates the user knowledge thus derived into its metadata repositories, and on this basis can offer improved content-based retrieval functionalities within the information system [3]. Users can therefore both access and create valuable knowledge about the cultural, political and social contexts, which in turn allows other end-users to better retrieve and interpret the material.

Technology development on the one hand and extended empirical investigation of the acceptance of a collaboratory in the cultural domain on the other are the two backbones of the COLLATE project:

Results from both areas of project work strongly influence each other to enable an iterative, dynamic system development. Evaluation steps are explicitly built in, and the users themselves are actively involved throughout the various development cycles.

A first prototype of the COLLATE system was implemented at the end of the first project year, allowing formal cataloguing, content-based indexing and annotation of digitized text documents and document passages [4]. Subsequent system versions will incorporate advanced document preprocessing modules for automatic analysis and indexing of multimedia data, and especially more extended support of the collaboration between users based on an explicit collaborative task model.

COLLATE focuses on the film domain and has built up a large digital repository of rare historic film censorship documents from the 20s and 30s, most of them as yet unavailable in electronic form, but scattered in various archives; and a proportion is not yet even analyzed and catalogued. These documents are highly relevant for film historians analyzing censorship history, as well as for any social scientists. For a subset of significant films the data repository offers in addition enriched documentation, including digitized newspaper articles, photos, stills, posters and film fragments. In-depth analyses and comparison of such documents provide, for example, evidence about different film versions and cuts, which can be used for the reconstruction of lost or damaged films or for the identification of actors and film fragments of unknown origin.

As a prerequisite for collaborative work all material is analyzed, indexed, partly translated, annotated and interlinked by film experts. The COLLATE system provides them with appropriate task-based interfaces for in-depth indexing/annotation and other tasks as well as with supporting knowledge management tools (indexing aids and special keyword lists). End-users may also take an active part in evaluating sources and adding valuable information through annotations. In this way, a growing body of metadata is emerging over time. The system exploits this data by employing advanced XML-based content management and advanced retrieval methods [5]. The final version of the online collaboratory will integrate cutting-edge document processing and management facilities, e.g., XML-based document handling, digital watermarking and semi-automatic segmentation, categorization and indexing of digitized text documents and pictorial material (photos, posters, film fragments).

By combining the results from the manual and automatic indexing procedures, elaborate content-based retrieval mechanisms can be applied. This helps users find what they are actually looking for, to combine evidence from various sources and to interrelate so far unrelated sources and knowledge. Thus, not only the size and richness but also the quality, affordability and acceptability of the information repository are constantly being improved.

The COLLATE System

The system features innovative models and techniques in the following areas:

Offering content and context based access to the data repository is a crucial feature of COLLATE. To implement an information system with advanced retrieval functions, we must go beyond current practices of merely providing digital reproductions of and simple online access to historic sources. Instead, results from current and previous scholarly work such as evaluating and indexing these sources must be incorporated into the information system, e.g., in the form of metadata and annotations.

COLLATE users are directly and indirectly involved in system development because they participate in enriching the document repository through new sources and successive annotations and indexation. In particular, the focus on annotation as a typical task in the domain of the humanities is central to the COLLATE concept. Annotation as a multifunctional means of in-depth analysis can be done individually but also collaboratively, for example in the form of annotation of annotations, collaborative evaluation and comparison of documents. As a result, a large amount of value-added information is being provided in addition to the digitized documents.

Support of collaborative work goes beyond contemporary groupware products, offering innovative functions such as:

The dynamic accumulation of value-added information through annotations requires the data structures to be scaleable and extensible. In order to capture these dynamics we chose XML as a de facto standard for the encoding of generic document and metadata representation schemata. Through the use of XML we are able to guarantee the generality of our approach, since these schemata can be enriched and tailored to additional sources and knowledge incorporated into our system without any need for re-modeling the whole system. In addition, XML is the basis for the integration of knowledge processing methodology and retrieval functionality in the system. Therefore, the system is capable of capturing the dynamics of collaboration without neglecting the necessary flexibility of scaleable and extensible representation schemata, which can be transferred to other content domains as well.

The COLLATE collaboratory is a multi-functional software package integrating a large variety of functionalities, which are realized by inter-related software modules. It comprises several databases and different document representation schemata. XML is used as the uniform internal representation language for the documents in the repository and the associated metadata as well as for the implementation of the communication protocol among its system modules.

Figure 1: The COLLATE system is structured into several functional layers
Figure 1: The COLLATE system is structured into several functional layers

Three document pre-processing modules are being developed and will be incorporated into the final COLLATE system version:

As shown in Figure 1 the COLLATE system is structured into several functional layers:

Operational Layer – The distributed digital data repository comprises a variety of data, ranging from scanned-in text documents to multimedia data and the accumulated annotations related to one or more of these original data sources.

Domain Metadata Layer – In order to organize the stored data in a way that supports the complex knowledge-intensive tasks users perform suitable tools for metadata management are being provided. The knowledge structures, which are represented by specific XML schemata, constitute the Domain Model. They comply with the TEI (Text Encoding Initiative) and CES (Corpus Encoding Standard) metadata standards, but we needed to extend these in order to cope with the rich structure of our domain.

Collaborative Task Layer – The COLLATE system allows a wide variety of user types to access, work with and evaluate the digitized material. A generic task model has been developed for complex working tasks like source edition, identification of lost or cut film scenes, preparation of a virtual exhibition, etc. As some of these tasks can be performed collaboratively, e.g., collaborative inspection and interpretation of source material, a collaboration model is being developed which builds the basis for offering context-dependent interface functions for collaboration between users.

Interface Layer – In order to support the users in accomplishing their tasks COLLATE provides appropriate interfaces for convenient work with the digital documents. In future system versions, these interfaces can be semi-automatically derived from the underlying task model. Certain specialized interface components for annotation, mark-up, editing, search and retrieval are used to facilitate user interaction. The specification of the interface structure also utilizes XML to allow for generic mapping to concrete instantiations (e.g., Java Swing).

As indicated above, communication between these layers is realized through XML-based communication protocols. The implementation employs XML transformations (XSLT) as a basis for the communication infrastructure in the COLLATE system.

Conclusion

The COLLATE system represents a new type of collaboratory supporting work with cultural document sources by innovative task-based interfaces for in-depth content indexing and annotation. In this way, the access and retrieval of content from traditionally scattered and electronically unavailable sources is significantly improved. Furthermore, advanced features of the system - such as digital watermarking, semi-automatic document segmentation and picture analysis - increase the usability of the system.

References

  1. Project Web Site
    URL: <http://www.collate.de> Link to external resource
  2. Collaboratories: Doing Science On The Internet, Richard T. Kouzes, James D. Myers, William A. Wulf: IEEE Computer, Volume 29, Number 8, August 1996.
    URL: < http://www.emsl.pnl.gov:2080/docs/collab/presentations/papers/IEEECollaboratories.html> Link to external resource
  3. Brocks, Holger; Thiel, Ulrich; Stein, Adelheit & Dirsch-Weigand, Andrea (2001) Customizable Retrieval Functions Based on User Tasks in the Cultural Heritage Domain, in Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries (ECDL '01), September 4-9, 2001, Darmstadt, Germany, Berlin: Springer, 2001, 37-48.
  4. Keiper, Jürgen; Brocks, Holger; Dirsch-Weigand, Andrea; Stein, Adelheit & Thiel, Ulrich (2001) COLLATE – A Web-Based Collaboratory for Content-Based Access to and Work with Digitized Cultural Material, in Proceedings of the International Cultural Heritage Informatics Meeting (ICHIM '01, ) ed. Bearman, D. & Garzotti, F., Milano: Politecnico di Milano, 2001, 495-511.
  5. Stein, Adelheit, Gulla, Jon Atle, Müller, Adrian & Thiel, Ulrich (1998) Abductive dialogue planning for concept-based multimedia information retrieval, in Integrated Publication and Information Systems. 10 Years of Research and Development, ed. Fankhauser, P. and Ockenfeld, M., Sankt Augustin: GMD – Forschungszentrum Informationstechnik, 1998, 129-148.

Author Details

Dr Adelheit SteinDr. Adelheit Stein
Project manager
Fraunhofer IPSI (Institute for Integrated Publication and Information Systems)
Dolivostrasse 15
D-64293 Darmstadt
Germany

Phone: +49 6151 869-841

stein@ipsi.fraunhofer.de Link to an email address
<http://www.ipsi.fhg.de/~stein> Link to external resource

Dr. Adelheit Stein is the head coordinator of the COLLATE project. She has been a senior researcher at Fraunhofer-IPSI (formerly GMD-IPSI) since several years. Her background is in Sociology and Philosophy, with a special focus on cognition and social interaction. At IPSI she was involved in several European IT projects and university teaching. Her current research interests include: human-computer interaction, collaboration support, dialogue planning, intelligent user interfaces.

Dr. Ulrich ThielDr. Ulrich Thiel
Project manager
Fraunhofer IPSI (Institute for Integrated Publication and Information Systems)
Dolivostrasse 15
D-64293 Darmstadt
Germany

Phone: +49 6151 869-855

thiel@ipsi.fraunhofer.de Link to an email address
<http://www.ipsi.fhg.de/~thiel> Link to external resource

Dr. Ulrich Thiel is responsible for the technical coordination of the COLLATE system. He holds both a diploma in Computer Science and a PhD in Information Science. Since several years he has been a senior researcher at Fraunhofer-IPSI (formerly GMD-IPSI). He was manager of many EU-funded projects. His research interests include: intelligent information retrieval, dialogue planning, conversational and adaptive information systems.

Juergen KeiperJürgen Keiper
Project manager
Deutsches Filminstitut – DIF
Schaumainkai 41 60596
Frankfurt am Main
Germany

Phone: +49 69 96 12 20 0

keiper@deutsches-filminstitut.de Link to an email address
<http://www.deutsches-filminstitut.de> Link to external resource

Jürgen Keiper is the scientific coordinator of the content-providers and film archives of the COLLATE project. He holds a masters degree from the University of Frankfurt, and his special background is in Theater, Media and Film Sciences. Since several years he has worked for the DIF (German Film Institute) in various research projects. His research interests include: film theory and criticism and social history of film. He is editor of the film journal Film und Kritik.

Bibliography

  1. Altamura, Oronzo; Esposito, Floriana & Malerba, Donato (2001) Learning to Correct the Layout.Extracted from Document Images, in Proceedings of the Workshop on Artificial Intelligence, Vision and Pattern Recognition in the 7th Congress of the Italian Association for Artificial Intelligence (AI*IA '01), ed. A. Chella, D. Malerba, Bari: 24 September 2001, 63-73.
  2. Brocks, Holger; Dirsch-Weigand, Andrea; Keiper, Jürgen; Stein, Adelheit & Thiel, Ulrich (2001) COLLATE – Historische Filmforschung in einem verteilten Annotationssystem im WWW, in Information Research & Content Management - Orientierung, Ordnung und Organisation im Wissensmarkt. Proceedings der 23. DGI-Online-Tagung 2001, ed. R. Schmidt, Frankfurt am Main: DGI, 2001, 183-196.
  3. Brocks, Holger; Thiel, Ulrich; Stein, Adelheit & Dirsch-Weigand, Andrea (2001) Customizable Retrieval Functions Based on User Tasks in the Cultural Heritage Domain, in Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries (ECDL '01), September 4-9, 2001, Darmstadt, Germany, Berlin: Springer, 2001, 37-48.
  4. Caneppele, Paolo (2001) Beschnittene Schaulust. Entstehung und Entwicklung der Filmzensur in Österreich. Ein Abriß. (1900-1938). Medien und Zeit, 16 (2), 2001, 22-34.
  5. Ferilli, Stefano (2001) Management of Cultural Heritage Material: The COLLATE project, in Proceedings of the Workshop on Artificial Intelligence for Cultural Heritage and Digital Libraries in the 7th Congress of the Italian Association for Artificial Intelligence (AI*IA '01), ed. L. Bordoni, G. Semeraro, Bari: 2001, 29-33.
  6. Ferilli, Stefano; Fanizzi, Nicola & Semeraro, Giovanni (2001). Learning Logic Models for Automated Text Categorization, in Advances in Artificial Intelligence AI*IA, ed. F. Esposito, Lecture Notes in Artificial Intelligence 2175, Berlin: Springer, 2001, 81-86.
  7. Keiper, Jürgen; Brocks, Holger; Dirsch-Weigand, Andrea; Stein, Adelheit & Thiel, Ulrich (2001) COLLATE – A Web-Based Collaboratory for Content-Based Access to and Work with Digitized Cultural Material, in Proceedings of the International Cultural Heritage Informatics Meeting (ICHIM '01, ) ed. Bearman, D. & Garzotti, F., Milano: Politecnico di Milano, 2001, 495-511.
  8. Malerba, Donato; Esposito, Floriana; Lisi, Francesca A. & Altamura, Oronzo (2001) Automated Discovery of Dependencies Between Logical Components in Document Image Understanding, in Proceedings of the Sixth International Conference on Document Analysis and Recognition (10-13 September 2001), Seattle: 2001, 174-178.
  9. Malerba, Donato; Esposito, Floriana & Altamura, Oronzo (2001) Learning Rules for Layout Analysis Correction. Paper presented at Workshop on Document Layout Interpretation and its Applications (DLIA '01), Seattle: 9 September 2001.
  10. Semeraro, Giovanni; Ferilli, Stefano; Fanizzi, Nicola & Floriana Esposito (2001) Document Classification and Interpretation through the Inference of Logic-Based Models, in Proceedings of the 5th European Conference on Research and Advanced Technology for Digital Libraries (ECDL '01), September 4-9, 2001, Darmstadt, Germany, Berlin: Springer, 2001, 59-70.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Stein A, Thiel U and Keiper, J. "Going Beyond Traditional Digital Libraries for Cultural Heritage: The COLLATE Collaboratory", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/collate/>

-------------------------------------------------------------

Other Areas

Disclosing Digital Cultural Wealth: Museums and the Open Archives Initiative

By John Perkins - February 2002

John Perkins, Executive Director of the CIMI Consortium, discusses the potential the Open Archives Initiative may hold for museums and talks about how CIMI have been testing the OAI protocol.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

Museums [1] have immensely rich information resources, but, access to much of the kinds of materials museums hold, is often not available through web search engines because it is in databases, dynamically generated, or in some other non-HTML form. These resources constitute what is becoming known as the Hidden Web estimated to contain 400-550 times more content than the commonly defined Web. [2]

If this problem alone were solved and all the Hidden Web resources were suddenly available for indexing the difficulty of finding reliable, useful, precise information would be seriously compounded not alleviated. One way to address this is through collecting and indexing metadata records, rather than indexing the entire contents of HTML pages, thereby providing greater possibilities for precision. This is essentially the traditional library approach of creating descriptive metadata and building union catalogs. However library-like catalogs are expensive to maintain and in the Web world difficult to find and hard to search across.

A particularly promising solution is to explore the utility of combining the best of traditional library and museum techniques such as creating descriptive metadata records in catalogs, with the best of new Internet techniques like large scale, machine harvesting of information. It is possible to consider this because of new developments in Web workable technical protocols, the uptake of XML as a way to package and transfer information, and the development of international standards for describing museum metadata content.

The Open Archives Initiative

The Open Archives Initiative (OAI) develops and promotes technical protocols and standards, collectively called the OAI technical framework, to facilitate the access to research information on the Web. It is based on the premise that a simple, easily implemented technical framework can allow holders of information to create repositories of metadata describing their resources that in turn can be harvested and made available for further processing or use. [3]

A vision of how the OAI protocol might be implemented is offered by the USA's Digital Library Federation. They describe a framework in which:

A data provider agrees to support a simple harvesting protocol and to provide extracts of item-level metadata in a common, minimal-level format in response to harvest requests from trusted service providers. …. A service provider …uses the harvest protocol to collect metadata … possibly after reaching some kind of formal agreement on terms and conditions of use. The service provider is then able to build intellectually useful services, such as catalogs and portals to materials distributed across multiple sites. The framework applies to a wide range of information resources of academic and scholarly interest including printed and electronic texts, science and social science data sets, visual materials, archival collections, geographic information system (GIS) data, sound and music, video, and any other type of resource for which metadata is typically created. [4]

A technical overview of the OAI Protocol and its relevance to museums has been previously described by CIMI [5] and presented at workshops [6].

CIMI's test of OAI

Because of the perceived potential of OAI for museums CIMI participated as a pre-release tester of the OAI protocol. [7] As part of the test we built a generic OAI-compliant repository [8]. The repository architecture is described in detail in [5]

The initial evaluation demonstrated that the OAI protocol is indeed simple to build. CIMI has limited technical resources and skills but was nonetheless able to successfully build an OAI repository that appears to be useful. Based on the positive experience as an alpha implementer, CIMI plans to continue explorations of the OAI protocol and research its use by museums.

One way is by making the code for the CIMI repository and its associated explanatory materials available for downloading from the CIMI Website. [8] We hope museums will take advantage of its availability to install, experiment and use the protocol.

CIMI is also interested in conducting a more formal, large-scale test of the OAI for museums as a CIMI testbed to begin testing the OAI in real-world applications. As part of this work we plan using OAI V.1.x in combination with scoped extensions and other applications necessary for aggregation processes (e.g. editorial control, content management and enhancement, registry) to harvest and collect museum metadata. The purpose of the research is to explore how a specific community of users can use the OAI protocol and to begin the process of assessing the effectiveness and efficiency of OAI.

OAI Implementations

In addition to the CIMI-proposed research, which is still in the planning stages, individual CIMI members are proceeding with investigations of their own.

Australian Museums Online (AMOL) are OAI-enabling their regional servers beginning with the Tasmanian Museum and Art Gallery in early 2002. The regional server network is what allows smaller institutions to supply object records to the National Database of Museum Objects [9]. When completed the regional server network will rely entirely on the OAI protocol and not an ad-hoc collection of statically and dynamically generated html pages.

The Canadian federal Department of Canadian Heritage (PCH) in conjunction with the Canadian Heritage Information Network (CHIN) and the National Library of Canada are experimenting with OAI as a method of harvesting and aggregating object records for the about-to-be-launched Canada Place Portal.

The Netherlands-based museum software company ADLiB Information Systems has implemented the OAI protocol in its Internet Server module that works in conjunction with any standard CGI Web server to publish museum data providing the potential for automated harvesting from a broadly distributed collection of institutions. ADLiB expects implementations where groups of museums such as Maritime Digital or IGEM will eventually employ OAI to create a joint web presence.

The Mellon Foundation has sponsored a comprehensive set of experiments in seven USA institutions to demonstrate the kinds of discovery and retrieval services that OAI might enable [10]. One of these projects involves the Research Libraries Group, Inc. researching how library catalogs might be transformed through the harvesting of RLG database records by an Internet search engine such as Google. Also with Mellon Foundation support, the University of Illinois at Urbana-Champaign (UIUC) and the University of Michigan (UM) are collaborating on a joint project involving digital library objects and special collections. The UIUC proposal [11] gives a particularly detailed description of a number of relevant research questions and the nature of issues to be resolved for OAI to become a successful tool.

Moving Forward

Regardless of the services developed there will be a number of issues relating to widespread adoption of the OAI protocol in the museum community. There is a need for the museum community to test hypotheses, assertions, and issues such as:

Exploring OAI

At this early stage of implementation and because OAI is designed to be a machine-based protocol it is difficult to find many human-consumable examples of implementations. The OAI Web site offers a useful Repository Explorer and links to tools [12] as does the CIMI site [8].

Conclusion

Both CIMI and many of our members have experience in the metadata harvesting business. It is this experience that motivates us to explore the OAI Protocol as an enabling technology to facilitate access to resources by making it easier for museums to expose and collect metadata. Sustained testing of the OAI protocol seems a logical and sensible research initiative that will bring us closer to making the rich information resources museums hold more widely available to researchers and other users.

References

  1. In this paper museums is used as a convenient shorthand notation intended to encompass museums, libraries, archives and all organizations usually included in the somewhat cumbersome moniker cultural memory organizations.
  2. A description of the concept of the Deep Web is at:
    URL: < http://www.brightplanet.com/deepcontent/> Link to external resource
  3. The current version of the OAI protocol document is available at
    URL:<http://www.openarchives.org/OAI/openarchivesprotocol.htm> Link to external resource
  4. Digital Library Federation Metadata Harvesting Testbed
    URL:<http://www.diglib.org/architectures/mdheoi.htm> Link to external resource
  5. Perkins, J. (2001) A New Way of Making Cultural Information Resources Visible on the Web: Museums and the Open Archives Initiative, in Museums and the Web 2001Selected Papers form an International Conference 2001, ed. David Bearman and Jennifer Trant, pp 87-92
    URL: < http://www.archimuse.com/mw2001/papers/perkins/perkins.html> Link to external resource
  6. CIMI's OAI workshop at MCN 2001.
    URL:<http://www.mcn.edu/mcn2001/workshop6.html> Link to external resource
  7. Information on the OAI alpha test.
    URL:<http://www.openarchives.org/news/oaipress001103.html> Link to external resource
  8. General information about CIMI's work on the OAI and the CIMI OAI Repository code and associated explanatory documents can be found in the Metadata Harvesting section of the Publications area on the CIMI Website at:
    URL:< http://www.cimi.org/publications.html#oai> Link to external resource
  9. AMOL's National Museum Object Database Open Collections
    URL: <http://www.amol.org.au/collection/collection_index.asp> Link to external resource
  10. Waters, Donald J. The Metadata Harvesting Initiative of the Mellon Foundation, inARL Bimonthly Report 217, 2001.
    URL: <http://www.arl.org/newsltr/217/waters.html> Link to external resource
  11. The University of Illinois Open Archives Initiative Metadata Harvesting Project Proposal to Implement a Scholarly Information Portal Using OAI Metadata Harvesting Protocols, 2001
    URL: <http://oai.grainger.uiuc.edu/proposal.htm> Link to external resource
  12. The OAI resource for tools and utilities
    URL: <http://www.openarchives.org/tools/index.html> Link to external resource

Acknowledgements

The author would like to thank Carl Lagoze for his contributions to CIMI's OAI research and Henry Stern for developing the CIMI Repository. Thanks are also due to the Open Archives Initiative, the Digital Library Federation, the Mellon Foundation, Bert Degenhart Drenth, and all CIMI members for enabling the author to participate in the development of CIMI's thinking on the use of OAI in museums.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

John Perkins
Executive Director
CIMI Consortium
Canada

Phone: +1 902 4295392

jperkins@cimi.org Link to an email address
http://www.cimi.org/> Link to external resource

John Perkins is Executive Director of the Consortium for the Interchange of Museum Information (CIMI). CIMI is a group of the world's most prestigious museums, technology companies, and libraries working to advance museum digital intelligence through standards, research, testbeds, advocacy, training and international collaboration. Current interests are in the area of digital information object management and interchange for museums, metadata harvesting, and distributed searching, mobile computing, and content architecture for Semantic Web applications.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Perkins, J. "Disclosing Digital Cultural Wealth: Museums and the Open Archives Initiative", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/activate/>

-------------------------------------------------------------

News from the Russian State Library

By Monika Segbert - February 2002

In December 2001 the Andrew W. Mellon Foundation, based in New York, approved the funding of a new project in the Russian State Library (RSL) in Moscow. The project will have a life time of 18 months, and a budget of $ 1,2 million. It will build on the results of the TACIS project ‘Creation of an Information System of the Russian State Library’ (1998-2000) and will be managed by the British Council in Russia. Monika Segbert reveals all.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Aims and scope of the project

The aim of the project is to support the modernisation of the RSL to become one of the leading digital libraries of the world, and to improve Russian and international users’ access to the information resources of the library. The project will consist of 5 closely interrelated components:

Component 1: Creation of the electronic bibliographic records of the RSL collection of books 1980 –1998

The EU – Tacis financed project “Creation of an Information System for the Russian State Library” resulted in an OPAC of some ½ million records covering parts of the 19th catalogue of Russian writers, dissertations, the catalogue of foreign books and recent acquisitions as from 1998. This is only a beginning in a library of this size, but a good basis to build on. This component will result in electronic cataloguing records of relevant literature from Russia of the last 20 years, which is the material most in demand. This will open up world-wide access to the bulk of Russian publications from 1980 onwards; it will also be the pre-condition for the automation of the circulation of items within the library (see component 4) which will greatly facilitate local use.

The project will use the files of the Russian Book Chamber from 1980 – 1998 to create 1 million machine readable bibliographic records in MARC21 format.

Component 2: Retroconversion of the RSL collection of musical scores

The Russian State Library houses a number of special collections, which contain rare items of interest to Russian and foreign scholars. The collection most in demand by scholars visiting the RSL is the collection of music scores. This collection is consulted by some 9000 users per annum, who request 40 000 items for consultation. Statistical evidence shows that 95% of these users come from the scholarly community. The collection has a unique value in that 60% of items are from Russian composers and composers from the former Soviet states, such as Ukraine, Belarus, Tadjikistan etc. The catalogue contains some 380 000 records.

Component 3: Pilot project for the automation of circulation

Currently the up to 5000 daily users of the RSL, the majority of which come from the education and research community, have to wait several hours until the books requested by them are delivered, or indeed only to find out that the book is on loan. The creation of electronic records of the RSL collections most in demand, e.g. the last 20 years, and the collection of musical scores, will create the electronic pre-requesite to start implementing the automated circulation module of the Aleph 500 integrated library software acquired under the Tacis project.

Component 4: Capacity building

The main goal of this component is to ensure maximum benefit from the investment in the project by addressing management issues related to the introduction of new technologies in a series of workshops for senior and middle management of the Library, predominantly those directly or indirectly involved in the project.

Component 5: Dissemination

The methodologies, processes and results that will be defined, applied and created in the course of this project will be of interest well beyond the Russian State Library, for example to the professional community in Russia, to national libraries and other major libraries in the former Soviet Union States, and to major libraries in other countries with special interest in the expanded access to the RSL collections. An international conference envisaged for Spring 2003 will present and discuss the progress and results of the projects.

The TACIS project

[1] The Mellon-funded project builds on the results of a project awarded in 1998 by the EU Tacis programme, “Creation of an Information System for the Russian State Library”; this project resulted in the selection of an integrated library system (Aleph 500), the implementation of the WebOPAC and online cataloguing, the creation of a catalogue database of ½ million records through retroconversion, the purchase and installation of hard- and software, and extensive staff training. The main goal of this project was to start opening access to the collections of the Library to national and international scholars. This project finished in May 2001.

The Russian State Library

[2] The Russian State Library, the former Lenin Library (fondly known as Leninka), is the national library of Russia, the largest library of Europe and the second largest in the world, holding over 42 million items. The Library has grown from the private book collection of count N.P. Rumyantsev (Chancellor of the Russian Empire) which he bequeathed to the nation for the purposes of “good education”. In 1831 the library, archive and museum collections of Rumyantsev were opened to the public in his St. Petersburg house. At that time the total size of the library collection was 28200 volumes and 710 manuscripts. In 1861 both the Museum and the Library were transferred to Moscow.

From its very beginning, the library has received a copy of all Russian books, newspapers and journals and its holdings have expanded to include private collections of prominent statesmen and scientists. In 1918 after the capital of the country was moved to Moscow, the Rumyantsev library became the main library of the country. In 1925 the library was renamed the V.I. Lenin State Library of the USSR and acquired the status of the national library of all Soviet states. In 1992 the library was renamed the Russian State Library, retaining its status as a national library, alongside the National Library in St. Petersburg.

The Library is unique in comprehensiveness and universality of its collections, which now contain some 42 million items: it has a rich collection of manuscripts starting form the 6th century; the collections contain archives, personal collections and autographs of prominent figures in Russian culture; special collections include such rare cultural heritage as old photographs, etching, cinema posters, rare maps with fine drawings, albums, precious musical scores. The Oriental Centre of the Library is famous for its Chinese and Japanese collections.

The Library also fulfils the functions of a public library, welcoming up to 5 000 visitors and circulating up to 35 000 items daily; over half of the users come from the academic community.

The Andrew W. Mellon Foundation

[3] THE ANDREW W. MELLON FOUNDATION, is a Not-for-Profit Corporation under the laws of the State of New York. The purpose of the Foundation is to "aid and promote such religious, charitable, scientific, literary, and educational purposes as may be in the furtherance of the public welfare or tend to promote the well-doing or well-being of mankind."

Under this broad charter, the Foundation currently makes grants on a selective basis to institutions in higher education; in cultural affairs and the performing arts; in population; in conservation and the environment; and in public affairs.

The British Council

[4] The overall management of the project will be undertaken by The British Council, Development and Training Services (DATS). DATS is a project management agency, which delivers technical assistance and project management services to multilateral donors such as the European Commission, the World Bank and the UK Department for International Development. The British Council is committed to library and information development and has extensive world-wide experience of managing library and information projects, and has worked with the Russian State Library since the early 1990s. In 1993 the British Council organised familiarisation visits for eight senior RSL staff to several European libraries. This initiative was funded by the EC and managed by the British Council. In 1994-1995 the British Council also managed a consultancy which resulted in the production of a report 'The Russian State Library: Planning for automation, 1995 - 2005 (Project definition report)' funded by the EC DG XIII. Between December 1998 and January 2000, the British Council successfully managed the implementation of the European Commission funded project “Creation of an Information System for the Russian State Library”.

References

  1. TACIS
    URL: <http://www.rsl.ru/tacis/> Link to external resource
  2. The Russian State Library
    URL: <http://www.rsl.ru/> Link to external resource
  3. The Andrew W. Mellon Foundation
    URL: <http://www.mellon.org/> Link to external resource
  4. British Council
    URL: <http://www.britishcouncil.org> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Monika Segbert MBE FLA(hon) Dipl.Bibl.
Project Management and Consultancy
Via Fondiglie 5-7
60030 Rosora (An)
Italy

info@monikasegbert.com Link to an email address
http://www.monikasegbert.com> Link to external resource

Monika Segbert provides advice and management expertise to multi-national projects in the cultural sector. She currently works with the Soros Foundation's Open Society Institute in Budapest to create a multi-country consortium of library consortia in 40+ countries to harness their buying power for affordable access to electronic journals and with DG Information Society of the European Commission to co-ordinate 25 new projects trialling the application of new ICT's in libraries, museums and archives in the EU and the new accession states of Central and Eastern Europe.

From 1998 - 2000 she was based in Moscow as the team leader of the EU -Tacis project for the Creation of an Information System for the Russian State Library. From 1995 -1998 she worked with the European Commission DGXIII Telematics for Libraries programme, with primary responsibility for developing research cooperation for the sector with the new accession countries. Prior to that Monika Segbert served in the British Council in Germany as Head, Libraries, Information and Books.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Segbert, M. "News from the Russian State Library", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/eifl/>

-------------------------------------------------------------

-------------------------------------------------------------

SEPIA: Safeguarding European Photographic Images for Access

By Edwin Klijn - February 2002

This article provides an impression of the first year of the SEPIA working group on descriptive models for photographic materials. The working group has analysed existing methods to describe photographic materials, focusing on the consensus and trying to find solutions that would enable registrars to describe photographs adequately for different purposes. The working group does not aim to develop another standard model but wants to provide tools for description of photographic materials that accept the fact that many institutions use different descriptive models.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

If you don't find it in the index, look very carefully through the entire catalogue.
(Unknown, Sears, Roebuck, and Co. Consumer's Guide, 1897)

In 1999 the European Commission on Preservation and Access (ECPA) initiated a project aimed at the long-term preservation of all kind of photographic materials and defining the role of new technology in collection management, called SEPIA (Safeguarding European Photographic Images for Access) [1]. The project was set up explicitly to bring together representatives from different types of institutions that hold photographs: libraries, archives and museums, as well as from research institutes. SEPIA was funded by the European Union under the Culture 2000 programme, initially for one year, which was extended to another three years last year.

SEPIA has become bigger, now representing 17 partners from 10 different countries. Its objectives remain the same: to actively promote exchange of ideas and experiences on a cross-institutional, national and international level by organising training sessions, exhibitions and public events and doing research on specific topics related to preservation of photographic materials.

The SEPIA working group on descriptive models focuses on descriptive models as they are currently used in relation to photographic collections. It consists of representatives from the National Library of Spain, Stockholm City Museum, the Finnish Museum of Photography, the Norwegian Museum Authority (Secretariat for Historical Photography) and the European Commission on Preservation and Access (project management). All working group members have hands-on expertise with describing photographic collections at their own institutions.

The starting point for the working group was the findings of the survey on photographic collections in Europe, carried out in the first year of the SEPIA project [2]. This survey, to which 141 European institutions (archives, libraries, museums and other cultural heritage organisations) with photographic collections responded, showed that many institutions did not use a standard descriptive model for their photographic materials. Those that did use standard descriptions often used different models. Also, many institutions used descriptive models that can be applied to non-photographic materials as well.

In general, institutions in Europe use a wide variety of descriptive models for describing their photographic collections. They share one common factor: hardly ever are they specifically geared to description of photographic materials. Quite often the rules of certain elements in descriptive models are stretched to make an adequate description of the photographic collection possible. When two institutions use General International Standard Archival Description - ISAD(G) to describe their collections, this does not automatically mean that their descriptions are similar. There can be differences in the selection of elements to be used, but also in the interpretation of ISAD(G) rules and elements.

But what elements make an ‘adequate description’ for photographic materials? The main challenge for the working group and the other SEPIA partners involved will be to develop a set of elements that is essential for description of a photographic collection. This data element set is not meant to replace existing descriptive models, but it should function as an advisory model that can be applied to the needs and requirements of a specific institution.

Another task of the working group will be to formulate recommendations to help those in charge of photographic collections to deal with descriptions in ‘the new environment’. Many institutions have been lured by the possibilities of the Internet to publish (part of) their photographic collections online. Yet, many existing descriptions do not meet up with the specific demands of a search-and-retrieval system, simply because they have not been created for this specific purpose. The working group will do further research on this issue, starting with an expert meeting about search-and-retrieval, which will be held at the Biblioteca Nacional in Madrid on 29 and 30 November 2001 [24].

One of the primary undertakings of the working group was to do further detailed research into how institutions describe photographic materials, what problems they have to cope with when describing photographic materials and the circumstances under which institutions choose or do not choose a particular descriptive model. An analysis was made of descriptive models currently applied to describe photographic materials. This working document provided a more detailed impression of how photographic materials are described in practice, elaborating on the results of the 1999 SEPIA survey [3].

On the 26 and 27th of April Stockholm City Museum hosted an expert meeting on descriptive models [4], where experts had been invited to share their experiences and participate in discussions. The working group has meanwhile formulated further recommendations. In both meetings the need for adequate descriptions for photographic materials was frequently emphasised.

A good description is the key to every collection; it makes a photograph visible, for the researcher, the registrar or the occasional visitor. It opens up a collection, providing access and enabling users to find what they are looking for. At the same time it offers an opportunity to register administrative information about the collection or item; how it was acquired, its physical condition, any access restrictions and similar management information.

The appreciation of photographic items, not only as an art form but also a source of (historical) information has stimulated an increasing public demand. Providing access has become more urgent than ever before. New technological developments offer new possibilities, but do not automatically produce answers to every problem. In some cases it still remains a question of old wine in new bottles. In a recently held survey of art libraries in the UK, many of the frequently mentioned problems that were recorded, sound all too familiar [5]. Determining the depth of indexing, what aspect of the image to register and the lack of information on the image are specific, 'traditional' cataloguing challenges; they have been there ever since the very moment people started cataloguing. These problems will not be solved by any advanced, automated system, but remain subject to human interpretation, defying standardized solutions.

The quality of descriptions can determine the success or failure of a digitization project. The reliability of a search-and-retrieval system depends to a large extent on the quality of the descriptions. Since describing is so labour intensive it can absorb a considerable, frequently underestimated part of the budget [6]. However, digitizing a photographic collection can often be a unique chance to catch up on cataloguing backlogs [7]. By providing digital derivatives of the originals, descriptions can be accompanied by visual references, thereby improving access.

With the ongoing development of the Internet and the advancements of network systems institutions holding photographic collections are tempted to make their collections interoperable and accessible on a national or international level. Initiatives like that are currently in progress on a national level in Norway (Feltkatalogen) [8], Sweden (ARKIS project) [9] and the Netherlands (FOTIOS-project) [10]. On an international level initiatives like the EVA (European Visual Archives)-project [11] and the EUAN (European Union Archive Network)-project [12] attempt to create Internet applications which allow users to search through multiple collections. One of the crucial parts of all these initiatives is to synchronise existing descriptions in such a way that they can be matched on a central level [13]. This is sometimes done by using an ‘intelligent’ search tool that allows cross-collection search-and-retrieval or by synchronization on the level of the different descriptions. Usually synchronization takes far more work, but may provide better results for search-and-retrieval purposes. Anyhow the issue of consistent, reliable descriptions can be very important for a project's failure or success.

Describing photographs takes a lot of time and expertise. Decisions are made on what part of the collection will be described, what level (every single photograph or subseries, series, subcollection or collection), what descriptive model will be used and what are considered to be the minimal requirements for an adequate description.

Regularly applied selection criteria for describing are: frequency of use, rareness and physical condition. These criteria can differ from collection to collection, but especially with large collections establishing selection criteria in advance can be very useful. When the Oslo City Museum started to use the Feltkatalogen [14] for their approximately 520,000 photographic items, initially every single photograph of an arbitrary part of the collection was described. After some time the Museum realised that the collection described was of limited interest to the public so it was decided to introduce selection criteria such as user needs, age and photographic technique to make the process of describing more efficient.

The way photographs are described depends for a large part on the nature of the collection. A photograph can be appreciated because of its value as an object (e.g. a rare, obscure photographic technique), its content ( e.g. portrait, landscape, event), its artistic value (e.g. work of a famous photographer) or a combination between these elements.

Most institutions look at their photographs, either as objects or as visual documents, or as something in between; this can influence the method of description. In general, those institutions that consider their photographs as objects are inclined to put more emphasis on the material aspects (photographic technique, dimensions, condition, etc.). When the content is the most significant feature of a collection, information about the 'what, where, when' aspects of the photograph will be highlighted.

Basically, the starting point is usually the role of a photographic collection in the organizational structure of an institution. If the photographic collection is just a part of the whole, the way the other materials are described can have a large impact on how the photographs are described. For instance, many archives, libraries and museums prefer one descriptive model for all their materials, since they find it more convenient to maintain one system than many different ones. Sometimes registrars have no choice but to use the institutional standard descriptive model to describe their photographic materials. Since most standard descriptive models are not specifically designated for photographic materials, they are often forced to stretch fields and rules considerably.

The size of the photographic collection in relation to the available resources can also influence the method of description. Especially institutions with very large collections usually opt for a 'top-down approach'. International descriptive standards like ISAD(G) [15] and AACR2 [16] both support multi-level description, allowing registrars to describe fonds, subfonds, series, items within one and the same model.

The 'top-down' approach can be a very useful method to increase control over a large collection. For instance, the Biblioteca Nacional in Madrid, holding about 600,000 photographic prints from both 19th and 20th century and about 700,000 negatives mainly from the 20th century, initially started to describe their collection at a high level for the publication of the inventory in 1989. From there they have adopted a strategy for cataloguing, using specific selection criteria such as user needs or conservation reasons to determine the level of description necessary for each part of the collection [17].

Apart from the practical advantages of a ‘top-down’ approach, multi-level description makes it easier to determine the correlation between different parts of a collection. A frequently heard complaint about item-level descriptions is that it is difficult to describe the relationship between the different items. Multi-level description can link every item to other related items or parts of the collection.

Another aspect that can influence the method of description is the availability of trained staff. Especially smaller institutions often lack specialized staff to describe photographic materials. Limited availability of (trained) staff sometimes urges institutions to look for other solutions. For the Swedish ARKIS 1 project about 1,000 unemployed people were called in to produce as many as 5 million records in two years' time [18]. The Netherlands Institute for War Documentation has a team of senior volunteers that provides descriptions for many of the Institute's photographs. After putting the German Colonial Society collection online the Stadt- und Universitätsbibliothek Frankfurt encouraged their users to send e-mails with suggestions to improve the descriptions [19].

In the SEPIA survey mentioned earlier 50 out of 120 respondents did not use a standard descriptive model, but their own models. The other institutions mainly used ISAD(G), ISBD (International Standard Bibliographic Description), MARC (Machine-Readable Cataloguing Record) to describe their photographs [20]. Other research, like for instance the ARLIS survey, also shows that, even in a relatively homogeneous group of predominantly art libraries, there is still a great variety in use of descriptive models [21].

Even if institutions use the same standard descriptive models, their interpretation of the same elements can be quite different. The role of interpretation becomes more and more important as the original model is not actually meant for describing photographic collections. ISBD, for instance, is actually a standard to describe library collections. In order to be able to include 'non-book materials' a specialised version of ISBD, ISBD(NBM) was developed. ISBD (NBM) is used a kind of guidance for the development national interpretations. Sometimes these national guidelines are again interpreted on an institutional level [22].

The lack of uniformity can be explained by many different circumstances mentioned above. Despite all the different approaches, when analysing different descriptive models used for photographic materials, there are a number of elements that almost every registrar will agree on. At the Stockholm meeting all 21 participants were asked to choose out of 53 elements 15 that they considered essential for an adequate description of a photograph. Elements like inventory number, creator/photographer, topical term, title, date of creation, photographic technique, copyright, location number, size, geographical location, date of acquisition, picture caption, colour/ b/w, custodian history and note area ranked high on the priority lists. When respondents to the ARLIS survey were asked how they formally described their images elements like artist/photographer, ID number, title, physical description, place name and date were frequently mentioned [23].

Without going into details about rules and interpretations, it may be possible to deduce some elements that any adequate description of a photographic item/material/object ought to contain. One of the aims of SEPIA in the three years to come will be to design and implement such a minimal model for description of photographic materials.

Figure 1: Stockholm model
Figure 1: Stockholm model

In general, the participants to the Stockholm meeting agreed that a descriptive model for photographic materials should contain information about the object, the content and the creator or photographer. Depending on the specific purpose of description (public, management or provenance) it should contain elements to facilitate access for the general public, elements related to collection and preservation management and elements with information about the creator/photographer, provenance and authority posts.

In the Amsterdam working group meeting the definitions in relation to the Stockholm model were refined. The ‘public’ part should contain search and retrieval elements of content, ‘management’ should include identification and physical description elements for administration and preservation purposes and ‘provenance’ is meant to include elements about the creator and the history of the item.

Another requirement that was mentioned on the SEPIA expert meeting -and that was also discussed in more detail on the working group meeting-was the facility of multi-level description. In the working group the basic idea behind ISAD(G), re-using the same elements for different levels, was considered to be a practical and desired feature. There should at least be a distinction between individual and grouping level. It is up to the preferences of a specific institution to choose the number and the exact names (fonds, subfonds, series, etc) of these groupings.

The Stockholm participants also expressed the idea that the model should provide a basis for possible future interoperability with other collections. In order to do so use of thesauri and controlled terminology lists in relation to specific parts of a description of photographic collections, like for instance photographic techniques, should be stimulated. The SEPIA working group considers it to be one of its primary tasks to integrate existing controlled terminology lists for description of photographic materials. So far research has proved that, although (on institutional and national level) some controlled lists are available, on an international level they are often lacking.

The complexity of a photographic item makes describing very complicated. In the working group meeting in Amsterdam a distinction was made between the object (the physical apparition) and the image (the contents of the object). Main difference is that an object only has one image. An image, however, can have more objects (e.g. a negative, a print, a digital derivative, etc.). The object on which the image appears for the first time is commonly considered to be ‘the original’. When describing a photographic item the object and the image are described at the same time.

In short the ‘consensus’ model (colloquially called SEPIADES after ‘SEPIA Data Element Set’), will be build on the ground structure with the following basic ideas:

Every element can be:

a. mandatory : it should always be filled out

b. strongly recommended: should be filled out with a value if the information is appropriate, but if not it can be left blank

c. optional: could be filled out with a value if the information is appropriate, but if not it can be left blank

One of the challenges of SEPIA for the next two years will be to develop a model for description of photographic materials that would include elements for an adequate and efficient description of photographic materials, according to the ground structure formulated in this first year. By organising national SEPIA meetings on descriptive models and using the SEPIA Web site(24)as a platform for discussion throughout the project, the working group encourages all experts in the field to give their feedback, realising that we can use all the help we can get on the long and winding road that lies ahead of us.

Partners

Partners are:

Associate partners are:

References

  1. SEPIA Web site
    URL:<http://www.knaw.nl/ecpa/sepia/> Link to external resource
  2. Klijn, Edwin and Yola de Lusenet (2000), In the picture. Preservation and digitisation of european photographic collections (Amsterdam: European Commission on Preservation and Access), also available as PDF on the Internet.
    URL:<http://www.knaw.nl/ecpa/sepia/linksandliterature/sepia1.h tml> Link to external resource
  3. This working document contains an analysis of how the four most frequently mentioned, international descriptive models (ISAD, ISBD, MARC, AACR2) and other descriptive models like FOTIOS (a Dutch model designed exclusively for photographic materials), SKOPEO (a model used for the European Visual Archives project), the Dataelementkatalogen (Swedish Fotosekratariat and National Archive of Sweden) and the Feltkatalogen (a Norwegian model to describe collections of cultural-historical material, art objects, books and photographs) are applied in relation to photographic materials. See: Deliverable 5.1 descriptive models for photographic materials,
    URL: <http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/deli verable51.pdf> Link to external resource and
    URL: < http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/sepia51.html> Link to external resource
  4. Report of Stockholm expert meeting
    URL: <http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/ex pertmeeting1.html> Link to external resource
  5. In 1999 the Institute for Image Data Research performed a survey amongst about 60 members of the Art Libraries Society (UK)- ARLIS. See: Graham, Margaret. E., The description and indexing of images. Report of a survey of ARLIS members, 1998/1999, , 13.
    URL:<http://www.unn.ac.uk/iidr/ARLIS/> Link to external resource
  6. Puglia, S., 'The costs of digital imaging projects' in: RLG Diginews (October 1999, Vol. 3, No. 5), When digitizing a photographic item Puglia estimates that about 27% of all costs is spent on metadata creation, including cataloguing, description, and indexing.
    URL:<http://www.rlg.org/preserv/diginews/diginews3-5.html> Link to external resource
  7. The Public Record Office (UK) started a pilot project in 1997 to digitize and describe about 10,000 glass plates that had not yet been catalogued and properly stored. By digitizing them it became easier to see what was on them. See: Klijn, Edwin and Yola de Lusenet (2000), In the picture. Preservation and digitisation of European photographic collections (Amsterdam: European Commission on Preservation and Access), 14-15.
    URL: <http://www.knaw.nl/ecpa/sepia/linksandliterature/sepia1.html> Link to external resource
  8. Registration of photographs in Norwegian museums , Siv Bente Grongstad and Siri Svettlåg ,
    URL:<http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/2.html> Link to external resource
  9. ARKIS II - a Swedish Archival Information System, Göran Kristiansson,
    URL:<http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/1.html> Link to external resource
  10. Deliverable 5.1. Report on descriptive models for photographic materials,
    URL:< http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/fotios.htm l> Link to external resource
  11. EVA Web site
    URL:<http://www.eva-eu.org/> Link to external resource
  12. EUAN Web site
    URL:<http://www.euan.org/> Link to external resource
  13. For instance: about the technical framework of the European Visual Archive project see: Horik, René van, 'Archives and Photographs: the 'European Visual Archive Project', in: Cultivate Interactive (Issue 3, 2001),
    URL:<http://www.cultivate-int.org/issue3/eva/> Link to external resource
  14. The Feltkatalogen is a Norwegian descriptive model to register cultural-historical objects. See also: Deliverable SEPIA 5.1. report on descriptive models for photographic materials,
    URL:<http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/2.html> Link to external resource
  15. General International Standard Archival Description,
    URL:<http://www.ica.org/> Link to external resource
  16. Graphic Materials - Rules for Describing Original Items and Historical Collections: (1982, 1996 US Library of Congress);
    URL: <http://www.tlcdelivers.com/tlc/crs/grph0199.htm> Link to external resource
  17. Description of photographic materials in the Biblioteca Nacional, Spain, Isabel Ortega,
    URL:<http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/3.html> Link to external resource
    Also: Deliverable SEPIA 5.1. report on descriptive models for photographic materials,
    URL:< http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/sepia51.h tml> Link to external resource
  18. ARKIS II - a Swedish Archival Information System, Göran Kristiansson,
    URL:<http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/1.html> Link to external resource
  19. Klijn, de Lusenet, In the Picture, 38-39
  20. Klijn, de Lusenet, In the Picture, 20-24
  21. The description and indexing of images. Report of a survey of ARLIS members, 1998/1999, Margaret. E. Graham, 8.
    URL:<http://www.unn.ac.uk/iidr/ARLIS/> Link to external resource
  22. Deliverable SEPIA 5.1. report on descriptive models for photographic materials,
    URL: <http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/marc.html> Link to external resource
  23. SEPIA Web site
    URL:<http://www.knaw.nl/ecpa/sepia/> Link to external resource
  24. Report of the meeting
    URL:<http://www.knaw.nl/ecpa/sepia/workinggroups/wp5/expertmeeting2.html> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Edwin KlijnEdwin Klijn
Project Manager on Descriptive Models
European Commission on Preservation and Access

edwin.klijn@bureau.knaw.nl Link to an email address
<http://www.knaw.nl/ecpa> Link to external resource

Edwin Klijn is Project Manager of the SEPIA working group ‘descriptive models for photographic materials’ and co-author of ‘In the Picture. Digitisation and Preservation of European Photographic Collections (Amsterdam, 2000). He has been working as Digitisation Officer at the European Commission on Preservation and Access (ECPA) since 1999 and was involved in several digitisation projects, specialising in database publishing on the Internet.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Klijn, E. "SEPIA: Safeguarding European Photographic Images for Access", Cultivate Interactive, issue 6, 11 February 2001
URL: <http://www.cultivate-int.org/issue6/sepia/>

-------------------------------------------------------------

Bringing CHILDE into the World

By Esther Gregory and Michael Ryan - February 2002

The CHILDE (Children's Historical Literature Disseminated Throughout Europe) [1] project was developed by Buckinghamshire County Library Service and European team, and six partners from Germany, Italy, The Netherlands, Ireland and the UK. In November 2000 a grant of £60,000/ €100,000 was awarded to the project.

The CHILDE partners worked together during the next 12 months to:

The following article describes this process and draws some conclusions from its success.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

A Twinkle in the Eye

In the early part of 2000, a chance conversation between two local government officers in Buckinghamshire sparked off an idea that led to the development of a unique cultural and educational project, CHILDE. The idea itself, to create a Web site based on Europe's rich but disparate heritage of historic children's book collections, would have remained merely a distant vision had a partner search, carried out through the European Cultural Contact Points, not proved so successful. An impressive range of organisations responded to the initial project brief and, after further discussion and exchange of ideas, committed themselves to a bid for funding from the European Commission's Culture 2000 programme. A detailed project plan and budget were devised and these formed the basis for the eventual bid to the Commission.

CHILDE Arrives

The project, which by the submission date of May 2000 had acquired the name CHILDE (Children's Historical Literature Disseminated throughout Europe) was developed by Buckinghamshire County Council's Library Service and European team, along with:

Technical advice and assistance was provided by the Higher Education Digitisation Service (HEDS) at the University of Hertfordshire and Buckinghamshire County Council's IT Unit.

The bid was successful and in October 2000 a grant of €100,000 was awarded to the scheme. The inaugural project meeting was organised for the first week in December and on a windswept and rainy day in Brussels, CHILDE was born.

A Purpose in Life

The key objectives of the CHILDE project were to digitise 1,000 illustrations from six European historical children's book collections and to display these on a specially created Web site. Consideration was given to the digitisation of text as well as images but the constraints of time and budget, along with the idea that this project would have scope for such further development in the future, led us to decide to concentrate the digitisation on visual illustration. The site was would also feature an Education Programme built around four main themes:

In addition, a Best Practice Report, drawing on the project partners' experiences of collection management, preservation, cataloguing and utilisation would be included on the site. All of this was to be achieved with the assistance of Email, fax, phone and through four trans-national meetings to be held in four partner countries over one year, although in the event this was to prove to be an eleven-month year. The culmination of the project would be an international conference, provisionally entitled 'Opening the Portal,' to be held in the lead partner's home locality of Buckinghamshire. At the time of writing, all of these objectives have been met, with some aspects, for example the Web site's underlying database, having developed a far greater, and necessary, sophistication than was at first envisaged.

First Steps

Aside from the Education Programme and the Best Practice Report, which were written and edited by all of the project partners, based on their professional knowledge and expertise and using the simple expedient of Word, the main technological and organisational challenges lay in the digitisation and Web site creation elements of the project. But before any of this could begin it was first necessary to select 1,000 images from a joint 'project collection' of almost one million books. The partners at the inaugural meeting in Brussels laid down a number of strict selection criteria. It was agreed that the four Education Programme themes, outlined above, were to guide this selection and that, partly with an eye on copyright considerations, we would only include images published before 1890.

The Family Photographs

The Project Plan, devised with the technical assistance of HEDS, called for the chosen images to be photographed and produced as 35mm slides. The digitisation process itself would be based on these slides. In this way the handling and transportation of many rare and fragile books would be reduced to an absolute minimum. Having agreed in Brussels on the selection criteria, each partner then commissioned professional photographers to produce two sets of 35mm slides. Again taking the advice of HEDS, considerable emphasis was placed on the importance of the quality both of the photography itself and of the physical characteristics of the resulting slides. All partners were agreed that the final images on the Web site would need to be of the highest possible standard if they were to be of real value to their end users. Early in 2000 a 'benchmark selection' of slides was taken to HEDS and subjected to a series of trial scans to highlight any potential problems. It is with no sense of embarrassment that we can state that a very small number of slides, for a variety of reasons, did not initially pass this self imposed quality test and that as a result, some illustrations were re-photographed as a guarantee of quality. The CHILDE site itself now bears witness to the success of this important aspect of the project.

Another important consideration was that of copyright. Although the original illustrations were now free of the European copyright legislation's 70-year statute of limitation on visual images, it was necessary for the photographers to formally assign copyright of the photographs to the CHILDE partnership.

Making Progress

a. The Images

Once the slides had been produced by each partner, using their own choice of photographer, they were posted to the Project Co-ordinator at her office in Chesham Library in Buckinghamshire. From here they were taken to HEDS at the University of Hertfordshire. HEDS organised the scanning and the production of 2700 dpi resolution master tif images, whose high definition would compensate for the slight loss of detail intrinsic in the photography process. From these master images CD ROMs with jpeg thumbnail and screen sized images, along with their unique file names, were produced for use on the Web.

b. The Data

The creation of the database, which would hold all the required bibliographic and image data, was in many ways the most important topic that the project team discussed at its second meeting, held in January 2001 in Dublin. Colleagues from Dublin City Public Libraries undertook the initial work of creating the database, using Microsoft Access. There then followed a period of time during which the draft database was bounced back and forth by email between the partners until a consensus was reached on the range and naming of fields. For Buckinghamshire County Council's I.T. Unit, who were providing essential technical support to the project, the reconciling of these different versions of the database was a major issue. By the time of the third project meeting in Bologna in May the design of the database was beginning to feel like a project in itself. However, all of the partners agreed on the final shape of the database and the construction of the CHILDE site moved on

c. The Web Site

The major design consideration was to understand how users would want to search the site and also, crucially, who we intended or expected those users to be. It had been agreed at the outset of the project that researchers, librarians, teachers and private individuals with an interest in historic children's bibliography would constitute the main audience for the site, but a plea was also made for children. In other words, the site designer would need to combine visual elegance with an ease of operation that would enable a wide range of users to find and make use of the site's various components.

The designer used Microsoft FrontPage to create the site pages. These were then enhanced e.g. through the addition of special language tags so that non-visual browsers could understand the text. Further manual additions were made to the site, including meta-data, such as keywords in various languages, to help promote the site with search engines. The designer also made a simple but very effective use of the European Union's signature colours of blue and yellow.

The database was interrogated via an Active Serve Page (ASP). This was essential in order to coordinate changes being made to the database by the project partners e.g. the names of searchable fields. A few JavaScript 'tweaks' were then added to the ASP to allow the search to focus on the field selected.

Once the various elements were in a sufficiently developed state (please note, we have not used the word 'complete' here) the trial site was placed in the development area of Buckinghamshire County Council's own site, where it was hosted on an Internet Information Server 5 (IIS5) - a Microsoft product that specifically supports FrontPage extensions.

Initially this trial site used only images from the Buckinghamshire Early Children's Book Collection and it was with a considerable sense of excitement that the CHILDE partners were soon able to make use of the Search page to seek out pictures and bibliographic data. As soon as the first few trial searches were undertaken it was clear that in addition to producing detailed book and image data, the user's computer screen would be filled with images of startling clarity. The site worked!

CHILDE Grows Up

Having got this far and having demonstrated that all the development work on the site was producing the results we wanted, it was now necessary for the project partners to complete their bibliographic entries on their parts of the database. It also remained for us all to complete work on writing and editing the Best Practice Reports and the Education Programme. And organise publicity and promotional events in all partner countries and organisations to tie in with the official Web site launch week in September. Oh, and also plan, prepare for and promote the transnational 'Opening the Portal' conference scheduled for the 19th October 2001. This was nothing if not a busy project!
Eventually however all of this work was achieved and the final writing up, budget balancing, invoice chasing, and general managerial tidying up could be done. Less than a year after the project's first transnational meeting in Brussels, the product all of the partners' ideas, efforts and continued commitment had been realised and CHILDE was ready to face the world. Efforts were made to register the site with the major Internet search engines and by the end of October 2001, after a few short weeks of existence, the CHILDE Web site had received 17,761 visits!

Looking Back

Before embarking on CHILDE, the authors spoke to people who had already been involved with other European Commission funded projects. Through this we gained a valuable insight into what should go right and what could go wrong with such a project. To go into detail about this would effectively require a whole new article, but we would like to just highlight some aspects of the way that this project developed that could offer pointers to other projects' futures.

Tomorrow's CHILDE

We hope that tomorrow's CHILDE has far to go. It was part of the original vision of the project that one of its outcomes would be the establishment of the foundations of a European network of historical children's book collections. At the time of writing, these foundations are in place. The opportunity has been created for all interested individuals and organisations to meet and build on these foundations in order to establish a European network and seek the resources necessary to give it life. All of the CHILDE project partners are aware of, and indeed are participants in, the considerable international work that is going on in the field of historical children's literature. We are also aware that the CHILDE Web site has the potential to become the principal European site for this type of digitisation and Internet activity. Now that would be a great future for this particular offspring of Culture 2000.

References

  1. CHILDE (Children's Historical Literature Disseminated Throughout Europe) Web site
    URL: <http://www.bookchilde.org/> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Esther Gregory
CHILDE Project Co-ordinator
Buckinghamshire County Library Service
County Library Headquarters
Gallery Suite
County Hall
Walton Street
Aylesbury
Bucks
HP20 1UU

esther_childe@hotmail.com Link to an email address
<http://www.buckscc.gov.uk/> Link to external resource

Esther Gregory worked for Buckinghamshire Library Service for 6 years. In 1999 she completed a two year postgraduate in Information Management and became Children's Librarian at Aylesbury Lending Library. Finally in the year 2000 she took on the role of Project Co-ordinator for the CHILDE project. Since writing this article, Esther has taken up the post of Virtual Library Development Manager with Bedfordshire Library Service.

Michael Ryan
CHILDE Project Manager/Principal Learning Support Officer:
Buckinghamshire County Library Service
County Library Headquarters
Gallery Suite
County Hall
Walton Street
Aylesbury
Bucks
HP20 1UU

mryan@buckscc.gov.uk Link to an email address
<http://www.buckscc.gov.uk/> Link to external resource

Michael Ryan has worked as a teacher and librarian for 30 years. He currently manages a number of projects, including the CHILDE project. He has written articles for professional journals on various aspects of children's and educational librarianship. Other publications include 'Computers for young people: a select booklist' (YLG) and the 'Buckinghamshire Early Children's Book Collection' (BCC), with Lesley Kumiega. He was until recently a member of the Library Association's Youth Libraries Committee and Chair of the School Libraries Group.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Gregory, E. and Ryan, M. "Bringing CHILDE into the World", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/childe/>

-------------------------------------------------------------

Evaluating Cross-Language Systems the CLEF Way

By Carol Peters - February 2002

Carol Peters reports on the Cross-Language Evaluation Forum (CLEF), an initiative that provides an infrastructure for the testing and evaluation of information retrieval systems operating on European languages. This activity first began in 1997 in the United States as a track in TREC (the Text REtrieval Conference series) but since 2000 has been coordinated in Europe under the auspices of the IST programme of the European Commission. In this brief overview, she describes the organization of the annual CLEF evaluation campaigns, lists the main results, and outlines plans for the future.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

The popularity of the Internet and the consequent global availability of networked information sources for an increasingly vast public have led to a strong demand for tools that permit the user to find information wherever and however it is stored, regardless of language boundaries. This demand has been stimulated by the fact that the Internet is no longer an English-only preserve and other language content is growing rapidly. It is estimated that by 2005 the non-English speaking Internet population will be approximately 80% and non-English content will be well over 50% [1].

As English is the first language of only about 5% of the world’s population, there is no doubt that this figure will rise steadily in the next decade with the Asian languages occupying a growing percentage. There is, therefore, a strong demand for efficient cross-language systems that allow users to search document collections in multiple languages and retrieve relevant information in a form that is useful to them, even when they have little or no linguistic competence in the target languages. However, such systems are not easy to develop and work is generally still in an experimental stage. Approaches currently being tested imply the integration of tools and methodologies from the fields of information retrieval, natural language processing and human-computer interaction among others. An intensive process of system testing and tuning is thus needed before the separate components can be implemented successfully in end user applications.

The aim of the Cross-Language Evaluation Forum (CLEF) is to assist this process by (i) providing an infrastructure for the testing and evaluation of systems operating on European languages, and (ii) creating test-suites of reusable data which can be employed by system developers for benchmarking purposes. These objectives will be achieved through the organization of a series of annual evaluation campaigns.

CLEF has established strong links with two successful IR system evaluation campaigns held in the United States and Japan: the Text REtrieval Conference (TREC) series organized by the National Institute for Standards and Technology (NIST) [2], and the NACSIS Test Collection for Information Retrieval (NTCIR) sponsored by the National Institute for Informatics of Tokyo [3]. Both initiatives include tracks for cross-language retrieval system evaluation (NIST will focus on English-Arabic retrieval in the next years; NTCIR is working on the evaluation of systems for mono- and cross-language retrieval for Asian languages). The three initiatives (US, Asian and European) aim at creating a network of complementary activities in the cross-language system evaluation area.

In this report, we describe the organization of the CLEF evaluation campaigns and list the main results achieved so far. The final section gives an idea of our plans and hopes for future campaigns. For more details, the interested reader is referred to [4].

The CLEF Consortium

CLEF represents a continuation and expansion of an activity that first began in the United States within the TREC conference series and is now coordinated in Europe and conducted as an EU-US collaboration. The CLEF 2000 and 2001 campaigns were sponsored by the DELOS Network of Excellence for Digital Libraries [5]. From October 2001, CLEF is being run as an independent project of the European Commission (IST-2000-31002). The consortium members are:

The CLEF Evaluation Infrastructure

Typically, an information retrieval system evaluation campaign permits participating groups to compare the performance of their systems with respect to given tasks, under a set of controlled conditions: a reference document collection; a standard set of queries; assessment of the relevance of the ranked list of results submitted by each participating system; a comparative analysis of the results. When the evaluation is performed in a number of successive campaigns, the accumulated resources of data and assessed result sets become increasingly valuable for the participating research groups providing them with material for independent system testing and tuning. CLEF aims at making this test data also available to the wider R&D community

The Tasks

CLEF provides a series of evaluation tracks designed to test different aspects of information retrieval system development. The intention is to encourage systems to move from monolingual searching to the implementation of a full multilingual retrieval service.

Multilingual Information Retrieval: This is the main task in CLEF. It requires searching a multilingual collection of documents for relevant items, using a selected query language. This is a complex task, testing the capability of a system to handle a number of different languages simultaneously and to merge the results, ordering them according to relevance.

Bilingual Information Retrieval: In this track, any query language can be used to search just one of the CLEF target document collections. Many newcomers to cross-language system evaluation prefer to begin with the simpler bilingual task before moving on to tackle the more complex issues involved in truly multilingual retrieval.

Monolingual (non-English) Information Retrieval: Until recently, most IR system evaluation focused on English. However, many of the issues involved in IR are language dependent. CLEF provides the opportunity for monolingual system testing and tuning, and for building test suites in other European languages.

Domain-specific Mono- and Cross-Language Information Retrieval: The rationale for this task is to study CLIR on another type of collection, serving a different kind of information need. The information, which is provided by domain-specific documents, is far more targeted than news stories and contains much terminology. It is claimed that the users of this type of collection are typically interested in the completeness of results. This means that they are generally not satisfied with finding just some relevant documents in a collection that may contain much more. Developers of domain-specific cross-language retrieval systems need to be able to tune their systems to meet this requirement.

Interactive Cross-language Information Retrieval: An interactive track that focused on the document selection problem was experimented with success in CLEF 2001. The design of future interactive tracks will be determined on the basis of input from interested participants.

For each task, the participating systems construct their queries (automatically or manually) from a common set of statements of information needs (known as topics) and search for relevant documents in the collections provided, listing the results in a ranked list.

The Test Collection

The main CLEF test collection is formed of sets of documents in different European languages but with common features (same genre and time period, comparable content); a single set of topics rendered in a number of languages; relevance judgments determining the set of relevant documents for each topic.

Multilingual Corpus: The document collection currently consists of nearly 1,000,000 documents in six languages – Dutch, English, French, German, Italian and Spanish. It contains both newswires and national newspapers. The collection used for the multilingual task in 2001 contained documents in five of those languages (Dutch was excluded). Two target collections were used for the bilingual track in 2001. Participants could query sets of either English or Dutch newspaper documents, using their preferred topic language. Spanish and Dutch were introduced for the first time in CLEF 2001 for different reasons. Spanish was included because of its status as the fourth most widely spoken language in the world. Dutch was added not only to meet the demands of the considerable number of Dutch participants in CLEF (the largest group) but also because it provides a challenge for those who want to test the adaptability of their systems to a new, less well-known language. The domain-specific task uses a different collection: the GIRT database of approx. 80,000 German social science documents, which has controlled vocabularies for English-German and German-Russian. The interactive track used data (documents in French and English and results) from the CLEF 2000 campaign.

Topics: The participating groups derive their queries in their preferred language from a set of topics created to simulate user information needs. Following the TREC philosophy, each topic consists of three parts: a brief title statement; a one-sentence description; a more complex narrative specifying the relevance assessment criteria. The English version of a typical topic from CLEF 2001 is shown below:

Title: U.N./US Invasion Haiti
Description: Find documents on the invasion of Haiti by U.N./US soldiers.
Narrative: Documents report both on the discussion about the decision of the U.N. to send US troops into Haiti and on the invasion itself. They also discuss the direct consequences.

The title contains the main keywords, the description is a “natural language” expression of the concept conveyed by the keywords, and the narrative adds additional syntax and semantics, stipulating the conditions for relevance assessment. The motivation behind these structured topics is to provide query “input” for all kinds of IR systems, ranging from simple keyword-based procedures to more sophisticated systems supporting morphological analyses, parsing, query expansion and so on. In the cross-language context, the transfer component must also be considered, whether dictionary or corpus-based, a fully-fledged MT system or other. Different query structures may be more appropriate for testing one or the other methodology.

For CLEF 2001, 50 such topics were developed on the basis of the contents of the multilingual collection and topic sets were produced in all six document languages. Additional topic sets were then created for Finnish, Swedish, Russian, Japanese, Chinese and Thai. Participants could thus choose to formulate their queries in any one of nine European or three Asian languages.

Results assessment: The number of documents in large test collections such as CLEF makes it impractical to judge every document for relevance. Instead, approximate recall figures are calculated by using pooling techniques. The results submitted by the participating groups are used to form a "pool" of documents for each topic and for each language by collecting the highly ranked documents from all the submissions. The assumption is that if a sufficient number of diverse systems contribute results to a pool, it is likely that a large percentage of all relevant documents will be included. All documents not included in the pool remain unjudged and are therefore assumed to be irrelevant. A main concern with such a pooling strategy is that if the number of not detected relevant documents is above a certain (low) threshold, the resulting test collection will be of limited future use in testing systems that did not contribute to the pool. A grossly incomplete pool would unfairly penalize such systems when calculating precision and recall measures. This pooling strategy was first adopted by TREC and has been subsequently employed by both NTCIR and CLEF. A number of studies have been made to test its validity [6], [7], [8], [9]. Relevance assessment of the documents in the pool is distributed over a number of different sites and performed in all cases by native speakers. The results are then analyzed centrally using recall and precision measures and run statistics are produced and distributed.

Results so far

This activity has grown in popularity and complexity since its beginnings in 1997 when a bilingual track was offered at TREC. Participation in CLEF 2001 was up more than 50% from the previous year, with 34 groups submitting results in one or more of the 5 different tasks offered: 9 from N.America; 21 from Europe, and 4 from Asia. Runs were submitted for all five tasks and for all twelve topic languages. Twenty-one groups tried one of the cross-language tasks, while ten preferred to remain with the monolingual track. Only eight groups were brave enough to attempt the multilingual track (processing a document collection in five languages is certainly a challenging task) and it should be noted that just two of these were newcomers to CLEF. An additional three groups tackled the experimental interactive task.

The results of CLEF 2001 were reported at a Workshop in Darmstadt, Germany (immediately preceding ECDL2001 – the European Conference on Digital Libraries). Both traditional and innovative approaches to CLIR were presented, and different query expansion techniques were described. All kinds of source to target transfer mechanisms were employed, including both query and document translation. Commercial and in-house resources were used and included machine translation, dictionary and corpus-based methods. The search strategies used varied from traditional IR to a considerable employment of natural language processing techniques. Different groups focused on different aspects of the overall problem, ranging from the development of language-independent tools such as stemmers to much work on language-specific features like morphology and compounding. A number of groups compared different techniques in different runs in order to evaluate the effect of a given technique on performance. In particular, it was noticeable that many groups were testing systems that integrated more than one translation method, e.g. MT or bilingual dictionary look-up combined with a data extracted from a comparable of parallel corpora. Overall, the CLEF 2001 Workshop offered a very good picture of current issues and the approaches now being adopted in CLIR. Recent CLEF Proceedings provide an excellent overview of both the state-of-the-art and the latest experiments in this field [10,] [11], [12].

Plans for 2002

The organization of the CLEF 2002 campaign will be very similar to that of CLEF 2001. There will be five tracks:

There will be seven languages in the multilingual collection as Finnish documents have been added to the existing set of Dutch, English, French, German, Italian and Spanish newspaper and news agency texts. Topics will again be provided in a large number of languages, according to demand. The multilingual task will remain unchanged. The greatest change will be in the bilingual task. In an attempt to encourage work on European languages other than English, it will be possible to query any of the document collections bilingually, with the single exception of English. The only variation to this rule will regard newcomers; they will be given the option of choosing English as their target collection. The domain-specific task will be expanded with more use being made of the controlled vocabulary. The conditions of the interactive task are still to be decided.

More information will be available from the Clef Web site [13].

Looking to the Future

It remains to be seen how long it will be possible to continue CLEF. The current funding from the Commission covers the 2002 and 2003 campaigns. Our aim is to add more languages to the document collections and to include new tasks. We would like to cover not only the major European languages but also some representative samples of minority languages, including members from each major group: e.g. Germanic, Romance, Slavic, and Ugro-Finnic languages. We also hope to include tracks to evaluate CLIR systems working on media other than text. In particular, we are beginning to examine the feasibility of organizing a spoken CLIR track in which systems would have to process and match spoken queries in more than one language against a spoken document collection.

References

  1. Global Internet Statistics, Global Reach
    URL: <http://global-reach.biz/globstats/refs.php3> Link to external resource
  2. Text REtrieval Conference (TREC)
    URL: <http://trec.nist.gov/> Link to external resource
  3. NACSIS Test Collection for Information Retrieval (NTCIR)
    URL: <http://research.nii.ac.jp/> Link to external resource
  4. Peters, C., Braschler, M (2001). Cross-Language System Evaluation: the CLEF Campaigns. Journal of the American Society for Information Science and Technology, 52(12):1067-1072, 2001
  5. DELOS Network of Excellence for Digital Libraries
    URL: <http://www.ercim.org/delos/> Link to external resource
  6. Voorhees, E.M. (2000). Variations in relevance judgments and the measurement of retrieval effectiveness. Information Processing and Management, 36, 697-716.
  7. Zobel, J. (1998). How reliable are the results of large-scale information retrieval experiments? In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1998.
  8. Kuriyama, K., Yoshioka, M., Kando, N. (2001). The Effect of Cross-Lingual Pooling on Evaluation. In Proceedings of the Second NTCIR Workshop Meeting on Evaluation of Chinese & Japanese Text Retrieval and Text Summarization, National Institute of Informatics (NII), ISBN 4-924600-89-X.
  9. Braschler, M. (2001). CLEF 2000 – Overview of Results.
  10. Peters, C. (ed.) (2001). Cross-Language Information Retrieval and Evaluation. Workshop of the Cross-Language Evaluation Forum, CLEF 2000, Lisbon, Portugal 2000. Lecture Notes in Computer Science 2069, Springer 2001, 387p.
  11. Peters, C. (ed.) “Results of the CLEF 2001 Cross-Language System Evaluation Campaign: Working Notes for the CLEF 2001 Workshop, 3 September, Darmstadt, Germany. ERCIM-01-W04, 472p.
    URL: <http://www.ercim.org/publication/ws-proceedings/CLEF2/index.html> Link to external resource
  12. Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.). Proceedings of CLEF 2001. Darmstadt, Germany, 2001. Lecture Notes in Computer Science, Springer (forthcoming).
  13. Clef Web site
    <http://www.clef-campaign.org> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Carol Peters
Researcher
IEI-CNR
Area di Ricerca CNR, Via Moruzzi, 1
56124 Pisa
Italy

carol@iei.pi.cnr.it Link to an email address

Phone: +39 050 3152897
Fax: +39 050 3152810

Carol Peters is a researcher at the “Istituto di Elaborazione della Informazione” an Institute of the Italian National Research Council in Pisa. Her current research interests are focused on the multilingual information access area. She has collaborated on numerous European Commission projects and is now working on the implementation of the multilingual interface for two digital library projects: ECHO - European CHronicles On-line - collections of historical film archives from four European countries, and SCHOLNET - a digital library to support virtual scholarly communities. She is also Coordinator of the Cross-Language Evaluation Forum (IST-2000-31002), a cross-language information retrieval system evaluation activity sponsored by the Information Science Technologies programme of the EC in collaboration with the US National Institute of Standards and Technology.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Peters, C. "Evaluating Cross-Language Systems the CLEF Way", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/clef/>

-------------------------------------------------------------

Activate: New Access And Services For Cultural Content

By Annette Kelly and Caroline Clery - February 2002

Annette Kelly and Caroline Clery describe the work of the Activate project [1]. Activate, has two primary objectives: to establish a thematic network to local studies collections held in two Irish libraries; and, to provide a methodology and toolkit which will allow other cultural institutions to replicate the results of the project.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

Digital technologies are transforming the way in which cultural institutions are making their holdings accessible to a wider audience. The state of the art in collection management for many cultural heritage institutions is the storage of paper records and artefacts. This results in limitations for access and preservation. Digitisation processes and interactive virtual representations can benefit both the user and the collection manager through the possibility of global access, safeguarding fragile originals, overcoming geographical peripherally, and developing links to similar content in thematic networks. Activate aims to trial the technologies which will bring small local cultural collections into the greater information society, by enabling new access modes, new management and co-operation tools, new thematic networks and by establishing a business case for both providing and using such tools and methodologies.

The Project

ACTIVATE is an EU funded Irish national project which began on July 10th 2001 and will run for one year. The project is funded under the IST Programme accompanying measures specific to technology take-up measures Key Action 3 “Heritage for All” initiative.

The project is testing the use of new technology to open up new ways of accessing cultural content and records on the Internet.

There are 5 Irish partners cooperating in the ACTIVATE consortium:

The Library Council is the coordinator of the project and is also the national coordinating body for libraries. The Digital Media Centre and Silverstorm Ltd. are the technical partners or the “technology providers”, The Digital Media Centre is a university department and Silverstorm Ltd. is a SME. The two local library services are “technology users”. Thus, the consortium can be defined as consisting of technology providers and technology users, the aim of the project being to produce a toolkit and methodology from which it is easy for the non technologists to gain benefit.

Existing content held in the two library services is being digitised, made available on the web and a thematic network is being developed as a research model. The two libraries and the technology partners work closely together to develop the ultimate result of the project: the ACTIVATE template which can be adapted and adopted by others to use.

The two local authority library services that are taking part in pilot- work to form an eventual template provide the project with useful and diverse examples of cultural content: Cork City is an urban library service which will focus on online local history documentation of the historical Shandon area of Cork. Clare County is a rural library service that is working to reveal and promote the landscape and history behind the now deserted Scattery Island, off the Clare coast.

Virtual access will be opened to conservation sites of cultural and historical importance. As part of the project, the Digital Media Centre will build a virtual reality model of an historical landscape (Scattery Island) to provide a new way of accessing existing rich stores of cultural content concerning the landscape.

Each step of the process is being documented and the appropriate templates and tools provided, to produce a ready-made, easy to use “out-of-the-box” solution for cultural providers such as museums, galleries, archives and libraries across Europe who are interested in replicating the project’s work. The template and documentation will be available for these institutions to download.

The project’s target audience is similar groups of cultural institutions in countries across Europe. The groups may have an interest or theme which they wish to exploit through the use of web technologies. The ACTIVATE template would be of use to them if they lack technical resources for maintaining such a network. In the short–term the project aims to expand its user base to other libraries in Ireland, but the nature of the ACTIVATE output is such that it will be useful in other countries and for organisations other than libraries.

Current Position

Activate is now at the midway point in the project. The first draft of the portal and content site templates are available from the Web site [1]. All partners are involved in the validation process which is designed to find and iron out problems with the sites and documentation produced to-date. The sites went live on 21 December 2001 and are being added to and amended on a weekly basis. The VRML content is being finalised and will be linked to sets of cultural information.

The project team are committed to developing a methodology and templates which will be user friendly for content providers and end users. Comment is very much welcomed and would be particularly useful now during the evaluation stage.

The Benefits of the ACTIVATE Project:

References

  1. Activate Site
    URL: <http://www.activate.ie/> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Annette Kelly

An Chomhairle Leabharlanna/The Library Council,
53 & 54 Upper Mount St.
Dublin 2.

akelly@librarycouncil.ie Link to an email address

Phone:00353-1-6761167
Fax: 00353-1-6766721

Caroline Clery

An Chomhairle Leabharlanna/The Library Council,
53 & 54 Upper Mount St.
Dublin 2.

Phone:00353-1-6761167
Fax: 00353-1-6766721

cclery@librarycouncil.ie Link to an email address

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Kelly, A. and Clery, C. "Activate: New Access And Services For Cultural Content", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/activate/>

-------------------------------------------------------------

eIFL - Electronic Information for Libraries

By Monika Segbert, Anna Maria Balogh, Rima Kupryte and Darius Cuplinskas - February 2002

The authors outline one of the major OSI Information Programme projects, the eIFL (Electronic Information for Libraries Direct) project whose aim is to facilitate affordable access to electronic journals for libraries and their users in developing countries. Its main efforts are directed at negotiating acceptable deals with publishers while forming a multi-country consortium that will ensure sustainability.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

The CULTIVATE network fosters international co-operation. Its latest node, Cultivate Russia, demonstrates commitment to bridging the digital divide by encouraging knowledge transfer, networking and partnerships. These are the common denominators with the Soros Foundation’s Open Society Institute, which - over 10 years - has addressed the issues concerning library modernisation in countries in transition, especially in Central and Eastern Europe and the former Soviet Union. Its Networked Library Programme helped to alleviate the effect of the momentous political and economic changes in these countries, and the impact of globalisation of the electronic information market on libraries in countries in transition.

This paper outlines one of the major OSI Information Programme projects, the eIFL (Electronic Information for Libraries Direct) project whose aim to facilitate affordable access to electronic journals for libraries and their users in developing countries. Its main efforts are directed at negotiating acceptable deals with publishers while forming a multi-country consortium that will ensure sustainability.

The paper also argues that potential but unexplored synergies exist between EC funded support actions – such as CECUP, TRIS [1] and PULMAN – and the eIFL network.

The Open Society Institute of the Soros Foundation

At the most fundamental, philosophical level, the concept of open society is based on the recognition that people act on imperfect knowledge and that no one is in possession of the ultimate truth. In practice, an open society is characterized by the rule of law; respect for human rights, minorities and minority opinions; the division of power; and a market economy. Broadly speaking, open society is a way to describe the positive aspects of democracy.

The term "open society" [2] was first proposed by French philosopher Henri Bergson (1859-1941) and developed further by the Austrian philosopher Karl Popper (1902-1994) in his 1945 book Open Society and Its Enemies.

Karl Popper and George Soros discussing Open Societies
Karl Popper and George Soros discussing Open Societies

The Open Society Institute, Budapest is a privately run, grant-making foundation that develops and implements a range of programs in civil society, culture, education, media, public health, and human and women's rights, as well as social, legal and economic reform. OSI Budapest is part of the Soros foundations network, an informal network of organizations established and supported by investor and philanthropist George Soros, and active in more than 50 countries world wide. Together with the New York-based Open Society Institute, OSI Budapest provides support and assistance to Soros foundations in Central and Eastern Europe and the former Soviet Union, Guatemala, Haiti, Mongolia, and Southern and West Africa.

The OSI Information Programme

The OSI Information Program is based on three premises. Firstly, that human beings are not passive subjects or just economic agents seeking personal gain, but civic beings who share a world which they have the power to shape. Secondly, that the ability to exchange ideas, knowledge and information is the lifeblood of citizenship and participation in a shared public sphere. And thirdly, that while traditional media remain essential to citizenship, new digital technologies hold the potential for enhancing civic life that is still largely untapped. They also entail dangers that are not yet fully understood.

Knowledge is not sufficient to create open societies. But given adequate economic resources, and a serviceable legal and institutional environment, access to knowledge in all its forms is possibly the single most important factor in determining the success or failure of open society.

Over the past five years or so, the predecessors to the Information Program have done much to provide open access to knowledge and information in previously closed societies. The Internet Program has helped to introduce internet connectivity in more than 35 countries, and has been at the forefront of funding for internet policy for human rights and independent voices online. The Network Library Program has helped libraries transform themselves into truly public, service-oriented centers for their communities. The Center for Publishing Development aided publishers in post-communist countries in the transition to market-based publishing, supported several thousand translations of core books for education and public debate, and nurtured a new electronic publishing industry in the region.

Bridging the Digital Divide – the eIFL Project

OSI wanted to support a venture that targets traditionally “information-starved” countries - access to the widest possible range of information is obviously a major factor in fostering the development of open societies. The eIFL (Electronic Information for Libraries) [3] consortium provides a powerful structural solution to the ‘digital divide’ in content access. Much of the most useful online information is expensive and unaffordable to users in poorer countries. Most of the participating countries had very limited access in the past to the wealth of information available from international scholarly journals, especially in the social sciences and humanities. This was not just for political reasons but also because of the prohibitive cost to these libraries in obtaining even a small number of such journals in print format.

The project is based on leveraging two features of digital information. Firstly, the Internet network effect enables demand aggregation on a very large scale: a large consortium of individually poor consumers acquires significant negotiating power. Secondly, the marginal cost of digital data is zero, so that additional units of high-value digital commodities produced for affluent markets can be re-sold to less affluent markets at negligible cost to the producer, as long as the transaction cost to the producer is kept low.

In late 1999 OSI published a tender for access to electronic journals in social sciences and humanities, which, after independent evaluation, was won by the EBSCO Company. This initiative, called Electronic Information for Libraries Direct - eIFL Direct, makes information available electronically to libraries (academic, research, medical, public, national, and parliamentary), ministries, and non-governmental organisations (NGOs). Users include scholars, policy-makers, government officials, students and the general public. To date well over 3 million searches have been performed and this number is increasing dramatically by the day. Along with Guatemala and Haiti, 27 countries are from Central and Eastern Europe (mostly former Soviet satellites), and ten are from southern Africa. The project is currently expanding to Western African countries and Indonesia, as well as to other countries that have expressed interest.

Building of National Library Consortia

The project was conceived and initiated centrally by OSI, but it became clear very quickly, that its sustainability hinged very much on maximum involvement of the stake-holders, e.g. those libraries who were expected to fund and promote the project from national sources and to local users. A key workshop in November 2000 brought many of the representatives of the participating countries together to discuss the best way to sustain the project. The logical conclusion of keeping in mind the two principal features of leveraging digital information, was to form national consortia, who would be responsible for fund-raising, training, marketing, and provide up-to-date feedback on the needs of its users for existing and future content. However, forming and running consortia requires a sophisticated level of management and co-operation, which many countries in the OSI network had not yet been able to acquire. Much of the effort of developing the eIFL project throughout 2001 was therefore devoted to helping the creation of national consortia, through meetings, grant schemes, training and the strategic advice of Task Forces representing the regions in the eIFL network.

eIFL.Net: The Global Consortium of Consortia

Levering the combined purchasing power of many led to the decision to form the world’s biggest library consortium: EIFL.Net is an international consortium of library consortia that will lead, support, motivate, and advocate for the wide availability of information by library users in member countries in transition.

It will: (1) assist in the building of strong national consortia; (2) be the premier multi-country negotiator for securing affordable commercial electronic information services; (3) strongly advocate for the development of local digital resources; (4) provide a strong central business relationship with content providers; (5) leverage multi-national expertise and resources to expand the access to information; (6) provide top quality educational and consulting services; (7) be an advocate for the adoption and advancement of effective information distribution models; and, (8) develop model partnerships with global funding agencies, foundations, consortial groups, and content providers.

Its goals are to lead, encourage and support the building of national consortial agencies within each of the member countries; to provide libraries, through their national consortia, with expanded availability of electronic content through the highly effective acquisition, use and management of information resources; to provide member consortia with educational, consulting and marketing programs and services that are highly responsive to the needs of the membership; to act on behalf of national consortia and the libraries they represent to influence and change the evolving information environment and to effect the pricing models and technological standards and practices related to the use of and access to electronic information resources.

The eIFL project now encompasses more than 40 countries with a total population of about 800 million. The larger it is, the greater its negotiating power with suppliers. Of the 200 or so countries in the world, about 150 cannot afford broad, direct access to information resources of the kind EIFL is providing, and the project could in principle expand to encompass most of these countries. Already well over 2 000 institutions now take part in the eIFL project, who collectively have performed over 4 million searches, and the number continues to grow.

Training

Training librarians involved in the eIFL project has – to no-ones real surprise – turned out to be one of the key factors of success. Currently a training programme is being designed that will strengthen the capacity to sustain the eIFL project and to take an active part in eIFL.Net. The overall aim of the training program is to ensure that all countries of the region have a clear understanding of the issues involved in the creation and operation of a library consortium. The initial and primary aim is to assist in the establishment and operation of consortia to make effective use of the eIFL databases, while keeping in mind that longer-term consortia for other purposes should be enabled. The success of this program will be measured by the extent to which the appropriate people in the relevant countries gain understanding and skills necessary for effective consortium operation.

Content

In its initial iteration, the eIFL project is a West-to-South/East conduit for journals and databases in the social sciences, public policy, business and medicine. EBSCO, the provider chosen as a result of an international tender, through its databases offers access to more than 5000 journals in these subjects. This content base is now expanding to include science and technology journals, and richer resources for medicine and public health. OSI, in partnership with the World Health Organization (WHO) is already conducting a pilot called the Research Network Pilot in 9 sites in 8 countries. Five countries are in Africa and 3 in the eIFL consortium include Mongolia, Uzbekistan and Armenia. The information partners in the pilot include SilverPlatter, ISI and Elsevier. The goal is to rollout the project after the pilot has been evaluated to approximately 170 developing countries of the world.

A second function of the consortium will be to act as a conduit for lateral information exchange, facilitating East-East, South-South and South-East content flows. It will also be a platform for developing local digital content which feeds into the consortium. Third, the consortium could become a bulk buyer of low-cost hardware and software applications for its member institutions, and an infrastructure for delivering training. At a later stage, it could begin to function as a network for dissemination of policy knowledge and participation in national and global information policy formation.

OSI is also involved in strengthening the role of alternative publishers of electronic journals, such a, to name but one, the Public Library of Science. A recent meeting in Budapest brought the main players of high-quality but low-cost initiatives together to discuss how to bring about barrier-free access to scholarly publications.

A Thought for the Future: Networked Co-operation

Since the fall of the Berlin wall, the EC has taken many steps to encourage co-operation with the countries of Central and Eastern Europe – mostly with the so-called New Accession Countries, but also with the Russian Federation, and countries of the former Soviet Union. This engagement is acknowledged by George Soros, who is in the process of reducing his commitment to Central Europe, in favour of other countries in Africa and Asia which do not receive as much attention from Europe.

There are many projects supported by the European Union that can and could use the network created through the eIFL network to transfer knowledge and research results, enable closer networking with institutions in the EU, and aid capacity building pioneered in the 10 years of OSI operations in the region. Extending the CULTIVATE network eastwards to Russia in one such example, as is the PULMAN network for public libraries which includes countries in Eastern and Southern Europe. A recent OSI initiative as part of eIFL is to run a series of workshops on electronic licensing in the eIFL countries, following the model created in the EU CELIP project, and run by the then CELIP project manager. A new EC funded action, TRIS (see article in this issue of Cultivate Interactive) plans to actively foster the participation of relevant interest groups that may not otherwise be present in IST. This relates in particular to the participation of non-EU countries, mostly within the PHARE, TACIS and MEDA areas. Despite the availability of significant resources and the active promotion policies undertaken by the Commission, in fact, these areas have experienced difficulties in their involvement in European RTD activities. It is one of the working hypothesis of TRIS that the TRIALS format, because of its lightweight footprint and of its direct concern with results and technology transfer, may represent an optimal vehicle for the involvement of these countries and potentially a bridgehead for a low-risk inclusion of these areas into the 6th FP. And then, when we look ahead, we see other EC projects which will have increasing relevance for the development of Open Societies in the OSI countries, such as PSINet [4] a wide-ranging preparatory action under the European Commission's e-Content programme. It has been following up the 1999 Green Paper on Public Sector Information and supporting the overall aim to improve access to Europe's Public Sector Information (PSI).

Much has been done by OSI, especially through its networked library programme, and now within the activities of eIFL, to level the playing field for libraries and their users in disadvantaged communities around the world. Much remains to be done – lets do it all together.

References

  1. The TRIAL projects and their accompanying measure TRIS, Monika Segbert, Cultivate Interactive
    URL: <http://www.cultivate-int.org/issue6/tris/>
  2. Quote from OSI Web site
    URL: <http://www.osi.hu/> Link to external resource
  3. eIFL Web site
    URL: <http://www.eifl.net/> Link to external resource
  4. Public Sector Information Network
    URL: <http://www.publicsectorinfo.com/> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Monika Segbert MBE FLA (hon)
Project management and consultancy

info@monikasegbert.com Link to an email address
<http://www.monikasegbert.com/> Link to external resource

Anna Maria Balogh
eIFL Project Coordinator

abalogh@osi.hu Link to external resource
<http://www.eifl.net/> Link to external resource

Rima Kupryte
Manager, OSI Networked Library Programme

kupryter@osi.hu Link to an email address

Darius Cuplinskas
Director, OSI Information Programme

cuplinsk@osi.hu Link to an email address

eifl logo

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Segbert, M., Balogh, A., Kupryte, R. and Cuplinskas, D. "eIFL - Electronic Information for Libraries", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/eifl/>

-------------------------------------------------------------

Joining European Research through Cultural Heritage Interdisciplinary Projects

By Miloš Drdácký and Jan Válek - February 2002

Miloš Drdácký and Jan Válek report on ARCCHIP [1], a Centre of Excellence Project supported by the European Community. The project uses workshops as a way of exchanging knowledge, primarily on the complexity of problems of safeguarding cultural heritage including its social and economic aspects, documentation, interpretation, presentation, degradation and protection, between EU and CEE countries. ARCCHIP [1] facilitates new contacts, projects and networks.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

Civil engineering and architecture are the main fields of application for research results of the Institute of Theoretical and Applied Mechanics (ITAM) [2] of the Academy of Sciences of the Czech Republic. This tradition is now more than eighty years old, during this time increased attention has been paid to safeguarding the built heritage and its integration into the life of contemporary society. The complexity of problems of safeguarding cultural heritage calls for modern interdisciplinary approaches which have been implemented in a systematic way since 1995 by the foundation of a new department in the Institute – the Associated Research Centre for Historic Structures and Sites – joining together researchers from two institutes of the Academy of Sciences. The process has continued, because of success in grant project competitions and now even more scientists from other research institutes and universities are involved into this research. There are two research units, one in Prague and another in Telc. Since 2000, ITAM [2] has been supported by the EC 5th Research Framework as one of the Czech Centres of Excellence through ARCCHIP.

ARCHIP Programme

ARCCHIP programme [1] consists of five groups of work packages:

  1. Co-ordination, management an operation measures (CORNELIA)
  2. Advanced research workshops (ARIADNE)
  3. Medium term education activities (METEOR)
  4. Twining arrangements and visits (TACITUS)
  5. Special programmes (PATRIMONIA, HESTIA, GOLEM)

Workshops ARIADNE (Advanced Research Initiation Assisting and Developing Networks in Europe) are a week long working meetings for experts from the EU and ECC countries. The topics of these workshops are highly relevant to the needs of current cultural heritage requirements for their protection. Some previous and upcoming workshops include:

A01 Social and economic integration of cultural heritage April 23-29, 2001

A02 Cultural heritage in local and regional social and economic stability May 9-15, 2001

A03 Cultural heritage in urban areas June 11-17, 2001

A04 Vulnerability of cultural heritage to hazards and prevention measures Aug. 18-24, 2001

A05 Documentation, interpretation, presentation and publication of CH. September 1-7, 2001

A06 Degradation of cultural heritage in surrounding environment October 20-26, 2001

A07 Indoor climate and tourism effects November 12-18, 2001

A08 Biodegradation of cultural heritage December 10-16, 2001

A09 Historic materials and their diagnostics February 4-10, 2002

A10 New materials for safeguarding cultural heritage April 22-28, 2002

A11 Historic structures May 20-26, 2002

A12 New technologies for safeguarding cultural heritage June 1-7, 2002

The structure of the workshops is based on position papers of invited top specialists with the aim of elaborating on state of the art conclusions and to clearly formulating the most up-to-date problems in the area. These documents and profound discussions are utilised for establishment of core networks in the selected areas. These networks are established in order to enable continuous co-operation and mutual information on the topics under discussion and creation of a virtual international centre for cultural heritage research.

METEOR (Medium Term Educational Opportunity and Research) is a 2 week training course consisting of clusters of up-to-date problems provided by the Institute with support of leading experts from EU and CEE countries and is intended for participants mostly from CEE countries. The training sessions are completed with a possibility of 3 months stays for research students and researchers who can be housed by the Centre (ARCCHIP) and participate on its on-going research projects.

TACITUS programme (Twining Arrangements and Research with Co-operating Institutions and Universities) enables visits or stays for research students or researchers from the EU or CEE countries in any of the co-operating institutions.

The preferred research topics involve environmental effects on cultural heritage objects and diagnostics of their integrity damages (e.g. investigation of local failures and damages, monitoring, remote sensing and deterioration of structural integrity, warning systems, loading spectra from technological seismicity and tourism), planning and management tools for historic cities and territories (e.g. methodology of safeguarding planning for historic sites, documentation and information systems for small historic cities – GIS based, micro-regional concept of territorial and urban planning, ICT in spatial evaluation of historic cities), historic structures and materials under repeated loading (e.g. behaviour of historic carpentry timber joints of roof frameworks, fatigue behaviour of historic stones under repeated action of forces or restricted volumetric changes), survey of historic paintings by means of modern non-invasive methods (e.g. detection of delaminated parts of historic lime plasters by means of differential vibration sensitivity and/or thermal conductivity), new materials for conservation and methods for testing historic materials (e.g. small sampling techniques and testing of historic materials – namely mortar, stone and wood, mechanical characteristics of historic materials and influence of consolidation, fibre reinforced mortars for repair and safeguarding of art monuments), fibre plastics composite and net reinforcement for historic masonry.

PATRIMONIA, HESTIA and GOLEM are aimed to enlarge scope of results with a wide practical application

PATRIMONIA is aiming at the rise of cultural heritage awareness of people and understanding heritage protection as a part of total environmental protection. It is a project for youth which concentrates on cultural heritage in the relations between people, between mankind and nature and between people and technology as a message for and source of approaches to sustainable development.

HESTIA (Heritage for Economic and Social Stability and International Adhesion) is designed to prepare pilot programmes based on regional and micro-regional traditions, resources and technologies in order to ensure economic and social stability and sustainability of small historic cities.

GOLEM (Gradually Optimised Low Energy Materials for construction in the CEEC) is intended to prepare pilot programmes based on traditional eco-efficient technologies and to increase awareness of citizens to cultural heritage and create a basis for small regional industry.

Advanced Research Workshops and Interdisciplinary Research into Cultural Heritage

Eight workshops A01-A08 from the aforementioned list were organized in 2001. At those workshops 58 scientists participated from 16 EC countries (Austria, Belgium, Denmark, Finland, France, Germany, Greece, Ireland, Italy, Luxembourg, Netherlands, Portugal, Spain, Sweden, United Kingdom), 58 scientists from 13 Associated countries (Bulgaria, Cyprus, Czech Republic, Estonia, Hungary, Latvia, Lithuania, Malta, Norway, Poland, Romania, Slovakia, Slovenia) and 5 researchers from 4 Mediterranean countries (Bosnia Herzegovina, Israel, Tunisia, Turkey).

During the workshops, there emerged many common themes for further research in the field of cultural heritage, which are of a great relevance to all European countries and which have a high importance and strong impact on European economic and social development and stability. In conclusion despite of some national research funding in individual European countries, many problems are to be solved effectively and competently only through international co-operation.

The workshops as well as being an opportunity for active participation of the ARCCHIP Director at several international events were therefore oriented towards future development of the European Research Area and the scientific activities in the field of cultural heritage. As an example, the declaration from the first workshop, which was sent to representatives of the EU Parliament, is presented here:

“As participants in the ARCCHIP workshop [1], held in the Czech Republic during April 2001, we would like to make representations about the Draft 6th Framework Programme. The workshop in Prague brought together representatives from 15 European countries, including 7 associated countries. The purpose of the workshop was to discuss the possibilities for research into the cultural heritage under the current 5th Framework document, particularly the topic of economic and social benefits of cultural heritage. The workshop has demonstrated that this topic is of very great relevance to Europe. Understanding of cultural heritage is central to European identity.

Having noticed that the draft of the forthcoming 6th Framework Programme document does not mention cultural heritage, we are afraid that research into the cultural heritage will be discontinued. Without such research, all of us will lose an opportunity to strengthen European unity.

Therefore, we would like to recommend that the forthcoming 6th Framework Programme includes:

The contents and priorities of the new 6th Framework Programme are extremely important for CEE countries which are mostly building their new research orientation and therefore the inclusion of cultural heritage issues into the 6th Framework has a very strong stimulating role. On the other hand, its unclear position may cause catastrophic consequences in national research policies of some CEE countries where the European approach will be reflected.

Conclusions

The experience from our first eight workshops confirms that mutual exchange of knowledge between EU and CEE countries helps substantially to discover potentials for future collaboration and encourages formulation of new joint projects. The workshops initiated twelve new proposals for the 5th Framework Programme autumn calls. The professional variety of the invited participants which extended from universities, state supported research institutions to private sector representatives has proved to be very constructive as it condensed in a balanced identification of future scientific problems oriented to basic and applied research.

The ARCCHIP Centre has recently been asked to start retaining contacts, and so we are building our own database and information system and maintaining our Web site [1], with as much actual data as possible. This database aims to become a key source of information for networking arrangements between the EU and CEE countries in the field of Cultural Heritage. The state of the art reports that have resulted from the ARCCHIP programme are published there.

References

  1. ARCCHIP Web site
    URL: <http://www.arcchip.cz/> Link to external resource
  2. ITAM Web site
    URL: <http://www.itam.cas.cz/> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Dr. Miloš Drdácký
Director of ITAM and ARCCHIP
ITAM, Czech Academy of Sciences
Prosecka 76, 190 00
Praha 9, The Czech Republic

drdacky@itam.cas.cz Link to an email address

Miloš F. Drdácký (1945) is the director of the Institute of Theoretical and Applied Mechanics of the Academy of Sciences of the Czech Republic, where he has been working for 34 years. He has been engaged as an external reader in several Czech Universities since 1975, Head of the Town Architect Office in the World Heritage City Telc (1993-1998). His research activities include theory of structures, experimental mechanics and testing, performance of historic structures and materials, safeguarding plans and management of historic sites including risk analysis (tourism, neglecting maintenance, degradation, inappropriate conservation etc.). Results are published in more than 200 publications and research reports and he edited 15 books or International Conference Proceedings

Dr. Jan Válek
Head of Associated Research Centre for Historic Structures and Sites
ITAM, Czech Academy of Sciences
Prosecka 76, 190 00
Praha 9, The Czech Republic

valek@itam.cas.cz Link to an email address

Jan Válek (1970) graduated in Building Construction and Architecture at Czech Technical University in Prague. Following his PhD research he collaborated on several projects including preparation and co-writing a review of the scientific, technical and conservation literature on ‘Mortars in Historic Buildings’ commissioned by Historic Scotland. His expertise covers conservation of stone masonry, compatibility of historic and modern materials, and assessment of degradation connected with conservation of built heritage.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Drdacky, M. and Valek, J. "Joining European Research through Cultural Heritage Interdisciplinary Projects", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/arcchip/>

Cultivate Interactive Issue 6: Regular Articles

At the Event:

Praxis:

Metadata:

-------------------------------------------------------------

DIGICULT Column

By Ian Pigott - February 2002

This section aims to provide news of the European Commission's initiatives in the field of digital heritage and cultural content. Its objectives are to pinpoint the latest developments in programmes, projects and activities and to give a clear picture of progress in the area since the last issue. It certainly does not pretend to be a comprehensive account of what the EC is doing in the area but rather a short summary of some of the key items. The content is based largely on the information provided in the eCulture Newsletter, published by the European Commission, DG Information Society, Cultural Heritage Applications Unit [1].

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Since the last column in October, we have had an active time on all fronts. The IST programme has issued its last Call for Proposals under the present framework programme and we are all beginning to look at the 6th Framework Programme with a view to identifying key research themes for the future. During the last six months we have been working on the Lund Action Plan and on the Brussels Quality Framework. The Belgian Presidency has been very supportive of our activities and very forward looking in the Council Resolution on culture and the knowledge society. Contacts with the Spanish Presidency have been very positive and we hope to continue to develop with them the political vision for eCulture over the next six months.

The Lund Action Plan provides a framework for the implementation of the Lund Principles, a set of recommendations produced and agreed by representatives nominated by Member States to drive and stimulate digitisation of cultural and scientific content across Europe. Significant headway has already been made on plans for implementation: 1- wide diffusion and consensus building around the initiative, both at political and professional level, to be achieved in part by translation into all the official EU languages; 2- the National Representatives Group (NRG) has been established by all Member States for the coordinating group; 3- some specific working groups of experts nominated by Member States have been launched under the coordination of the NRG, with a view to building a common platform on key issues.

In regard to opportunities for projects, the 8th IST call [2] is open until 21 February 2002. While our sector has completed most of the focused work in the area of RTD for scientific and cultural heritage, there are openings for proposals in a number of more general areas: Action Line III.5.1 provides opportunities for disseminating the results from KAIII actions; Action Line III.5.2 looks ahead to RTD roadmaps and collaborative schemes for FP6 while Action Line III.5.3 aims at future paradigms for next-generation knowledge and interface technologies. The Cross Programme Activity V.1.15 CPA15 concerns technology platforms for cultural and arts creative expression.

We would like to draw special attention to Cultivate-list, an email discussion list for anyone in the cultural heritage sector interested in the Information Society Technologies (IST) Programme. If you need details of calls for proposals, if you are looking for partners, if you just want to find out about IST projects - this list is for you. The Archives of the Cultivate E-list can be accessed online [3]. Cultivate has also found a prominent place on the Information Society Discussion List in connection with the launching of the "Your Voice in Europe" initiative [4]. There has been considerable progress on the DigiCULT study. Indeed, after one year of research and extensive input from Cultural Heritage experts, the study on "Technological Landscapes for Tomorrows Cultural Economy", DigiCULT, is in its final phase. The primary objective of this study is to provide archives, libraries, museums and other institutions with recommendations on how to adapt to the technological and organisational challenges of the digital environment. The main topics considered are national policies and initiatives, organisational change, exploitation and technology. Within a few weeks, you will find the executive summary and final report for downloading [5]. The DigiCULT study is commissioned by the Cultural Heritage Applications Unit of the DG Information Society and paid for by the European Commission. DigiCULT co-organised the session on Culture and Community-Building at the IST Conference 2001, held in Düsseldorf in December last year [6].

DLM -Forum has announced an important conference on "@ccess and preservation of electronic information: Best practices and solutions". This third multidisciplinary European DLM-Forum on electronic records will take place at the Palacio de Congresos de Cataluña, in Barcelona, Spain, on 7 and 8 May 2002. The opening of the exhibitions and preconference activities will take place on 6 May 2002. The DLM-Forum 2002 welcomes specialists and executives representing different disciplines from public administrations, archives, ICT industries and research. A large number of participants from the EU Member States, regions, candidate states and other European countries are expected. The DLM-Forum 2002 will be organised by the Secretariat for the Information Society of the Catalan government together with other Catalan institutions and departments of the Spanish central government. Support is being provided by the European Union Presidencies of Sweden (1st half of 2001), Belgium (2nd half 2001) and Spain (1st half of 2002), the European Commission (Secretariat General, DG Information Society) and representatives from the ICT industry. The aspects of short- and long-term preservation, transparency, access and openness of public information will play an important role at the DLM-Forum 2002. The forum aims to achieve concrete results in this area. Notably, it will examine the creation of a Europe-wide network of excellence on electronic archives in order to achieve an even wider cooperation in this area between Member States, regions and at Community level [7].

The DELOS Network of Excellence, funded under Digital Heritage and Cultural Content, in co-operation with the Digital Library Initiative (DLI) of the US National Science Foundation (NSF), is organising a two-day concertation workshop of all projects funded by the EC IST program and the US DLI program in the field of Digital Libraries (DL), also including invited representatives of other relevant initiatives in Europe. The "EC/NSF DL All Projects Workshop" will be held in Rome on 25 and 26 March 2002. For further information contact Tarina Ayazi [8].

Forthcoming Research Programme

Discussions are currently underway on the EU's future research programme (a 6th Framework Programme for Research and Development). The Cultural Heritage Applications unit of the European Commission's DG Information Society is contributing to the cultural part of this discussion. Relying on the presence of many key experts at the successive EVA 2001 conferences in Florence (March), Glasgow (July) and Berlin (November), responses and comments to some basic questions were collected and collated in the form of a "Florence Agenda" followed by the "Glasgow Response" and then, finally, the "Berlin Conclusions".

In line with a general focus on consolidating the development of tools and services for an inclusive approach to raising scientific and cultural awareness for education, quality of life and tourism while highlighting common and specific aspects of cultural identity, more detailed recommendations were made in a number of areas.

In regard to the key aspects of access, content and users, the primary concern is the quality of content, the challenge for the cultural sector being to create high quality and pertinent digital resources (including a stable infrastructure for delivery). This includes the creation of truly integrated digital archives (providing seamless/dynamic access to large volumes of cultural objects and documents of various types, often in the form, of distributed resources). Specific attention should also be given to research investigating methods of improving user-friendly access to scientific and cultural content in accordance with a wide variety of personalised user requirements. The unanswered question of how, why and when users (individuals or communities) interface with digital collections requires detailed assessment, including consideration of emotional and intellectual access to content, especially for those who might otherwise be excluded (cultural poverty). Interaction with digital collections and the ability of individuals or communities to interact with and to add their own creativity also deserves consideration. Care should be taken to attract young people, both as users in their own right and as a vector of future trends. Finally, attention should be paid to access across languages, establishing a dialogue of cultures and building of new online communities.

It is also considered important to develop a common, European view on standards, promoting development and use of open standards, particularly in the context of widely applicable international solutions. Here, efforts should be made to promote a more active role for Europe. Another key issue for unlocking the full potential of scientific cultural resources is the creation of a harmonised European approach to a legal system with adequate protection of IPR.

On the technical front, mobile services will be the next great challenge, requiring incorporation of 3G technologies for services meeting the needs of the cultural institutions and operators. Further enhancement of 3D and VR representations will provide a basis for enhancing representation of real objects. Another challenge will be to support preservation and long term availability of digitally born content. And cultural content will contribute to speeding up technological developments, making the vast resource of authentic materials in digital form available for enhanced and focused research into digital collections as an integral part of an intellectual infrastructure for Europe's future research area.

From an economic viewpoint, there is a need for institutional knowledge resources and skills to be improved and sustained. In addition, it will be important to work towards lowering the cost of ICT applications in order to encourage competitive growth. Co-operative efforts across the whole of Europe should include not only culture but also research and creativity and should target job creation is an important objective. Culture is increasingly considered a strategic element for business in the context of culturalising the economy. Institutions must start to consider themselves not just as cultural actors but should take on an economic role. The economic potential of broadband access to the internet should also be borne in mind.

The importance of the European dimansion was also stressed. Only a truly co-operative and integrated approach would allow for a meaningful European digital cultural and scientific landscape. All European countries, not just the existing Member States, have key assets for cultural tourism calling for a co-ordinated approach across the continent.

Finally, the key stakeholders - whether from universities, research institutes, cultural institutions and large and small enterprises - should be seen as participants in the creation and evolution of pan-European laboratories of knowledge.

Project Developments

Turning to news of our projects and related developments, Europe's public libraries and cultural organisations have a vital role to play in the development of an e-Europe. The PULMAN Network of Excellence (Public Libraries Mobilising Advanced Network) now includes representatives of 26 European countries. The PULMAN approach is inclusive and participation will be extended, in the first instance by the establishment of wider groups of activists in each country. Activities and plans of the PULMAN Network are presented on the web, including guidelines manuals, a policy conference, international cooperation agenda, training workshops [9]. PULMAN Express, the first issue of its newsletter, is published at regular intervals throughout the year. free. To subscribe, visit the PULMAN web server and fill in the on-line form [10]. The first newsletter is now available [11].

The SCHEMAS project is recently hosted a workshop to review ways in which projects can share information about metadata use, and consider how collaboration can be rendered more effective. Participants were updated on recent developments regarding use of metadata schemas and application profiles. The role of registries in providing access to information about schemas was reviewed, and the SCHEMAS registry demonstrated. Presentations from the day are available on the Schemas site [12].

RENARDUS is a collaborative project that aims to improve academic users' access to a range of existing Internet-based information services across Europe. The latest project developments are available in the news digest [13].

The DIFFUSE project has developed an excellent and potentially comprehensive reference to projects dealing with standards issues, including short but effective project descriptions. All projects should visit the site and make sure that the information presented is correct. Projects are able to register or submit updates themselves [14].

The TRIS Accompanying Measure is to provide services aimed at increasing the coordination, impact and dissemination of the 25 take-up trials selected under the 4th IST Call for Proposals and funded under Digital Heritage and Cultural Content. In particular, it will provide projects with assistance on clustering, maximising impact, exchanging experiences and success stories, and supporting and facilitating the execution of IST TRIAL actions by encouraging standardisation, synergy, technology transfer and exploitation [15].

The MULTIMOD project started on October 2001 and will last three years. It will focus on improving the human-machine interface with particular reference to biomedical applications. The target of the demonstrator applications will be the musculo-skeletal apparatus [16].

References

  1. eCulture Newsletter
    URL: <http://www.cordis.lu/ist/ka3/digicult/en/newslett er.html> Link to external resource
  2. 8th IST Call
    URL: <http://www.cordis.lu/ist/calls/200104.htm> Link to external resource
  3. The Archives of the Cultivate E-list
    URL: <http://lists.ukoln.ac.uk/cultivate-list/> Link to external resource
  4. Your Voice in Europe initiative
    URL: < http://europa.eu.int/information _society/services/discussion/index_en.htm> Link to external resource
  5. DIGICULT Study
    URL: <http://www.salzburgresearch.at/fbi/digicult/> Link to external resource
  6. Culture and Community-Building, IST Conference 2001
    URL: <http://2001.istevent.cec.eu.int/december_3-5/session.asp?id=42> Link to external resource
  7. DLM Forum
    URL: <http://europa.eu.int/historical_archives/dlm_forum/index_en.htm> Link to external resource
  8. Tarina Ayazi, email address tarina@iei.pi.cnr.it Link to an email address
  9. PULMAN Web
    URL: <http://www.pulmanweb.org/about/about.htm> Link to external resource
  10. PULMAN newsletter registration form
    URL: <http://www.pulmanweb.org/news/register.asp> Link to external resource
  11. PULMAN newsletter
    URL: <http://www.pulmanweb.org/pulmanexpress/October2001. pdf> Link to external resource
  12. Schemas 4th Workshop presentations
    URL: <http://www.schemas-forum.org/workshops/ws4/progr amme.html> Link to external resource
  13. Renardus News Digest
    URL: <http://www.renardus.org/news/digest10.html> Link to external resource
  14. DIFFUSE
    URL: <http://www.diffuse.org/projects.html> Link to external resource
  15. TRIS
    URL: <http://www.trisweb.org/home.php> Link to external resource
  16. MULTIMOD
    URL :<http://www.ior.it/multimod/> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Ian PigottIan Pigott
Project Officer

Ian.Pigott@cec.eu.int Link to an email address

With the assistance of the Cultural Applications team http://www.cordis.lu/ist/ka3/digicult/en/our_team.html Link to external resource in Luxembourg.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Piggott, I. "DIGICULT Column", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/digicult/>

-------------------------------------------------------------

The TRIAL projects and their accompanying measure TRIS

By Monika Segbert - February 2002

Monika Segbert introduces the TRIAL projects and enlightens us on ACTIVATEd BEASTS that KIST MATAHARI IN VALHALLA. Interested...well read on.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Background

TrisUnder Action Line III.1.5 (Trials on new access modes to cultural and scientific content) the Cultural Applications unit of DG Information Society launched as part of its 4th call in 2000 a programme for ‘Take-Up Actions’. The intention was to launch trial actions across Europe, which will encourage take-up of results and stimulate the implementation of innovative products and services in the cultural heritage sector. The focus of the Action Line was on exploring and experimenting novel ways of creating, manipulating, managing and presenting new classes of intelligent, dynamically adaptive and self-aware digital cultural objects, either held by memory institutions (archives, libraries, museums, etc) or directly involving digitally born objects or art forms. The proposal were to be user-centred and include:

The workplan recommended that focus should be given to the sustainable development of valuable digital repositories in Europe’s libraries, museums and archives, on models for future virtual collections and on guidelines for integrating real and virtual objects and collections. Proposers were asked to provide examples of how dynamic user interaction with the cultural and scientific content can enhance the user experience, addressing the experiences of learning, exploring and entertaining for the user.

As a result of this call 25 new TRIAL projects were chosen. The average duration of the projects is about 12 months. They address a wide-range of user communities (specialist but also general such as children or tourists) and cultural heritage themes. This demonstrates that this is a very worthwhile initiative that has the potential to create a European-wide momentum for innovation not only in the larger cultural institutions, as most of the projects are driven by cultural institutions run by local authorities or by SMEs with local interests. The projects will be interesting to many potential users, as in many cases they will be improving access to cultural assets in museums, archives and libraries, through their use of innovative technologies such as mobiles, digitisation techniques, Internet support.

The TRIAL projects

Playing with the EC acronyms led to the opening line of this article: ACTIVATE will show interesting innovative solutions for accessing cultural resources. It "will also build a virtual reality model of a historic landscape, to provide a new way of accessing existing rich stores of cultural content concerning the landscape". BEASTS will benefit the tourism and leisure industry in Wales: go to the website and read that “eight of the ten SMEs who will be taking part in the trialing have been identified - and represent a wide range of tourism interests. Shops, farms, trekking centres and a riding for those with learning disabilities initiative are included”. KIST will use 3D, audio, animation and video for a digital exploration of the collections of the National Museum of Scotland, whereas MATAHARI will use portable information devices for access to information gleaned from libraries, archives and museums about outdoor objects in order to enhance the visitor’s experience. Finally, VALHALLA is going to provide a resource displaying video & explanation of historic gardens and parks.

The TRIALS meet head on the challenge of cultural institutions taking on ICT, not in an RTD setting, but by experimenting with technology and in partnership between technology providers and (in many cases) smaller cultural institutions – local archives, museums, libraries. These institutions face the challenge of responding to organisational change, of integrating new skills and competences, of meeting the digital challenge. Their motivation is to make cultural content more visible and accessible, to offer their users new ways of interaction and experiences. Those that will benefit from the results of the trials include tourists, teachers and SME’s, historians and scholars, botanists, scientists, the interested citizen, our everyday European life. Particular attention is given to the inclusion and involvement of young people through games and in the creation of content.

Prime examples of involving children are CHOSA, which is developing an interactive web game and a WAP tour, both fun and educational, for access to and awareness of an archaeological site; TREBIS - Trial and Evaluation of a Biodiversity Information System for public use in a natural history museum. This project trials a natural history museum approach to the use of multimedia techniques to enhance awareness of biodiversity, endangered species and ecology. The user community – school children – will be given access to the database and to digital maps. It is a good example of a young Austrian software firm Biogis Consulting partnering with a museum and an educational institution (Institute for Didactis of Biology at the University of Munich and Natural History Museum of Vorarlberg); TPHS is trialling an innovative approach to promoting information on architecture and heritage in that it focuses on information on buildings and related objects which children consider to be of particular interest - it directly deals with Cultura as seen and appreciated by children, in an engaging and playful way.

Another projects aims to bring the sources of history to citizens and tourists - ARCHIVIEW plans to open the resources of historical city archives to broader audiences by integrating easy tools for the management of collections with solutions for creating "narratives" around sources and publishing the results on the Web. This will make a wealth of first-hand information available to those interested in getting to know and feel the past of the towns they live in or are visiting.

A few projects are studying new attractive means to link to town history and historical collections through the resources of virtual reality. VRCHIP, VIRTUAL and HITITE bring slightly different approaches to a common core idea, i.e. showcasing in interactive and fascinating way flagship monuments and resources to make them an access point to the heritage of local identity.

VALHALLA brings an interesting outlook on historical gardens, letting users visit and live them in their rich details of garden architecture in relationship with the buildings that surround them. This is an unusual example of how "minor" but extremely relevant resources of the cultural aspects of towns can be better known and exploited.

And, as a prime example of the rich diversity of the TRIAL projects, the two projects starting with an e- : e-Islam and e-Stage. The one aiming to promote the Islamic collection of the Benaki Museum in Greece by creating digital surrogates of the exhibited items (for example items displayed currently in an exhibition of Glass of the Sultans), the other aimed at setting up a web-based resource on European puppetry.

Time and space are too short to discuss all fascinating aspects of all the other trials not mentioned above in this article (but you will hear more about them elsewhere):

BOOKS2U!: This proposal with an Austrian partnership trials a new approach to inter-library loans which intends to have far reaching impact on necessary improvements across Europe.

CTIC: The partnership, which consists of several UK museums and art galleries, is trialling an online interface enabling users to access cultural content displayed in their collections.

DOMINICO: The project features an Austro-Slovenian partnership to trial innovative technologies for networking smaller museums and exhibition designers as a basis for enhancing a series of exhibitions.

EULER-TAKEUP: Based on the pilot developed through the EULER RTD project, the trial is setting up and evaluating a European digital library for mathematics. The consortium joins partners from Germany, the Netherlands and Italy and addresses the needs of a clearly identified user community.

HYPERGUIDE: The trial builds an XML tool based on a web description methodology, for access to high-value web-based resources in order to enhance selection, filtering and usability of information resources in specific domains.

LAB-VR: The proposal is aiming to improve 3D photographic Internet access to research laboratories and their research activities. Users will be able to view the operational research environment as an interface for gaining further information from the Web..

POUCE: The trial seeks to validate a model for a common access portal to a group of French museums on the Web. The XML-based approach and the exploitation plans are targeted to a sustainable level of service.

SANDALYA: An Italian partnership which will trial the results of previous research dealing with the digitisation of manuscripts. The project has a direct focus on sustainability through both its technical background and active exploitation policies.

SEAX-DAMAS: The project trials a wide range of aspects of archive management in a regional record office relying fully on international standards.

UHI-NMS: The Scottish consortium will trial an approach designed to add value to National Museums of Scotland’s digital content for the National Grid for Learning. Special attention will be given to appropriate pedagogical approaches.

VIRMUS: This project from Latvia proposes to experiment with the use of market-ready 3DML tools in order to enable first-time users to create 3D pages in cultural heritage buildings on the Web. The project aims at a catalytic effect in expanding virtual reality in the museum sector.

More information about each project, and links to individual websites can be found on the TRIS Web site [1].

Accompanying the TRIALS: TRIS

The TRIS Accompanying Measure will cooperate with all TRIAL projects to strengthen and enhance the effectiveness and the benefits of individual actions and projects beyond their own perimeter. The co-ordination, grouping and dissemination activities of TRIS will help the projects to reach critical mass and substantial cultural, scientific and commercial impact. In particular, TRIS will:

TRIS also plans to actively foster the participation of relevant interest groups that may not otherwise be present in IST. This relates in particular to the participation of non-EU countries, mostly within the PHARE, TACIS and MEDA areas. Despite the availability of significant resources and the active promotion policies undertaken by the Commission, in fact, these areas have experienced difficulties in their involvement in European RTD activities. It is one of the working hypothesis of TRIS that the TRIALS format, because of its lightweight footprint and of its direct concern with results and technology transfer, may represent an optimal vehicle for the involvement of these countries and potentially a bridgehead for a low-risk inclusion of these areas into the 6th FP.

Within the EU, TRIS aims at providing a contribution to the core topic of transferring the results of RTD into the mainstream of territorial activities funded under the Structural Funds in Objective Areas (over EUR 120,000 million targeted at sustainable territory development and at fostering employment). The exploitation of cultural heritage and the interrelationships of culture and tourism are considered to be key to developing a sustainable culture economy that may represent, at least in a large number of areas, one of the most powerful engines for local growth and for the promotion of local identities.

A major role in this process will be played by the capability of European research to shape and to qualify structural activities, by providing models, standards, plans and best practices that can be quickly and effectively deployed throughout the Objective areas. TRIS will provide hints, contacts and active support to promote and to follow-up this process, helping consortia and local policy makers leverage the convergence between the results of the projects and the operational programmes of selected regions.

TRIS is currently planning the first major TRIALs event during the EVA Florence conference (25-29.3.2002). For details about this and more detailed information and news about the TRIALS and TRIS visit the Web site [1].

References

  1. TRIS
    URL: <http://www.trisweb.org> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Monika Segbert MBE FLA(hon) Dipl.Bibl.
Project Management and Consultancy
Via Fondiglie 5-7
60030 Rosora (An)
Italy

tris@monikasegbert.com Link to an email address
<http://www.monikasegbert.com> Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Segbert, M. "The TRIAL projects and their accompanying measure TRIS", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/tris/>

-------------------------------------------------------------

At the Event:

Cultivate-Russia Kick off Meeting, 14th - 16th January 2002

By Marieke Napier - February 2002

Marieke Napier reports on the Cultivate-Russia Kick off Meeting, held in Moscow in January.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

In January I traveled to Moscow to participate in the Cultivate-Russia [1] kick off meetings. Getting Cultivate Russia off the ground had taken over 18 months of hard work by Monika Segbert and David Fuegi and I think the enthusiasm of everyone present was evident.

All the meetings and workshops were held in the Darwin Museum in central Moscow. Employees of the Russian Cultural Heritage Network (RCHN) are based at the museum and it was a great venue for the launch. We actually got a tour round the place on the Tuesday and I can honestly say I've never seen so many stuffed animals in such a small place!

Red SquareThe main participants at the meeting were the British Council team from Moscow, the British Council is the principal contractor and will be responsible for the administrative/financial co-ordination function, representatives from the five Russian partners, the European technical partners and members of the organising group.

Monday was just a general kick off meeting mainly for the partners and some representatives of regional museums, libraries and archives. Links with Cultivate-CEE and Cultivate-CEE were also discussed. Having Russia participate in an EC funded IST project is a new experience and during the 3 days it became clear that there are going to be lots of financial, cultural and political differences to the usual set up, which will hopefully make Cultivate-Russia a really interesting project.

On Tuesday Walter Koch of CSC-Cultural Service Centre Austria talked about the obligations of a National Node in this type of networking project. He also demonstrated his Document server. In the afternoon Jorunn Hesjedal and Sigrid Tollefsen from RBT in Norway talked about the policy document they have been working on and gave some pointers to useful EC/IST Web sites. There was also a presentation from Kirill Nasedkin of RCHN on the www.museum.ru [2] Web site. Two articles have previously been published in Cultivate Interactive on RCHN but their Web site has been substantially updated recently and is worth revisiting [3]. The RCHN attempts to bring together information professionals and promote culture by the use of new technologies. The Web site gives information on over 3000 Russian museums, provides a database of 6000 museum professionals and hosts an impressive, highly active discussion group. All of us European 'experts' were all very impressed, I know we would be very lucky to have such a resource in the UK. Throughout the day all presentations were translated into Russian or English by an excellent translator who was not only fluent in both languages, but also had a fantastic memory and did a great job on some very long speeches.

On Wednesday I started the morning with a presentation and demonstration of Cultivate Interactive. We (the technical partners from the Cultivate-CEE and Cultivate-EU projects) had been asked over to Russia to explain and demonstrate the work we have carried out so far because Cultivate-Russia is intending to do its own technical work. This means that they will be creating a Russian version of Cultivate Interactive in Cyrillic, it will probably be managed by Olga Puchnina (RCHN). We will be providing more information on the Russian magazine in future issues of Cultivate Interactive. There were lots of questions on all aspects of the Web magazine work and there are definite plans to translate some of our articles into Russian. In the afternoon Martin Belcher and Paul Smith from ILRT, based at the University of Bristol, showed the main Cultivate site.

At the end of the 3 days the Russians gave us all a hearty send off, made us all promise to come back and surprised us with presents of Cognac and wine.

Dinning at o'Pirosmani Restaurant
Dinning at O'Pirosmani Georgian Restaurant

Moscow city was beautiful, I got a chance to visit all round red square and managed to see Swan Lake at the Bolshoi Theatre. The food was very good, and I particularly enjoyed the Georgian meal at O'Pirosmani, a Georgian restaurant opposite the Novodievitchi monastery; where we even had our own live musical accompaniment. The weather was a lot warmer that we'd all expected at around 0 degrees and Aeroflot got me there and back safely!

I would just like to thank all the members of Cultivate-Russia for making all of our trips to Moscow so enjoyable. Good luck with the project!

References

  1. Russia Joins the Cultivate Family, David Fuegi, Cultivate Interactive, issue 6, 11 February 2002
    URL: <http://www.cultivate-int.org/issue6/russia/>
  2. Russian Museums Online
    URL: <http://www.www.museum.ru/> Link to external resource
  3. Cultural Heritage Networking in Russia: Permanently Upcoming Perspectives by Dmitriy Luchkin, Cultivate Interactive, issue 2, 16 October 2000
    URL: <http://www.cultivate-int.org/issue2/russian/>
    Developing Russian Museums Online by Dmitriy Luchkin, Cultivate Interactive, issue 3, 29 January 2001
    URL: <http://www.cultivate-int.org/issue3/russian/>

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Marieke NapierMarieke Napier
Information Officer
UKOLN
University of Bath
Bath
England
BA2 7AY

m.napier@ukoln.ac.uk Link to an email address
<http://www.ukoln.ac.uk> Link to external resource

Marieke Napier is editor of Cultivate Interactive Web magazine.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Napier, M. "Cultivate-Russia Kick off Meeting, 14th - 16th January 2002", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/moscow/>

-------------------------------------------------------------

Praxis

-------------------------------------------------------------

Content-Based Multimedia Information Handling: Should we Stick to Metadata?

By Paul Lewis, David Dupplaw and Kirk Martinez - February 2002

Paul Lewis, David Dupplaw and Kirk Martinez discuss retrieval and navigation as ways of accessing multimedia information and the use of content as an aid to these activities. They ask whether content-based techniques are really making a useful contribution or whether we should restrict ourselves to the use of metadata.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

First some definitions. Multimedia information is digital information which may be visual data, images or video for example; or it may be sound data, music or speech; and it may now include 3D visualisations or mixed reality experiences. Finally it will almost certainly include the medium with which we are most familiar, that is text.

In this article we use the word “document” to refer to a multimedia object. It may be a collection of text, an article or a book, it may be an image or a video, or a frame of a video, it may be a mixture of these, in fact it may be any type of basic digital information object.

The issues we are going to discuss apply to text as much as to images or other media and it will be useful to establish the ideas by talking first about text on its own. Let us begin by distinguishing between retrieval and navigation. Retrieval is the business of extracting a document from a collection in order to satisfy some query. The query may take a variety of forms. For example, we may require documents by a particular author, or about a particular subject. This sort of retrieval has traditionally been achieved by using indexed metadata that is stored with the document. Key terms in the metadata may give a controlled vocabulary to aid the retrieval.

Content-based retrieval of text is retrieval that uses the text of the document rather than any added metadata. Free text searching is a good example of content-based text retrieval. The words making up the content of the document are indexed and used as the basis for retrieval, sometimes in conjunction with quite sophisticated “intelligent” software used to satisfy the query. Search engines like Google and AltaVista offer content-based text retrieval on the Web.

By contrast, navigation is the process of moving from one document in the information collection to another because there is some useful association between them, and this is typically achieved by following pre-authored links. On the Web this is achieved by clicking on a highlighted source anchor of a link in one document in order to navigate to the destination document to which it points. Sometimes the distinction between navigation and retrieval is unclear. For example, following links that are stored in a bookmark file under a particular subject heading could be regarded in one sense as indexed retrieval and in another sense as link-based navigation. This is also true when using a search engine to retrieve documents on a particular subject. The documents are presented initially as links to be followed. In both these examples we will regard the process as retrieval rather than navigation, as the aim is to retrieve rather than to follow an association between documents.

On the Web, navigation is mainly based on fixed links that are embedded in the documents themselves. However, it is possible for hypermedia navigation to be content-based. By this we mean that the links offered are determined at link following time and selected on the basis of the content of the chosen source anchor. Link authoring for content-based navigation involves making an association between some chosen source anchor and the address of a destination document. The link information may be stored in a separate location from the document, typically a linkbase holding source anchors and destination addresses. With this content-based approach to navigation, multiple links may be made available for a given source anchor, previously authored links may be added to new documents on the fly with minimal effort and different viewers may see different link sets depending on the linkbases which are active at the time [1].

In both content-based retrieval and content-based navigation for text, the process depends on matching content. In the case of retrieval, the textual content of the query is matched with text forming the content of the document, typically indexed in some way to accelerate the retrieval process. In content-based navigation, the query (which is typically a portion of text selected from the content of the document) is matched with the text making up the source anchors of links in the linkbase.

For text, these processes of content-based retrieval and navigation are sufficiently well established and widely used for us to conclude with some conviction that content-based retrieval and navigation are worthwhile and effective approaches for text information handling. Of course metadata based searches with text are also widely used and the two approaches can complement each other well. The content matching, on which text content-based processes depend, are in many cases straightforward exact matches between words, although statistical matches between word sets, term switching or query expansion via thesauri, word stemming and other textual tricks can greatly enhance the processes to provide more powerful retrieval and navigation facilities.

Now let us turn our attention to content based retrieval and navigation with non-text media. We will use images as our example although many of the comments will apply equally to other non-text media. Can we say with the same conviction as we did for text that content-based image retrieval and navigation are worthwhile and effective approaches for image information handling? Well, in short, the answer is “No, certainly not with the same conviction”. But there are circumstances where content-based retrieval and content-based navigation may be worthwhile particularly in conjunction with metadata-based techniques. And in the longer term, as research into media processing offers up more powerful approaches, the value of content-based techniques should increase.

In the following sections we look more closely at content-based image retrieval and navigation techniques, examine why they are currently less powerful than for text and examine specific efforts to make them more effective.

Content-based Image Retrieval

The basic reason why image retrieval is more difficult than text retrieval is that the digital representation for most images is as a collection of pixels. The only information which is explicit in such a representation is the colour values at each pixel point. Although, when we look at images we, as humans, are able to interpret them automatically and see meaningful regions of colour, recognise objects and identify scenes which can usefully form the basis of effective image matching processes, we are performing substantial and sophisticated information processing which relies on a large volume of prior knowledge for its success. To achieve effective content based image retrieval (CBIR) software systems must achieve some of this extraction and interpretation in order to find something meaningful to form the basis of the content matching. By contrast, in text documents, the words themselves are explicit in the digital document and it is these that form the basis of the matching process. Hence for text retrieval in its basic form, little additional processing is required.

There have been some excellent recent reviews of content-based image retrieval [2], [3] and the reader is encouraged to look at these for further details. Querying in CBIR can take many forms but the most common is probably the query by example paradigm where the user provides a query image and asks for images from the collection that are similar to it in some way. An alternative might be to ask explicitly for images containing some particular object using a text interface to provide a description of the required image. Such an approach requires that the CBIR system can perform object recognition or scene analysis in order to find the required image and at present this is only possible in specific highly constrained application domains.

General approaches to CBIR attempt to find representations of the image which make more information explicit than simply the pixel colour values. Unsurprisingly, many of the approaches have been based on colour. The colour histogram [4] has been a simple and popular representation which captures the relative amounts of each colour in an image. But it is a global measure and does not give information about colour variations at local positions in the image. Nevertheless it provides a useful measure of some aspects of similarity between images and has been widely used in CBIR systems.

To overcome the global nature of the colour histogram the image has sometimes been divided into patches and the colour histogram calculated for each patch. This allows images to be retrieved from a collection when the query image is only similar to a sub-section of an image in the collection. This is taken further when the images are decomposed into patches hierarchically at decreasing resolutions.

A representation which also tries to capture some local colour information is the colour coherence vector representation [5] which counts separately pixels which belong to large (coherent) regions of the same colour and those which do not. We have developed an approach to sub-image matching which uses a pyramid of colour coherence vectors and which can locate details of high resolution art images in large collections of such images [6]. An example of a sub-image query is shown in figure 1 and the resulting match with the located sub-image is shown in figure 2.

Figure 1
Figure 1
Figure 2
Figure 2

A representation which captures information about colour boundaries within the image has been proposed by Matas et al [7] in an approach they call the multi-modal neighbourhood signature representation. This approach has the added benefit that sub images can be matched directly without the need for a pyramid decomposition.

Colour is not the only basis for representations in CBIR. Texture, which in image processing refers to a measure of repeating patterns in an image, has also provided a useful basis for representations. Again the representations tend to be global and only appear useful for some particular image types where repeating patterns are a central characteristic.

For the ultimate CBIR system what we need is perfect image understanding software. We need to bridge the so-called semantic gap, even to be able to address queries like “Find me images in this collection containing a building”. A simple query by example would be inadequate for satisfying this simple query without a substantial knowledge of the variety of ways in which buildings may appear in images. An even greater challenge comes from queries like “Find me images in this collection which depict acts of kindness” It is worth noting that the semantic gap also exists for text. The gap is not as wide but until we have perfect natural language understanding software it will continue to exist at some level.

For CBIR, a starting point would be to represent explicitly any objects in the image. Shape is an important cue to object recognition and many attempts to use shape in CBIR systems have been reported, even in the early systems like QBIC from IBM [8]. The big problem with this is knowing what constitutes an object. It is possible to segment images into regions and represent the shapes of the regions but the software needs to be trained to match or recognise particular object shapes which will typically be composed of several regions from a segmentation of the image. Some approaches to this have been reported in particular domains but general purpose CBIR systems using objects as intermediate representations are still uncommon. A rather simple example of shape finding comes from the Artiste project [9], a European project to develop a distributed art retrieval, navigation and analysis system. It includes a facility to detect images of paintings in frames of a particular shape. Most frames are rectangular but some are circular, some are triptych etc. A border finder locates the boundary of the frame in the image and a neural net classifier has been trained to use the border to deliver the frame type.

Bridging the Semantic Gap

The search for approaches to the extraction of higher level representations from images is an active area of research. Associating features extracted from images with semantic concepts has been reported [10] and in Southampton we have developed the idea of a multimedia thesaurus in our MAVIS 2 multimedia information system as an attempt to bridge the semantic gap [11].

In a traditional thesaurus, different textual representations of the same concept are associated with one another. In the multimedia thesaurus (MMT), different multimedia representations of the same concept are associated with that concept. The MMT is a multi-layer data structure used for storing the multimedia information in the system. At the highest level there is a semantic layer which records concepts and the relationships between them in the application domain. At the next level down in the simplest form of the architecture are selections from media which in some way represent the concept. For example if the concept is a vase, an image selection containing a vase is a visual representation of the concept. Associated with the image selection are the extracted signatures, for example, giving shape, texture and colour information about the vase. Also at this level we may have textual representations of the vase, so the word vase may be stored and associated with the concept vase in the semantic layer. Textual synonyms such as “amphora” may also be stored in the second level as may other visual representations or sound clips of the word vase being spoken. At the lowest level we have the raw media from which the representations have been selected and by keeping pointers to the raw media for the selections rather than a duplicate we can minimise the storage requirements.

This structure provides some valuable additional functionality in the multimedia system, For example, if query by example is being used for content based image retrieval, and the query can be matched with a representation in the MMT, the system may be able to identify the concept forming the basis of the query and from that it may find alternative representations of the concept which may enable it to retrieve images which would otherwise have been missed. Similarly, if content-based navigation is being used and a link has been authored on one view of an object, it may be possible to follow the link using a different view as the source anchor if both views are associated with the same concept in the MMT. It is also possible to follow a link authored on the text representation of a concept from an image representation of the same concept if the user so wishes.

One of the problems with this approach is the building of the associations in the MMT between representations and the concepts they represent. Clearly a manual approach is possible but is time consuming in the extreme. In a prototype application [12], brief text descriptions associated with the concepts in the semantic layer were available and some of the images had sufficient metadata associated with them to allow the use of latent semantic indexing [13] to estimate the similarity between the concept description and the metadata description of the image. This facilitated automatic creation of some of the associations and others could be made by pattern matching between the images themselves. Images were then automatically associated with the concepts with which similar images were associated. Although not a fully automatic approach it enabled us to recognise this as a way of accelerating MMT building in particular application domains.

As the MMT evolves, it should be clear that a larger and larger number of representations associated with concept classes in the semantic layer will be available. To make an association between a query selection and a concept may take some considerable computation time as the representations extracted from the query are compared with representations in the MMT. However, at some stage in the evolution of the MMT it may be possible to develop a classifier which could allocate new representations to concepts more quickly than via brute force matching. We have made some preliminary investigations into the use of intelligent autonomous processes or agents for monitoring the MMT and clustering and classifying representations as their numbers become suitably large [14]. Existing associations in the MMT are used as the basis for learning by the classifiers.

Conclusion

Although we, and others, have made tentative steps towards bridging the semantic gap in multimedia information handling, particularly in the area of content and concept based retrieval and navigation, many problems remain. One of the key difficulties is that the signatures or representations that we are working with are crude and little prior knowledge is being utilised. Until more powerful image understanding techniques can be developed and incorporated into the image processing functions we will be severely handicapped in our efforts. This is even more true for other non-text media. But even for text, it is clear that retrieval and navigation will benefit from enhanced text understanding facilities.

Another difficulty is the computational problem associated with content based media retrieval. Many of the representations are multidimensional feature vectors of high dimensionality and there are serious problems with indexing such features for rapid retrieval. Although novel indexing strategies have been published many of them collapse at very high dimensionality. Finally, it is worth mentioning that human-computer interface problems are also associated with multimedia information handling. For example, given a query image which contains a complex scene and wishing to use one of the objects in the scene as the query object, how do you indicate to the computer the limits of the object required? Interactive segmentation is a possibility but it is slow and inelegant compared with human capabilities for reasoning over images.

In spite of these continuing difficulties, significant strides have been made in recent years in the area of content-based retrieval and navigation and although metadata will continue to be an essential aid, the increasing value of content-based retrieval and content-based navigation should not be overlooked, particularly in constrained application domains and when metadata is sparse.

Acknowledgements

The authors are grateful to the European Commission for their support through grant IST-1999-11978 and to their collaborators (C2RMF(F), NCR(Dk), Giunti Interactive Labs (I), Uffizi Gallery (I), IT Innovation Centre (UK), The National Gallery (UK), The Victoria and Albert Museum(UK)) on the ARTISTE project for image data and useful conversations.

References

  1. Les A. Carr, David C. DeRoure, Hugh C. Davis and Wendy Hall (1998) Implementing an Open Link Service for the World Wide Web. World Wide Web Journal, 1, 1998.
  2. A.W.M. Smeulders, M. Worring, S. Santini, A. Gupta and R. Jain, (2000) Content-Based Image Retrieval at the end of the Early Years, IEEE Transactions on Pattern Analysis and Machine Intelligence, 22 no 12 1349—1380, 2000
  3. J. Eakins and M. Graham. Content based image retrieval. Technical Report 39, U.K. JISC Technology Application Programme, Oct. 1999.
    URL: <http://www.jtap.ac.uk/> Link to external resource
  4. M. J. Swain and D. H. Ballard. Color Indexing. International Journal of Computer Vision, 7(1):11-32, 1991.
  5. Greg Pass, Ramin Zabih, and Justin Miller. Comparing Images Using Color Coherence Vectors. MultiMedia, pages 65-73. ACM, 1996.
  6. Stephen Chan, Kirk Martinez, Paul Lewis, C. Lahanier and J. Stevenson (2001) Handling Sub-Image Queries in Content-Based Retrieval of High Resolution Art Images. International Cultural Heritage Informatics Meeting p.157-163.
  7. J. Matas, D. Koubaroulis, and J. Kittler. Colour Image Retrieval and Object Recognition Using Multimodal Neighbourhood Signature. In D. Vernon, editor, Proceedings of the European Conference on Computer Vision, LNCS volume 1842, pages 48-64, Berlin, German, June 2000. Springer.
  8. M. Flickner, H. Sawhney, W. Niblack, J. Ashley, Q. Huang, B. Dom, M. Gorkani, J. Hafner, D. Lee, D. Petkovic, D. Steele, and P. Yanker. Query by image and video content: The QBIC system IEEE Computer, 28(9):23-32, Sept. 1995.
  9. The Artiste Project Home Page
    URL: <http://www.artisteweb.org/> Link to external resource
  10. Carlo Columbo, Alberto Del Bimbo, Pietro Pala, Semantics in Visual Information Retrieval IEEE Multimedia, 38-53 July 1999.
  11. M. Dobie, R. Tansley, D. Joyce, M. Weal, P. Lewis, and W. Hall. A flexible architecture for content and concept based multimedia information exploration. In Proceedings of the Challenge of Image Retrieval (CIR'99), pages 1-12, Newcastle, UK, Feb. 1999.
  12. Robert Tansley, Colin Bird, Wendy Hall, Paul Lewis and Mark Weal (2000) Automating the Linking of Content and Concept. Proceedings ACM Multimedia 2000 p.445-448.
  13. T. K. Landauer, P. W. Foltz, and D. Laham. An introduction to latent semantic analysis.Discourse Processes, 25:259-284, 1998.
  14. Dan W. Joyce, Paul H. Lewis, Robert H. Tansley, Mark R. Dobie and Wendy Hall (2000) Semiotics and Agents for Integrating and Navigating Through Media Representations of Concepts. Storage and Retrieval for Media Databases 2000 p.120-31.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Paul Lewis
Department of Electronics and Computer Science
University of Southampton
Southampton
SO17 1BJ

Phone: +44 2380 593715

phl@ecs.soton.ac.uk Link to an email address
<http://www.ecs.soton.ac.uk/~phl> Link to external resource

Paul Lewis is a Senior Lecturer in the Intelligence, Agents and Multimedia Research Group in the Department of Electronics and Computer Science in the University of Southampton. His research interests are in image and video analysis and their applications to multimedia information handling. He has been an investigator on numerous EPSRC and EU grants most recently working on the development of content and concept based retrieval and navigation tools in multimedia environments

Kirk Martinez is a lecturer in the Intelligence, Agents, Multimedia Research Group in the Department of Electronics and Computer Science in the University of Southampton. He has a BSc in Physics from the University of Reading and a PhD in image processing from the University of Essex. When he was Arts Computing lecturer in The University of London he developed image processing applications and imaging for art. His current research is content-based retrieval and museum applications of augmented reality.

David Dupplaw is a research assistant in the Department of Electronics and Computer Science in the University of Southampton working on the Artiste European project. He graduated in Computer Science from the University of Southampton and he is nearing the completion of a PhD on image representations for content-based applications.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Lewis, P., Dupplaw, D., and Martinez, K. "Content-Based Multimedia Information Handling: Should we Stick to Metadata?", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/retrieval/>

-------------------------------------------------------------

Metadata

-------------------------------------------------------------

The Historical Data Warehouse

By Frans Smit - February 2002

Frans Smit reports on adapting concepts from Information and Knowledge Management (IKM) and Information and Communication Technology (ICT) into the field of organizing and giving access to metadata about historical archives and collections [1].

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

In recent decades a dramatic change has occurred in the possibilities to provide access to archives, libraries and museums. This change is partly technologically driven and partly driven by the demands of society to provide for a greater transparency and accountability of governments. In various traditional fields of cultural heritage much effort is being made to provide integrated and instant access to information about their holdings.

This article proposes the creation of a generic model for storing data and delivering information. In this model concepts like metadata and Data Warehousing are used in an integrated way. All these seemingly new concepts are really just a new perspective on what archivists and librarians have been doing for a long time. This new perspective is necessary however in order for cultural heritage professionals to cope with demands and challenges presented by ICT [2]. The technological implementation of this model requires a sound vision on which ICT-architecture is necessary. In this ICT-architecture some important issues must be defined, e.g. which databases must be used, how content should be managed and which role standards play.

The broad objective is to create well organised content of high quality in order to fulfil the needs of the public and for an organisation to enhance its surplus value in society. These issues must create the foundation for a truly effective and integrated service to the public. However the true foundation lies within the knowledge and the consciousness that is present in an organisation.

A lot of the notions and concepts in this article still need to be worked through, though as a starting point it is hopefully a contribution to the improvement of quality and quantity of services from cultural heritage institutes to the public.

An Information and Knowledge Management Model

A key issue for institutes like archives, museums and libraries is to define and to create the surplus value of a cultural heritage institute in society. This surplus value often consists primarily of giving accurate and accessible information to the public about the subjects that are the core business of the institute. This information is based upon the material that is kept in the institute, in combination with the data and knowledge about that material. In every organisation Information and Knowledge Management (IKM) is a main issue for managers. In cultural heritage institutes a good IKM is essential.

The scope of IKM includes data, information and knowledge. IKM is not a goal in itself, it is merely a necessary subject in managing an institute in order to produce accurate services to the public. As for other important issues in an organisation, like financial management and staffing policy, it is important for organisations to start off IKM with a model-like approach. This IKM-model may differ in each organisation. A useful model that benefits from notions from other branches may look like this.

diagram 1
Diagram 1

The basic notion of this model is that the surplus value of the organisation consists of extracting material out of society, filtering it and enhancing it with additional information. The result of this is that the users (whoever they may be, that is not upon the organisation to determine) get better information about the subjects that the organisation is concerned with. Due to the demand of the public, services may be added, changed or removed. Every layer in this model is connected with others and is able to influence them in a direct or indirect manner.

The model has three main entities: users, organisation and society. All entities are dynamic. The organisation should take care that it does not stand between society and the users as a barrier. Its reason to exist lies in the precondition that it must give back more to the users than that it extracts from society. To realise this, every defined layer in the organisation should be designed and executed in the context of an IKM-model. Let us take a look at the organisation layers depicted above (especially focusing on the layers of Metadata and the Historical Data Warehouse).

First and foremost the products and services of the organisation show its surplus value to the public. These products may differ considerably and may include publications, exhibitions, catalogues, merchandising, and giving access to original archival material (or reproductions of that material) in a reading room. Of course a Web site should be counted among these products, creating a digital platform for a variety of services to the public. Adding, changing and removing products and services will always cause changes in the other layers in the organisation.

All the products and services are created and maintained by the work processes in the organisation. These work processes cover the whole range of activities that give added value to products created by the organisation. This range of activities is always inter-connected and inter-related to each other and to the other layers. Traditional work processes in institutes for cultural heritage are appraisal, management, description and services to the public. If you add a new service to the public, for example creating a virtual exhibit on the Internet, it will definitely have an impact on service to the public and the metadata needed to support this function. However, the process of management and cataloguing may be influenced as well. Even the appraisal of material may be affected if there is not enough original material present to create the new service.

Creating products and services to the users always require an accurate set of metadata. There is much confusion about the definition of metadata among various disciplines. In my view the concept of metadata comprises structured data about the original material that is kept in the institute (e.g. surveys, catalogues, inventories and genealogical indexes), unstructured data about that material (e.g. publications, guidelines etc.) as well as the knowledge the people in the organisation have about that material [3]. This last category may be called mobile metadata, with all the risks mobility entails (e.g. sick leaves, retirements, vacations etc.).

The accuracy of the metadata is determined by the way in which staff prepare the information that support a particular work process. The more metadata is structured and standardised, and the more they are kept in well-structured and connected databases, the better they can support the various work processes. In order to create and maintain a sound set of metadata it is necessary to have an active policy of IKM. IKM should provide for the availability for as much data as possible in a way that benefit all work processes that may need the data.

It is important to make a distinction between metadata that can be structured and metadata that can not be structured. In order to make this clear the following cycle of knowledge is a useful tool [4].

diagram 2
Diagram 2

In this diagram the data can be viewed as metadata in the most structured way. The data are considered “facts” and are described as such. Modern ICT-concepts and technologies are most helpful to maintain the data in such a way that it can be used in every way possible. Information may be viewed as the result of a combination of data for a specific purpose. Accurate information enhances knowledge. This knowledge improves the understanding of a particular subject. This understanding results in more data or can lead to changes in existing data.

In practice it is impossible to empty everyone’s head about every subject in order to make a complete dataset of every relevant subject. The reason is that the human mind can connect and combine data and information in a way that no information system is presently capable of. It is a challenge for IKM to lay down as much information and data that is present in structured forms, so that they are easily transferable and independent of any human intervention.

For these structured forms of metadata it is possible to lay down criteria in such a way that a healthy ICT-architecture can be realised. For every organisation those criteria may differ but the following ones may be universal:

Metadata can be divided into separate categories, depending on their meaning and the work process in which they are created and maintained. Possible categories for cultural heritage organisations include:

Several accepted international standards, like Dublin Core, ISAD(G), ISAAR(CPF) and EAD, cover many or most of these categories. Not all of them however link these categories to aggregate levels and context of the metadata.

For institutes in the cultural field having the task of appraising and keeping historical material, all the products, work processes and metadata have to relate to that material. Preserving this material is on the long term the most important layer in the IKM-model presented above. In order to represent this material into a contemporary IKM-model, I have labeled it as a Historical Data Warehouse. The reason is that in ICT literature the concept of metadata is often linked to Data Warehousing. The archives and collections that should be kept in an organisation responsible for cultural heritage operate in much the same way as a Data Warehouse does in a business enterprise. It is the basis for providing accurate and inalterable information in order to facilitate decision-making and accumulate and enhance knowledge and understanding.

Marco provides the following useful definition of a Data Warehouse: “A data warehouse is a single, entreprise-wide collection of data”. This collection should fulfil the following four preconditions:

  1. A Data Warehouse is subject-oriented;
  2. A Data Warehouse provides an integrated view of an enterprise’s major subject areas;
  3. A Data Warehouse is non-volatile;
  4. A Data Warehouse holds historical views of data [6].

The concept of Data Warehousing was developed in the ICT-world for data that is kept on digital platforms. While there is a big difference with the majority of data that are kept in cultural heritage organisations this difference nevertheless does not make the comparison worthless. The preconditions of a Data Warehouse mentioned above could also prevail for archives and collections preserved on non-digital media, thus bridging a gap in concepts used in two often too separated worlds.

Another advantage to labelling archives and collections as an Historical Data Warehouse for society is that the concept of metadata falls into place. The word metadata is widely used for data that is necessary to maintain a Data Warehouse. Metadata is the data that is created, changed and removed by yourself, the data in a Data Warehouse should never be changed. As an expert on metadata and data warehousing, David Marco describes the concepts of metadata and Data Warehousing in a way that is very similar to that applied to archives and libraries: “Metadata is the card catalog in a data warehouse. By defining the contents of a data warehouse, meta data helps users locate relevant information for analysis. In addition, meta data enables users to trace data from the data warehouse to its operational source (i.e. drill-down) and to related data in other subject areas (i.e., drill-across). By managing the structure of the data over a broad spectrum of time, it provides a context for interpreting the meaning of information” [7].

The comparison of traditional archives and collections with modern Data Warehousing is an interesting way of putting an IKM-model for cultural heritage in the context of the digital age. Archivists were not considered when this concept was developed. This is both surprising and regrettable because ICT-experts could have learned a lot from them about such concepts as authenticity, reliability, readability and the context and creation of data!

Information and Knowledge Management and ICT-architecture

The above described IKM-model is meant to give an overall, broad and consistent perspective on how to handle data and information in an organisation that manages historical data and material. It is like the design of a nervous system of an organisation. The functioning of this nervous system is nowadays determined by an appropriate use of ICT-systems.

In the last decades some general shifts have occurred in the usage of ICT-tools. In almost all fields, public or private, using ICT started off bottom-up, by enthusiast specialists. This phase had a character of experimenting, making mistakes and learning. With the growth of importance of ICT and with the ever-growing possibilities ICT became in a lot of fields a matter of strategic importance with a tendency to design and implement big monolithic systems. With the rise of client-server systems and especially with the rise of the Internet various systems are being used that are interconnected through a corporate concept. This concept is commonly called an ICT-architecture. The pace in which this process has been taking place varies a lot. In the field of cultural heritage institutes the described phases occur in a lot of institutes at the same time.

What is an ICT-architecture? Applegate describes it as follows: “Just as the blueprint of a building’s architecture indicates not only the structure’s design but how everything –from plumbing and heating systems to the flow of traffic within the building- fits and works together, the blueprint of a firm’s information architecture defines the technical computing, information management, and communications platform. The IT Architecture provides an overall picture of the range of technical options available to a firm, and, as such, it also implies the range of business options. Decisions made in building the technical IT architecture must be closely linked to decisions made in designing the IT organisation that will manage the architecture, which, in turn, must be linked to the strategy and organisation design of the firm itself. Conversely, the organisation strategy, structure, incentives, and processes strongly influence how the technology will be designed, deployed, and used within a firm” [8].

When using the IKM-model described above, the consequence is that the usage of ICT in every layer should be accurately designed, implemented and connected to each other. At every layer ICT will play a role. In the layer of products and services one can think of Internet applications, e-commerce and applications to follow the behavior of the public. The work processes may need one or more applications to enter, change and remove metadata. The metadata it self, if digitally kept, will involve the use of databases and text-files. The Historical Data Warehouse may have digital material and digital reproductions (or even substitutes) of original, non-digital material.

Well-designed ICT-architecture must give an accurate answer to the question of how to use the various components of ICT in every layer of the IKM-model. There are various ways in making distinctions between various ICT-components. One of them is to present ICT-tools in a hierarchical order represented below.

diagram 3
Diagram 3

As in the IKM-model every part is interconnected. On every level a choice has to be made for using the appropriate ICT products. The choices may differ according to scope, budget and structure of the organisation.

One of the ways in which it is possible to make those choices, is to make an enterprise-wide information audit and develop an ICT-policy based on the audit. The result of such an audit varies in every organisation. I will describe some conclusions that I have drawn from my daily practice.

It is very important to use state-of-the-art tools regarding data structures. The metadata layer of the IKM-model is the key layer for being able to generate useful information. Connectivity of the data is perhaps the most essential precondition. This precondition can be met by using relational databases. They are the most modern and powerful tools for designing, implementing and securing good data structures and the data itself. The language for manipulating data in a relational database, SQL, is globally accepted and supported. Relational databases are also open, which means that they can always be connected to each other in real time. Of course the logical and technical data structures in the databases should be designed and implemented in a professional way.

Standardization of metadata is essential but it is also a misunderstood topic. To those that will handle international standards regularly it may sound like nothing new, but a lot of institutes have not implemented them yet. One of the odd things about it is that those standards like ISAD(G) are often considered as being totally new, whereas they are mostly an improved version of previous standards. It is not necessary to create a totally new set of metadata, often a conversion from one data structure to another (with eventually a conversion to modern, digital media) is the only thing to be done. Another misunderstanding about standards is about when they become important. Standardization is needed to create a common language structure among organisations to enable the exchange of information. It is not important whether the database you use is entirely structured according to a standard. As long as you are able to generate your data in the desired structure when needed, there is no need to worry. Much more important is that you choose a state-of-the-art database platform so that you are assured that you can anticipate as good as possible on future developments in databases and metadata standards.

One of the fundamental principles in the ICT-architecture is that maintaining the metadata is different from its presentation. Metadata should never be redundant and metadata that belongs together should be stored and maintained together in preferably one database. You can present them however in various ways. You can present a catalogue on the Internet through a search engine, but you can use the same metadata in a printed version and you can deliver them to a Web site where they are merged with metadata from other institutes. A further important notion is that software for presenting metadata tend to change more often than software for maintaining metadata. An ICT-architecture for maintaining and presenting metadata, that takes these aspects into account and can act as a frame of reference for developing good and efficient software, may look like this.

diagram 4
Diagram 4

Closing remarks: what is the use of IKM in practice?

The described models and concepts are useless when the implementation does not improve the performance of an organisation. And the proof of this improvement can only lie in the improvement of services to the users. Models like this can contribute to the definition of new projects and their impact on the organisation. They can also show the connection between various projects. It is a way of showing all departments in an organisation what the consequences are for all work processes if for example you decide to make a virtual exhibit on the Internet or to create an online search engine through metadata about your archives and collections. A model may help to reduce the risk of modifying work processes or metadata structures in such a way that they cannot be used anymore by other work processes. It also helps to prevent the introduction of new ICT-systems that will prevent the organisation from reaching goals like integration, standardization and presentation.

An IKM-model and an ICT-architecture provide a very effective framework for managing work processes and projects. Every work process and every project can be linked to each other. It is possible to make up a checklist for new projects to make sure that the results of the project fit into the preconditions of the organisation concerning IKM and ICT.

The most important assumption is that the management of the organisation should be aware of these instruments of improving services to the public. This awareness enables at the same time that all the work processes work together in such a way that these services can be created and maintained. It is up to management to create a policy for mid-term and long-term goals and objectives in such a way that projects can be started off in a way that is consistent with that vision.

To conclude this article, an example from the Municipal Archives of Amsterdam may be illustrative of an attempt to create new services to the public. The long-term aim is to create one single portal on the Internet to all descriptions of content and context of the archives and collections. This portal should be designed in such a way that the questions that are mostly asked by the users (questions about a person or organisation, about a subject, a location or a period) are answered in the simplest way possible. This is a huge task that will involve all work processes.

The starting point is very different from the situation that should be created. This is a fate that a lot of institutes now share. A lot of metadata is incomplete, not standardised, not kept in modern databases and not linked to each other. The Municipal Archives of Amsterdam not only hold archives but also vast collections of library material, audiovisual material, photos, drawings and maps. It is a huge task to integrate all the metadata related to this material, using appropriate standards. The work processes are not all structured in order to create integrated services to the public. In order to assemble metadata from different sources a series of projects were defined. The first step was to create a complete set of accurate metadata for the highest aggregate level based on a survey of various access tools (finding aids). This survey is the first result of this [9]. Other search engines on the Web site are not yet linked to this survey. The next step is to present a set of metadata on lower aggregation levels for the archives. The material at present was not stored in a database. In order to realise this precondition a massive data-entry project was initiated. The result of this project is a database with 350.000 records. In order to present the metadata as an integrated service to the public it is necessary to enhance create and present indexes on persons, organisations, subjects, locations and time periods. This project has a twin brother in the back office, where existing ICT systems must be altered or replaced in order to create the necessary metadata in a standardised way.

In the next few years the intention is to integrate the metadata about context and content of all archives and collections into this model. The consequences are that almost all existing ICT-systems must be reconsidered, strategic decisions must be made about the choice of standards for metadata, a lot of conversion or data-entry of existing data must take place, quality controls and different ways of working should be implemented. This cannot succeed without defining an appropriate IKM-model and an ICT-architecture.

An Dutch version of this article is also available.

References

  1. My special thanks go to Kent Haworth, York University Archivist and Head, Special Collections and Project Director and Secretary, ICA Committee on Descriptive Standards, for his comments on an earlier version of this article.
  2. In most literature the abbreviation “IT” is used, I prefer to use the more modern abbreviation “ICT” - Information and Communication Technology.
  3. Compare for example Marco, p. 5
  4. Wurman, p. 27 and Milner, p. 3. I have replaced the word “wisdom” in this model with the more modest word “understanding”.
  5. Compare for example Gilliland-Swetland, Anne J, Defining metadata, in Baca, p. 3;
  6. Marco, p. 23-24. Marco cites Inmon, W.H.: Building the Data Warehouse, Wiley, 1996, p. 33;
  7. Marco, p. 48
  8. Applegate c.s., p. 139-140.
  9. Survey
    URL: <http://www.gemeentearchief.amsterdam.nl/archieven_en_collecties/overzicht/introductie/index.nl.html> Link to external resource

Literature and suggestions for further reading

  1. Applegate, Lynda, F. Warren MacFarlan en James L. MacKenney (1999). Corporate Systems Information Management. Irwin MacGraw-Hill, Boston.
  2. Baca, Martha and others. (1998) Introduction to metadata, pathways to digital information. Getty Information Institute, New York.
  3. Cook, Terry (2001). Archival Science and Postmodernism: New formulations for Old Concepts, in Archival Science (2001-1), ed. Horsman, P., E. Ketelaar and T. Thomassen, Kluwer Academic Publishers, Dordrecht, p. 3-24.
  4. Getty Information Institute, New York, Art and Architecture Thesaurus
    URL: <http://www.getty.edu/research/tools/vocabulary/aat> Link to external resource
  5. International Council of Archives (ICA), ISAAR(CPF) standard
    URL: <http://www.ica.org> Link to external resource
  6. International Council of Archives (ICA), ISAD(G) standard
    URL: <http://www.ica.org> Link to external resource
  7. Marco, David (2000). Building and managing the meta data repository, a full life-cycleguide. John Wiley & Sons, New York.
  8. Menne-Haritz, A. (2001). Access: the reformulation of an archival paradigma, in Archival Science (2001-1), ed. Horsman, P., E. Ketelaar and T. Thomassen, Kluwer Academic Publishers, Dordrecht, p. 57-82.
  9. Milner, Eileen M. (2000). Managing Information and Knowledge in the Public Sector. Routledge, London.
  10. Records Continuum Research Group, Australia.
    URL: <http://rcrg.dstc.edu.au> Link to external resource
  11. Ribeiro, Christina and Gabriel David (2001). A Metadata Model for Multimedia Databases.
  12. Smit, F.P. (2000). Proposal for a Datamodel of Archival Descriptions, in: Atti del Summit DACE, Roma, 2000, p. 149-196.
  13. Smit, F.P. (2001), Het nieuwe Overzicht van Archieven en Collecties, in: Archievenblad (2001-1), Koninklijke Vereniging van Archivarissen, Amsterdam, p. 26-29.
  14. Society of American Archivists, Encoded Archival Description
    URL: <http://www.loc.gov/ead> Link to external resource
  15. Svenonius, Elaine (2001). The Intellectual Foundation of Information organisation. The MIT Press, Cambridge Massachusetts.
  16. Wurman, Richard Saul (2001). Information Anxiety 2. QUE, Indianapolis.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Frans SmitFrans Smit
Manager of the section of archival descriptions and cataloguing
Municipal Archives of Amsterdam
PO Box 51140
1007 EC Amsterdam
The Netherlands

Phone: ++31 20 5720227
Fax: +31 20 6750596

<fsmit@gaaweb.nl> Link to an email address
<franssmit@planet.nl> Link to an email address
<http://www.gemeentearchief.amsterdam.nl> Link to external resource

Frans Smit is Head of the Section of Archival Descriptions and Cataloguing at the Municipal Archives of Amsterdam. He is (and has been) also engaged in various national and international projects concerning providing access to metadata about archives and collections through search engines on the web.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Smit, F. "The Historical Data Warehouse", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/warehouse/>

-------------------------------------------------------------

Het Historisch Data Warehouse

By Frans Smit - February 2002

Frans Smit past concepten toe die afkomstig zijn uit Informatie- en Kennismanagement (IKM) en van Informatie- en Communicatietechnologie (ICT) ten bate van het organiseren en toegankelijk maken van metadata over historische archieven en collecties [1].

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Inleiding

In de afgelopen decennia is een fundamentele verandering opgetreden in de mogelijkheden om de toegankelijkheid tot archieven, musea en bibliotheken te vergroten. Deze verandering komt deels voort uit de technologische vooruitgang maar voor een even belangrijk deel uit de toenemende maatschappelijke behoefte aan een transparante overheid. Veel instellingen op het gebied van cultureel erfgoed zijn bezig om een meer geïntegreerde en snellere toegang te creëren tot informatie over het historisch materiaal dat bij hen bewaard wordt.

In dit artikel wordt een generiek model voorgesteld om gegevens te bewaren en informatie te verschaffen. In dat model worden concepten als metadata en Data Warehousing op een geïntegreerde wijze toegepast. Deze schijnbaar nieuwe concepten vormen in werkelijkheid een nieuw perspectief op wat archivarissen en bibliothecarissen al sinds lange tijd doen. Dit nieuwe perspectief is echter wel nodig voor professionals op het gebied van cultureel erfgoed om de problemen en uitdagingen van het ICT-tijdperk [2] het hoofd te kunnen bieden. De technische implementatie van het model vereist een degelijke visie op de benodigde ICT-architectuur. In deze architectuur moeten belangrijke aspecten worden vastgesteld, bijvoorbeeld welke databases moeten worden gebruikt, hoe de gegevens moeten worden beheerd en welke rol internationale standaarden in deze hebben.

Het doel hiervan is het creëren en in stand houden van goede en goed georganiseerde gegevens. Deze gegevens kunnen voorzien in de informatiebehoefte van het publiek waardoor de maatschappelijke meerwaarde van de organisatie wordt gewaarborgd. Het achterliggende fundament voor die gegevens bestaat altijd uit de aanwezige kennis over het aanwezige historische materiaal.

De concepten die in dit artikel worden beschreven, dienen nog in detail uitgewerkt te worden. Ze zijn als uitgangspunt voor de verbetering van de kwaliteit en kwantiteit van dienstverlening door instellingen op het gebied van cultureel erfgoed aan het publiek naar mijn mening echter nu al uitstekend bruikbaar.

Een Informatie- en Kennismanagement Model

Voor instituten als archieven, musea en bibliotheken is het van essentieel belang om te definiëren wat de maatschappelijke meerwaarde is van het instituut, om vervolgens die meerwaarde te kunnen creëren. Deze meerwaarde bestaat meestal uit het verschaffen van accurate en toegankelijke informatie aan het publiek over de onderwerpen die tot de kerntaken van het instituut worden gerekend. Die informatie is gebaseerd op het materiaal dat wordt bewaard in het instituut, de beschikbare gegevens en de kennis over dat materiaal. In iedere organisatie is Informatie- en Kennismanagement (IKM) een belangrijk onderwerp. Voor archieven, musea en bibliotheken is een goed IKM niet minder dan een bestaansreden.

IKM omvat gegevens, informatie en kennis. IKM is niet een doel op zich, het is een noodzakelijk onderwerp voor het management om de juiste producten en diensten aan te kunnen bieden. Zoals bij andere belangrijke managementonderwerpen, bijvoorbeeld financieel management en personeelsmanagement, is het ook bij IKM nuttig om een modelmatige benadering te hanteren. Een bruikbaar IKM-model dat aansluit op ervaringen in andere bedrijfstakken kan er als volgt uitzien.

diagram 1
Diagram 1

Het uitgangspunt van dit model is dat de maatschappelijke meerwaarde van de instelling bestaat uit het onttrekken van materiaal uit die maatschappij, het selecteren van dat materiaal en het ervan voorzien met aanvullende informatie. Het resultaat is dat de klanten van de instelling (wie dat ook mogen zijn, dat is niet aan de instelling om te bepalen) meer en betere informatie dienen te krijgen over de onderwerpen waar de organisatie zich mee bezig houdt. Afhankelijk van de behoefte kunnen producten en diensten worden vernieuwd, aangevuld of verwijderd. Iedere laag in het IKM-model is gekoppeld aan de andere lagen en is in staat om de andere lagen direct of indirect te beïnvloeden.

Het model kent drie dynamische grootheden: klanten, instellingen en maatschappij. De instelling moet zich ervoor hoeden om als een hindernis tussen de klanten en de maatschappij te staan. Haar bestaansreden ligt in de vooronderstelling dat het meer teruggeeft aan de klanten dan dat het onttrekt aan de maatschappij. Om dit waar te kunnen maken, moet iedere laag in de instelling worden ontworpen en gerealiseerd in het kader van een IKM-model. Hieronder worden de diverse lagen in het IKM-model binnen de instelling nader beschreven, met speciale nadruk op de metadata en het Historisch Data Warehouse.

De producten en diensten tonen de meerwaarde van de instelling aan de klanten. Deze producten en diensten kunnen zeer gevarieerd zijn, bijvoorbeeld publicaties, tentoonstellingen, toegangen, merchandising en raadpleegmogelijkheden van origineel of gereproduceerd materiaal in een studiezaal. Uiteraard is een website ook een product omdat het een digitaal platform creëert voor allerlei diensten. Nieuwe, gewijzigde en verwijderde producten en diensten zullen altijd veranderingen veroorzaken in andere lagen binnen de organisatie.

Alle producten en diensten worden gemaakt en onderhouden door de werkprocessen binnen de instelling. Deze werkprocessen omvatten alle activiteiten die meerwaarde geven aan de producten. Deze activiteiten zijn altijd onderling van elkaar afhankelijk. Traditionele werkprocessen binnen instellingen voor cultureel erfgoed zijn verwerving, toegankelijk maken, beheer en dienstverlening. Als een nieuw product wordt gemaakt voor de klant, bijvoorbeeld een virtuele tentoonstelling op Internet, dan zal dat invloed hebben op de dienstverlening en de benodigde metadata. Het werkproces toegankelijk maken kan ook beïnvloed worden. Zelfs is het mogelijk dat nieuw materiaal verworven moet worden om het product te kunnen realiseren.

Voor de realisatie van producten en diensten is altijd een correcte en volledige verzameling van metadata nodig. Er bestaat veel verwarring over de definitie van de term metadata binnen verschillende disciplines. Naar mijn mening omvat het begrip metadata voor instellingen voor cultureel erfgoed gestructureerde gegevens over het historische materiaal dat wordt bewaard in een instelling (bijvoorbeeld overzichten, catalogi, inventarissen en genealogische indexen), ongestructureerde gegevens over dat materiaal (bijvoorbeeld publicaties en handleidingen) en kennis die bij de medewerkers van de instelling aanwezig is over dat materiaal. De laatste categorie kunnen ook omschreven worden als mobiele metadata, met alle risico’s van dien (ziekte, pensioen, vakantie etc.).

De juistheid en volledigheid van de metadata worden bepaald door de manier waarop de medewerkers informatie samenstellen binnen een bepaald werkproces. Des te meer de metadata zijn gestructureerd, gestandaardiseerd en bewaard in goede en open databases, des te beter kunnen zij de diverse werkprocessen ondersteunen. Teneinde een degelijke verzameling aan metadata te kunnen samenstellen en onderhouden, is een actief IKM-beleid noodzakelijk. Het IKM-beleid dient er zorg voor te dragen dat zoveel mogelijk gegevens beschikbaar zijn op de momenten en plaatsen dat ze voor een werkproces nodig zijn.

Het is van belang om een goed onderscheid te maken tussen metadata die gestructureerd kunnen worden en metadata die ongestructureerd blijven. Om dat duidelijk te maken is de volgende weergave van de cyclus van kennis behulpzaam [3].

diagram 2
Diagram 2

In dit diagram zijn de gegevens de meest gestructureerde metadata. De gegevens worden als feiten gezien en als zodanig beschreven. Moderne ICT-concepten en –technologieën zijn zeer behulpzaam in het dusdanig bewaren en onderhouden van die gegevens dat ze op allerlei mogelijke manieren kunnen worden gebruikt. Informatie kan worden gezien als een combinatie van de gegevens voor een specifiek doel. Juiste informatie leidt tot kennis. Deze kennis vergroot het inzicht over een onderwerp. Dit inzicht kan weer leiden tot nieuwe gegevens of wijzigingen van bestaande gegevens.

In de praktijk is het onmogelijk om alle kennis uit mensen te halen teneinde een complete verzameling aan gegevens over ieder relevant onderwerp samen te stellen. Aangezien de mens gegevens en informatie aan elkaar kan koppelen op een manier die door geen enkel informatiesysteem kan worden gevenaard, is het voor het IKM-beleid een grote uitdaging om zoveel mogelijk gegevens en informatie vast te leggen op een gestructureerde manier, zodat zij eenvoudig en onafhankelijk van menselijke tussenkomst kunnen worden gebruikt.

Voor gestructureerde metadata is het mogelijk om criteria te bepalen op basis waarvan een gezonde ICT-architectuur kan worden gerealiseerd. Deze criteria kunnen per organisatie verschillen maar de volgende kunnen als universeel worden aangemerkt:

Metadata kunnen worden verdeeld in categorieën op grond van hun betekenis en van het werkproces waarin ze worden gecreëerd en onderhouden. Mogelijke categorieën voor metadata van instellingen van cultureel erfgoed zijn:

Verschillende internationale standaarden, zoals Dublin Core, ISAD(G), ISAAR(CPF) and EAD, bevatten veel of alle hierboven vermelde categorieën. Ze bevatten echter niet alle mogelijkheden om de categorieën te koppelen aan aggregatieniveaus en context van de metadata.

Bij instellingen die historisch materiaal bewaren, dienen alle producten, werkprocessen en metadata een relatie te hebben met dat materiaal. Het behouden van dat materiaal is op de lange termijn de belangrijkste taak van de instelling. In het IKM-model heb ik dat materiaal op een eigentijdse manier omschreven als een Historisch Data Warehouse. De reden is dat in veel literatuur over ICT metadata worden gekoppeld aan een Data Warehouse. Historische archieven en collecties hebben vaak dezelfde functie als een Data Warehouse in een moderne organisatie. Ze vormen de basis voor het verschaffen van accurate en onveranderbare informatie teneinde besluitvorming, kennis en inzicht te ondersteunen.

Marco geeft de volgende bruikbare definitie van een Data Warehouse: “A data warehouse is a single, entreprise-wide collection of data”. Deze verzameling dient te voldoen aan de volgende vier randvoorwaarden:

  1. Een Data Warehouse is onderwerpsgericht;

  2. Een Data Warehouse geeft een integraal beeld van de werkgebieden van een organisatie;

  3. De gegevens in een Data Warehouse zijn onveranderbaar;

  4. Een Data Warehouse bevat historische gegevensverzamelingen [5].

Het concept Data Warehouse is ontwikkeld binnen de ICT-wereld voor gegevens die digitaal worden bewaard. Hoewel dat niet opgaat voor de meerderheid van gegevens die worden bewaard in organisaties voor cultureel erfgoed, is de vergelijking erg bruikbaar.De bovengenoemde uitgangspunten voor een Data Warehouse gaan ook op voor archieven en collecties, ook al zijn die niet samengesteld op een digitaal platform. De vergelijking kan dienen om een kloof te dichten tussen twee werelden die al te zeer van elkaar gescheiden zijn.

Een ander voordeel voor het aanduiden van archieven en collecties als een Historisch Data Warehouse voor de maatschappij is dat het begrip metadata op zijn plaats valt. Het begrip metadata wordt vaak gebruikt voor het aanduiden van de gegevens die nodig zijn voor het onderhouden van een Data Warehouse. Metadata zijn de gegevens die worden samengesteld, gewijzigd en verwijderd in de werkprocessen, de gegevens in een Data Warehouse mogen nooit worden gewijzigd. David Marco beschrijft de concepten van metadata en Data Warehousing op een erg herkenbare manier: “Meta data is the card catalog in a data warehouse. By defining the contents of a data warehouse, meta data helps users locate relevant information for analysis. In addition, meta data enables users to trace data from the data warehouse to its operational source (i.e. drill-down) and to related data in other subject areas (i.e., drill-across). By managing the structure of the data over a broad spectrum of time, it provides a context for interpreting the meaning of information” [6].

De vergelijking van historische archieven en collecties met het moderne concept van Data Warehousing is een interessant perspectief voor het ontwikkelen van een eigentijds IKM-model voor instellingen voor cultureel erfgoed. Archiefspecialisten werden overigens niet betrokken bij de ontwikkeling van het concept van Data Warehousing. Dat is tegelijkertijd verrassend en betreurenswaardig omdat ICT-specialisten veel hadden kunnen leren over concepten als authenticiteit, betrouwbaarheid, leesbaarheid en context en creatie van gegevens!

Informatie- en Kennis Management en ICT-architecturen

Het hierboven beschreven IKM-model biedt een abstract, integraal en consistent perspectief op het omgaan met historische gegevens en informatie binnen een organisatie. Het is het zenuwstelsel van de organisatie. Het functioneren van dat zenuwstelsel is tegenwoordig grotendeels bepaald door een goed gebruik van ICT-systemen.

In de laatste decennia zijn een aantal veranderingen opgetreden in het gebruik van ICT-middelen. Overal is ICT begonnen door enthousiaste specialisten die veelal onafhankelijk van elkaar werkten. Deze fase had vaak een experimenteel karakter. Met de groei van het belang en de mogelijkheden van ICT werd het vakgebied een zaak van strategisch belang. Grote monolithische systemen werden gerealiseerd. Met het ontstaan van client-server systemen en vooral met de opkomst van Internet werden systemen ontwikkeld die middels een integraal concept werden ontwikkeld Dit concept wordt doorgaans ICT-architectuur genoemd. De snelheid waarin deze ontwikkeling plaats heeft gevonden is erg verschillend. In instellingen voor cultureel erfgoed komen de beschreven fases vaak tegelijkertijd voor.

Wat is een ICT-architectuur? Applegate beschrijft het als volgt: “Just as the blueprint of a building’s architecture indicates not only the structure’s design but how everything –from plumbing and heating systems to the flow of traffic within the building- fits and works together, the blueprint of a firm’s information architecture defines the technical computing, information management, and communications platform. The IT Architecture provides an overall picture of the range of technical options available to a firm, and, as such, it also implies the range of business options. Decisions made in building the technical IT architecture must be closely linked to decisions made in designing the IT organization that will manage the architecture, which, in turn, must be linked to the strategy and organization design of the firm itself. Conversely, the organization strategy, structure, incentives, and processes strongly influence how the technology will be designed, deployed, and used within a firm” [7].

Indien het IKM-model zoals dat hier is beschreven als uitgangspunt wordt genomen, dient de inzet van ICT-middelen in iedere laag correct te worden ontwerpen, geïmplementeerd en aan elkaar gekoppeld. In iedere laag zal ICT een rol spelen. Op het gebied van producten en diensten kan gedacht worden aan Internet-applicaties, e-commerce en software om het gedrag van de klant te kunnen vastleggen. Alle werkprocessen zullen een of meer applicaties moeten gebruiken om metadata te raadplegen, in te voeren, te wijzigen of te verwijderen. De metadata, mits digitaal opgeslagen en onderhouden, zullen worden bewaard in databases en tekstbestanden. Het Historisch Data Warehouse zal digitaal historisch materiaal bevatten en digitale reproducties (of zelfs substituten) van origineel, niet-digitaal materiaal bevatten.

Een goede ICT-architectuur zal een antwoord moeten geven op de vraag hoe verschillende componenten worden aangewend in de verschillende lagen in het IKM-model. Er zijn verschillende manieren waarop ICT-componenten worden onderscheiden. Een goed onderscheid wordt gemaakt in onderstaand diagram.

diagram 3
Diagram 3

Net als in het IKM-model zijn alle componenten met elkaar verbonden. In ieder deel moet een verantwoorde keuze worden gemaakt uit de beschikbare producten op de markt. Die keuzes hangen af van het doel, het budget en de structuur van de organisatie.

Een goede manier om deze keuzes te maken, is het uitvoeren van een informatie audit en het ontwikkelen van een ICT-beleid op grond van die audit. Het resultaat van die audit verschilt uiteraard per organisatie. Hieronder beschrijf ik enige conclusies die ik uit mijn huidige praktijk heb getrokken.

Het is van groot belang om moderne hulpmiddelen te gebruiken voor gegevensstructuren. De metadata nemen binnen het IKM-model een sleutelpositie in met betrekking tot het leveren van nuttige en juiste informatie. De belangrijkste randvoorwaarde daarbij is connectiviteit van die gegevens, hetgeen kan worden waargemaakt door het gebruiken van relationele databases. Zij vormen het belangrijkste en krachtige instrument om gegevensstructuren en de gegevens te ontwerpen, te implementeren en te waarborgen. De taal om gegevens in een relationele database te manipuleren, SQL, is wereldwijd geaccepteerd en ondersteund. Relationele databases zijn open, hetgeen inhoudt dat zij direct altijd kunnen worden gekoppeld aan andere databases. Uiteraard dienen de logische en technische structuren op een professionele manier te worden ontworpen en gerealiseerd.

Standaardisatie van metadata is een belangrijk maar vaak verkeerd begrepen aspect. Voor degenen die vaak internationale standaarden hanteren is het wellicht niet nieuw maar veel instituten gebruiken deze standaarden niet. Een van de merkwaardige zaken is dat standaarden als ISAD(G) vaak worden gezien als iets totaal nieuws, terwijl ze meestal een verbeterde versie zijn van eerdere conventies. Het is niet nodig om een volledig nieuwe verzameling van metadata samen te stellen. Vaak is een conversie van de oude naar de nieuwe structuur (en eventueel van verouderde naar moderne, digitale informatiedrager) voldoende. Een ander misverstand met betrekking tot standaarden betreft de vraag wanneer en hoe ze moeten worden geïmplementeerd. Standaardisatie is nodig om een gezamenlijke structuur te creëren om informatie-uitwisseling nodig te maken. Het is in feite niet van belang of een database exact gestructureerd is volgens een standaard. Zolang het mogelijk is om de gegevens te genereren in een gestandaardiseerd formaat, is er geen reden tot zorg. Het is veel belangrijker om ervoor te zorgen dat een state-of-the-art database platform wordt gebruikt die het mogelijk maakt om goed te anticiperen op toekomstige ontwikkelingen in databasetechnologie en standaarden voor metadata.

Een van de fundamentele uitgangspunten in de ICT-architectuur is dat het onderhoud van metadata andere applicaties vergt dan de presentatie van de metadata. Metadata mogen nooit redundant worden samengesteld. Ze moeten worden bewaard in bij voorkeur één database. Ze kunnen echter wel gepresenteerd worden op veel verschillende manieren, bijvoorbeeld middels een zoekmachine op Internet, een geprinte versie en via een website waarin ze worden samengevoegd met gegevens van andere instituten. Daarbij komt dat software voor presentatie van de metadata meestal sneller en vaker moet wijzigen dan software voor onderhoud van die gegevens. Een ICT-architectuur die een goede scheiding maakt tussen deze twee soorten applicaties kan er uitzien zoals in onderstaand diagram. Het biedt een kader voor het ontwikkelen van goede, doelgerichte applicaties.

diagram 4
Diagram 4

Slotopmerkingen: wat is het nut van IKM in de praktijk?

De hierboven beschreven modellen en concepten zijn volstrekt nutteloos als ze niet leiden tot een verbetering van de prestaties van een instelling. Het bewijs van die verbetering kan alleen geleverd worden door een verhoging van de kwaliteit van producten en diensten van de instelling. Een grote meerwaarde van het gebruik van dergelijke modellen ontstaat als ze gebruikt worden bij de definitie van nieuwe projecten en de invloed van de beoogde projectresultaten op de organisatie. Ze kunnen ook nuttig zijn bij het in kaart brengen van de samenhang tussen verschillende projecten. De modellen kunnen worden gebruikt om voor alle werkprocessen duidelijk te maken wat de consequenties zijn als bijvoorbeeld een nieuwe virtuele tentoonstelling op het Internet wordt gemaakt of als een nieuwe zoekmachine voor metadata over archieven en collecties wordt gerealiseerd. Ze kunnen een bijdrage leveren aan het verminderen van het risico dat werkprocessen of metadata dusdanig worden veranderd dat zij niet meer aansluiten bij de behoefte van andere werkprocessen. De modellen kunnen ook gebruikt worden om te voorkomen dat nieuwe informatiesystemen in gebruik worden genomen die de organisatie zullen verhinderen om doelen als integratie, standaardisatie en presentatie te bereiken.

Een IKM-model en een ICT-architectuur bieden zeer effectieve kaders om projecten en werkprocessen te beheersen. Ieder werkproces kan aan elkaar worden gekoppeld. Het wordt mogelijk om bijvoorbeeld een checklist te maken voor nieuwe projecten, om te waarborgen dat zij passen in de uitgangspunten van de organisatie met betrekking tot IKM en ICT.

De belangrijkste vooronderstelling is dat het management van de instelling zich bewust is van het nut van het gebruik van deze instrumenten om de kwaliteit van producten en diensten te verhogen. Dit bewustzijn waarborgt dan tegelijkertijd dat die producten en diensten worden ondersteund door werkprocessen die een duidelijke samenhang vertonen. Het is de verantwoordelijkheid van het management om een beleid te ontwikkelen en uit te voeren op de middellange en lange termijn. De gestelde doelen kunnen dan op dusdanige wijze worden geformuleerd en gerealiseerd dat projecten kunnen worden geïnitieerd die consistent zijn met dat beleid.

Aan het slot van dit artikel geef ik een voorbeeld van het Gemeentearchief Amsterdam met betrekking tot het creëren van nieuwe producten en diensten. Het doel op de lange termijn is de beschikbaarstelling van alle beschikbare beschrijvingen van inhoud en context over de archieven en collecties middels één zoekingang op Internet. Deze zoekmachine dient dusdanig te zijn ontworpen dat de meest gestelde vragen van klanten (vragen over een persoon of instelling, over een onderwerp, een locatie of een periode) zo eenvoudig mogelijk kunnen worden beantwoord. Dit is een zeer omvangrijke taak waarin alle werkprocessen moeten worden betrokken.

Het startpunt verschilt zeer van de gewenste situatie, hetgeen een lot is dat door veel instituten wordt gedeeld. Veel metadata zijn niet compleet, niet gestandaardiseerd, niet digitaal beschikbaar in moderne databases en niet aan elkaar gekoppeld. Het Gemeentearchief Amsterdam heeft niet alleen archieven maar bewaart ook grote collectives bibliotheekmateriaal, audiovisueel; materiaal, foto’s, tekeningen en karotgrafisch materiaal. Het realiseren van de integratie van de metadata over dit materiaal met behulp van de juiste standaarden, is een grote uitdaging. Niet alle werkprocessen zijn dusdanig gestructureerd dat geïntegreerde diensten kunnen worden geboden. Teneinde de vereiste metadata te verkrijgen is een aantal projecten geformuleerd. Het eerste project betrof de realisatie van een volledige gegevensverzameling op het hoogste aggregatieniveau in de vorm van een overzicht. Dit overzicht is inmiddels beschikbaar op Internet [8]. Andere zoekmachines op de website zijn nog niet gekoppeld aan dat overzicht. De volgende stap betreft het presenteren van metadata over archieven op lagere aggregatieniveaus. Deze gegevens waren niet beschikbaar in een database. Daarom is een groot data-entry project uitgevoerd. Het resultaat is een database met 350.000 records. Teneinde deze gegevens als een geïntegreerde service aan te kunnen bieden, is het nodig om ze aan te vullen met zoekingangen op namen van personen, organisaties, onderwerpen, locaties en periodes. Dit project kent een tweelingbroer in het backoffice, waar bestaande informatiesystemen dienen te worden gewijzigd of vervangen om de vereiste metadata op een gestandaardiseerde wijze te kunnen samen te stellen en te onderhouden.

In de komende jaren zullen de metadata over context en inhoud van de archieven en collecties worden geïntegreerd in één model. Het gevolg daarvan is dat vrijwel alle bestaande ICT-systemen moeten worden hergewaardeerd, strategische keuzes moeten worden gemaakt over de keuze van standaarden, veel gegevens geconverteerd moeten worden, kwaliteitscontroles plaats moeten vinden en andere werkwijzen moeten worden ingevoerd. Dit alles kan niet succesvol verlopen zonder het hebben en het uitvoeren van een passend IKM-model en een daarop afgestemde ICT-architectuur.

Een Engelse versie van dit artikel is ook verkrijbaar.

Referenties

  1. Mijn speciale dank gaat uit naar Kent Haworth, York University Archivist and Head, Special Collections en Project Director and Secretary, ICA Committee on Descriptive Standards, voor zijn commentaar op een eerdere versie van dit artikel.
  2. In de meeste literatuur wordt de afkorting “IT” gebruikt, mijn voorkeur gaat echter uit naar de afkorting “ICT”.
  3. Wurman, p. 27 en Milner, p. 3. Ik heb het begrip “wijsheid” in dit diagram vervangen door het meer bescheiden “inzicht”.
  4. Vergelijk bijvoorbeeld Gilliland-Swetland, Anne J, Defining metadata, in Baca, p. 3;
  5. Marco, p. 23-24. Marco citeert Inmon, W.H.: Building the Data Warehouse, Wiley, 1996, p. 33;
  6. Marco, p. 48
  7. Applegate c.s., p. 139-140
  8. < http://www.gemeentearchief.amsterdam.nl/archieven_en_collecties/overzicht/introductie/index.nl.html> Link to external resource

Literatuur

  1. Applegate, Lynda, F. Warren MacFarlan en James L. MacKenney (1999). Corporate Systems Information Management. Irwin MacGraw-Hill, Boston.
  2. Baca, Martha and others. (1998) Introduction to metadata, pathways to digital information. Getty Information Institute, New York.
  3. Cook, Terry (2001). Archival Science and Postmodernism: New formulations for Old Concepts, in Archival Science (2001-1), ed. Horsman, P., E. Ketelaar and T. Thomassen, Kluwer Academic Publishers, Dordrecht, p. 3-24.
  4. Getty Information Institute, New York, Art and Architecture Thesaurus
    URL: <http://www.getty.edu/research/tools/vocabulary/aat> Link to external resource
  5. International Council of Archives (ICA), ISAAR(CPF) standard
    URL: <http://www.ica.org> Link to external resource
  6. International Council of Archives (ICA), ISAD(G) standard
    URL: <http://www.ica.org> Link to external resource
  7. Marco, David (2000). Building and managing the meta data repository, a full life-cycleguide. John Wiley & Sons, New York.
  8. Menne-Haritz, A. (2001). Access: the reformulation of an archival paradigma, in Archival Science (2001-1), ed. Horsman, P., E. Ketelaar and T. Thomassen, Kluwer Academic Publishers, Dordrecht, p. 57-82.
  9. Milner, Eileen M. (2000). Managing Information and Knowledge in the Public Sector. Routledge, London.
  10. Records Continuum Research Group, Australia.
    URL: <http://rcrg.dstc.edu.au> Link to external resource
  11. Ribeiro, Christina and Gabriel David (2001). A Metadata Model for Multimedia Databases.
  12. Smit, F.P. (2000). Proposal for a Datamodel of Archival Descriptions, in: Atti del Summit DACE, Roma, 2000, p. 149-196.
  13. Smit, F.P. (2001), Het nieuwe Overzicht van Archieven en Collecties, in: Archievenblad (2001-1), Koninklijke Vereniging van Archivarissen, Amsterdam, p. 26-29.
  14. Society of American Archivists, Encoded Archival Description
    URL: <http://www.loc.gov/ead> Link to external resource
  15. Svenonius, Elaine (2001). The Intellectual Foundation of Information Organization. The MIT Press, Cambridge Massachusetts.
  16. Wurman, Richard Saul (2001). Information Anxiety 2. QUE, Indianapolis.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Author Details

Frans SmitFrans Smit
Hoofd van de Sectie Ontsluiting
Gemeentearchief Amsterdam
PO Box 51140
1007 EC Amsterdam
The Netherlands

Phone: ++31 20 5720227
Fax: +31 20 6750596

<fsmit@gaaweb.nl> Link to an email address
<franssmit@planet.nl> Link to an email address
<http://www.gemeentearchief.amsterdam.nl> Link to external resource

Frans Smit is hoofd van de Sectie Ontsluiting van het Gemeentearchief Amsterdam. Hij is (en was) betrokken bij diverse nationale en internationale projecten met betrekking tot het verschaffen van toegang tot metadata over archieven en collecties door middel van zoekmachines op Internet.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Smit, F. "Het Historisch Data Warehouse", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/warehouse-d/>

-------------------------------------------------------------

Cultivate Interactive Issue 6: News and Events

The content on this page is current at the time of publication (February 2002), but will become out of date. To reach a more recent issue of Cultivate Interactive use the 'Current Issue' link in the top green navigational bar.

News

Information for Museum Experts

ilMuseums logomuseumexperts.com is a Web site for Museum and Cultural Heritage Professionals. It offers both a museum directory listing almost 200 European Museums and a directory of Museum professionals. The site also provides general Museum News and information about calls for tender in the Museum world. museumexperts.com has recently gone bilingual, all the pages now appear in French and English.

Further Information?: Visit the Museum Experts site Link to external resource or contact Jean Francois Grunfeld Link to an email address .

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

SOSIG Expands into Europe

SOSIG (Social Science Information Gateway) is pleased to announce the introduction of a new section dedicated to European Studies.

The new EuroStudies gateway will provide access to Internet resources about Europe as a region, ranging from issues of international security to the introduction of the Euro. In addition, it will cover information provided by individual European countries, with a particular emphasis on those located in Central and Eastern Europe, which are generally under-represented on the Web.

Further Information?: To access the new gateway, go to the SOSIG site Link to external resource or contact Debra Hiom Link to an email address .

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

E-Learning - Must Try Harder

On the surface the performance of e-learning over the last two years looks good. A 70% year-on-year increase in revenues earned by suppliers suggests that we are all taking to e- learning. While that may be true in some countries, in others the growth is comparatively slow. Furthermore a large number of teachers and trainers are seriously worried about their ability to make the most of new technologies.

This mixture of good and bad news typifies the information in two surveys on e-learning published by Cedefop (European Centre for the Development of Vocational Training), the European Union’s centre for information on vocational training. Respondents to the survey E-learning and training in Europe indicate that e-learning is now responsible for over 30% of the earnings of private and public sector training suppliers, compared to 18% two years ago. Figures gathered from a range of e-learning suppliers and consumers across the European Union show Finland, France, Germany and Spain adopting the new methods of learning with enthusiasm but in other countries they are not so popular.

Further Information?: Extracts from both surveys can be found on the European Training Village Web site Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Know any Good jokes?

Gunnel Stjernvall originator of the Renaissance Library Calendar is looking for jokes about librarians and libraries. for example:
"Do you know how many librarians it takes to screw in a light bulb?
No, but I know where you can look it up!"

Further Information?: You can either email your jokes and cartoons to Gunnel Link to an email address or send them by post:
Ms Gunnel Stjernvall
ISIM
Stora Vstergatan 45
SE-271 45 Ystad
Sweden.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Events

Multimedia Archive Preservation Workshop

Where?: Central / West London venue, UK
When?: 20 - 24 May 2002

80% of audio and video archive content is at risk, according to the results of EC project PRESTO (Preservation Technology a project about new technology and processes to make preservation projects more cost effective). Unless preservation procedures are funded and implemented quickly unique heritage and commercially valuable material will be lost. This workshop will provide, in a concentrated three days, the combined experience of ten major European broadcast archives, and the new technology developed by PRESTO.

The workshop will cover funding, selection-criteria, monitoring, the preservation factory, quality, metadata, new technologies, asset management and much more. You will also get tours of actual preservation projects for audio, video and film in the London area and a copy of the PRESTO Multimedia Preservation Handbook.

The cost will be 250 euros for the three days, which includes three lunches and one formal dinner, but does NOT include accommodation. For members of PRESTO, IASA (International Association of Sound and Audiovisual Archives), and FIAT (International Federation of Television Archives) the cost of the conference itself is free but there will be a 50 euros charge for the lunches and dinner.

Further Information?: Email Richard Wright Link to an email address .

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

The European Library (TEL) - The Gate to Europe’s Knowledge Milestone Conference

Where?: Central / West London venue, UK
When?: 29 - 30 April 2002

In addition to print media, which are likely to retain their importance in many areas, networked publications, interactive and multimedia products and services will play an increasingly important role over the next few years. In order to structure the vast flood of information for the benefit of the individual scholar and decision-maker, and ultimately for every private citizen, WWW gateways and portals will need to be created and continuously upgraded. This task requires international co-operation on both technology and content.

Nine European national libraries together with the Conference of European National Librarians are constructing the basis for such a portal concept in the project “The European Library” (TEL). “The European Library” (TEL) is an accompanying measure within the framework of the Key Action 3 of the IST-Programme (Information Societies Technologies) of the EU.

On the basis of the decentralized digital and conventional holdings of the participating national libraries the technical, legal and context platform of a pan-European information server shall be created. In future the users can search all online catalogues of the European national libraries simultaneously with a single inquiry, proceeding from there to access their digital and conventional holdings and services. Definition and clarification of legal issues concerning copyright and licensing as well as a business model play an important role.

The results of the first half period are going to be presented at the international conference. External speakers are going to supplement and to comment the Tel presentations in five topics.

The topics:

Participation is free of charge.

Further Information?: Visit the European Library site Link to an email address .

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Cultivate Interactive Issue 6: Jobs Section

The content on this page is current at the time of publication (February 2002), but will become out of date. To reach a more recent issue of Cultivate Interactive use the 'Current Issue' link in the top green navigational bar.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Position?: Assistant Librarian - Collection Management Electronic Resources Librarian
Company?: Glasgow Caledonian University
Closing Date?: 22 February 2002

A full-time permanent post exists for an Assistant Librarian in the Collection Management section of the University Library. The successful candidate will be responsible for the management of the Serials team (which deals with serials in both print and electronic forms) within Collection Management; managing all Collection Management aspects of electronic journal and electronic book developments and provision; and managing the provision of Reading List materials. The postholder will also be required to do Enquiry Desk duties as part of a rota.

A thorough awareness of current developments in all Collection Management aspects of eis provision in an academic library environment, including the delivery of core text materials is essential, as are excellent team management, interpersonal, and communication skills. Previous experience of journal (print and electronic) processes and management is also required. The requirement to work one evening per week until 9pm (with the morning off) is part of the post.

Information packs can be obtained from:
HR Department,
Glasgow Caledonian University, City Campus,
70 Cowcaddens Road,
Glasgow,
G4 0BA
UK
Phone: 0141 331 8890,
Fax: 0141 331 8868,
E-mail: r.graham@gcal.ac.uk Link to an email address

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

-------------------------------------------------------------

Cultivate Interactive Issue 6: Misc.

Articles

Other Misc. Items

-------------------------------------------------------------

Virtual Reference: A Letter from North America

By Michelle Fiander - February 2002

Michelle Fiander, Manager of the Web Reference Center, writes us a letter from North America explaining what Virtual Reference is and how it came about.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Hi, I’m Michelle Fiander. I manage the world’s first (at least as far as I know, it’s the first…) Web Reference Center for libraries. At the Web Reference Center librarians answer questions from library patrons who have logged into a virtual reference service, usually through their local library’s Web page. The Center operates 24 hours a day, 7 days per week, and we are available to respond to questions from any geographic location at any time. Given the novelty of the Web Reference Center and virtual reference services, I thought I’d tell you a bit about the technology and process behind the Center.

Virtual reference (also called Web reference and real time reference) started a couple of years ago when libraries began experimenting with Web-based customer service software originally designed for answering questions on ecommerce sites like LL Bean [1] and Land’s End [2]. This software, often referred to as Web Contact Center Software, allowed a Customer Service Representative (CSR) to communicate with a customer by phone or by Chat while at the same time sending relevant Web pages to the customer. The result was that the CSR and customer could discuss a product while looking at images or textual information about it. In addition, some of this software also incorporated the ability to send documents in just about any format to the customer, and to annotate documents with or for the customer. Of most interest to librarians was the provision of synchronous communication and sharing of Web-based and other information.

Librarians were interested in the capabilities of this commercial software because, as most of us realized, in-house reference statistics were, and still are, dropping [3]. Further, with the advent of the Internet and Web-based delivery of formerly print resources, library users are relying more and more frequently on information accessed via computer—whether it is from the library or not. Given the typical library patron’s reliance on and interest in retrieving as much information as possible online, and given librarians’ collective awareness of this situation, it’s not surprising that Web Contact Center Software interested some librarians. These librarians quickly envisioned themselves slipping into the Customer Service Representative seat, but instead of responding to questions about the size, color and utility of products, the librarian would answer typical reference questions in a show-and-tell environment that mimicked in-house reference services. Barely two years after these wishful thoughts entered our minds, many libraries, largely in the US--but also in Canada and Australia—have created virtual reference services. Depending on how you classify and count these services, anywhere from 300 to 500 libraries are offering virtual reference services that go beyond email based Ask-A Librarian services [4].

The software being used to support virtual or real-time or Web reference services varies in its power, features and flexibility [5]. For example, some libraries are using software that allows them to send Web pages, but which does not allow the librarian and patron to view the page together. Other libraries are using software that only supports Chat communication, but not Web page or document pushing or sharing. In some cases, the software being used by libraries to support virtual reference is out of the box and not modified extensively for library use. In other cases, software providers, intermediary companies or individual librarians are modifying software to better support the special needs of library reference work.

Software developers, to greater and lesser degrees, are modifying their software for library applications. At LSSI, for example, we are striving to make the software work better in a library environment and are concentrating on issues related to both programming and design. For example, we are building reference tools to support co-browsing of proprietary databases because this issue is vital to good reference service for almost any type of library.

The Virtual Reference Desk, courtesy of one of Steve Coffman's presentations given at Virtual Reference Desk
2nd Annual Digital Reference Conference
The Virtual Reference Desk, courtesy of one of Steve Coffman's presentations
given at The Virtual Reference Desk 2nd Annual Digital Reference Conference

As the field has grown, virtual reference discussions and lectures have become very popular at library conferences, and there is even a conference devoted exclusively to the issue. The VRD (Virtual Reference Desk) Conference [6], begun only three years ago, has become a primary venue for librarians and educators interested in virtual reference. Although the conference is in its infancy, the number of attendees has grown each subsequent year. Further, the conference gets great reviews, and speakers are interested in the topic and enthusiastic about promoting virtual reference strategies. Anecdotal as my account of VRD is, it indicates a strong interest among a significant group of library professionals looking for a way to reach out to library patrons in a variety of settings for a variety of purposes.

Despite the growth of virtual reference, not everyone is keen. On listservs, in discussions with colleagues, and at conferences, I’ve heard nay-saying which presumes an awful lot about virtual reference. Many of those who discount virtual reference, for example, think it will thrive simply because it is based on a hot technology. However, if virtual reference thrives, I doubt it will be simply because it a technological response to information needs. It will thrive because patrons use it. The train took over the horse; the car took over the train; but we still use trains and horses. Regardless of how virtual reference develops, and I hope it does because it’s mighty convenient, there will continue to be a role for in-house reference services and face-to-face interactions for quite some time. I’m not much of an either/or thinker and believe there is room for both virtual and face to face reference and that both have a valuable role to play.

As far as where my interest lies right now, it lies in improving and developing virtual reference, and the Web Reference Center. Of primary importance to me as I do this are meeting the needs of the patron and doing a good job as a reference librarian. To do this, I’ll continue to follow the rules that have guided my in face-to-face reference services: help patrons as they need to be helped; indicate the source of an answer or any information shared; be courteous and helpful; keep as informed as possible about resources and techniques to respond to queries.

Even though I’m working in a new and innovative arena, I see virtual reference neither as panacea nor as something to be dreaded. Virtual reference is challenging some of our traditional methods of doing reference, but even before virtual reference arrived on the scene, methods of information delivery, information itself, and information seeking behavior were changing before our eyes; and libraries have, for the most part, embraced this change by, for example, developing digital libraries and extensive collections of electronic materials. Despite this, we have not dramatically changed our methods of reference service [7]. Just as librarians have adjusted collection development policies and budgets to accommodate electronic materials, we must now adjust the ways we assist patrons who prefer to, or must, work online.

Before I sign off, I hope you won’t mind if I reminiscence…just for a moment. When I was in library school, seven and a half years ago, our director was fond of saying something to the effect of: “We’re not the only game in town anymore.” While I understood at the time that the ‘we’ encompassed libraries and librarians, and that the ominous, shadowy ‘other’ implied in his statement was the Internet, I had no idea how prophetic his statement would be. At the time I was baffled by the Internet and could barely comprehend, let alone use, Gopher or Veronica. Would this thing catch on, I asked myself? Heck, no! At least not with this semi-computer-literate, fledgling-librarian. But then, during the summer between my first and second years of the MLS program, the Web and Netscape appeared! Imagine my surprise when I found a gene sequence for a researcher at the medical library where I was interning by cleverly using Netscape! Easy? You betcha! Within a period of months, my opinion of the Internet as a valuable information resource changed.

Cleveland Public Library's Know It Now
virtual reference service. CPL has a very active service which includes
Homework help as well as service for adults. Further, CPL has established a
partnership with Cleveland area hospitals so that health related questions
are answered by health professionals, nurses, rather than librarians.
Cleveland Public Library's Know It Now virtual reference service.
CPL has a very active service which includes Homework help as well as service for adults.
Further, CPL has established a partnership with Cleveland area hospitals so that
health related questions are answered by health professionals, nurses, rather than librarians. [8]

The Internet is still not the source of every thing, but it is the source of a lot. Add to this the fact that the Internet as place has given way to the Internet, via the Web, as a delivery mechanism, and the possibilities are expanded dramatically. The same thing will happen with virtual reference. Right now it’s new. It has utility, but it’s not perfect. As time passes, the technology to support virtual reference will become easier to use, just as the Internet became easier to use. Along with this, users’ skill levels will rise, just as the skill level of Internet searching has risen during the past ten years. Further, just as our expectations of the Web and its concomitant technologies have risen, so too will our expectations of virtual reference software and virtual reference interactions. Even in these early stages of virtual reference, plenty of people and organizations are working in collaborative situations. The Library of Congress Collaborative Digital Reference (CDRS) Project springs to mind, as do the many library consortia gathering together to pool financial and staff resources in order to support virtual reference. And, finally, some Master of Library and Information Science programs are beginning to think about teaching virtual reference in their regular curriculum. Eventually virtual reference and collaborative virtual reference will be a de facto part of the library landscape.

That’s all for now. So long!

References

  1. L. Bean
    URL: <http://www.llbean.com> Link to external resource
  2. Lands’ End
    URL: <http://www.landsend.com> Link to external resource
  3. See report from Association of Research Libraries
    URL: <http://www.arl.org/stats/arlstat/99intro.html> Link to external resource
  4. 4. For estimates on the number of libraries offering virtual or real-time reference services, see the following Web sites: Stephen Francoeur's The Teaching Librarian
    URL: <http://pages.prodigy.net/tabo1/> Link to external resource and Gerry McKiernan’s LiveRef(sm): A Registry of Real Time Digital Reference Services
    URL: <http://www.public.iastate.edu/~CYBERSTACKS/LiveRef.htm> Link to external resource . Note that although these are sites are terrific resources, they are not definitive.
  5. For an excellent review of the types of virtual reference software available see Steve Coffman's "So you want to do Virtual Reference" Public Libraries, E-libraries (Sept./Oct. 2001) p. 14-20.
  6. According to a conference organizer, Marilyn Schick, the number of attendees has grown from 250 (1999) to 400 (2000) to 430 in 2001; the decrease in travel following 9/11 is suggested as a reason for numbers not being higher in 2001.
    URL: <http://www.vrd.org/> Link to external resource
  7. Until virtual reference arrived on the scene, roving reference and active reference service were innovative responses to the changing needs of library users. See: Anne Lipow. “In your face reference service. Internet is changing the nature of reference. Library Journal v. 124, no. 13 (Aug. 1999) p. 50-52; and Martin P. Courtois and Maira Liriano. “Tips for roving reference: how to best serve library users. College and Research Libraries News v. 61, no. 4 (Apr. 2000) p. 289-90, 315.
  8. Active virtual reference services - how libraries are implementing it:
    South Jersey Regional Library Cooperative's Q and A NJ
    URL: <http://www.qandanj.org/> Link to external resource
    Denver Public Library's Smarty Pants virtual reference service
    URL: <http://www.denver.lib.co.us/smartypants.html> Link to external resource
    Ask Your Library service from a group of Connecticut Libraries
    URL: <http://www.askyourlibrary.org/> Link to external resource

Author Details

Michelle Fiander
Manager, LSSI Web Reference Center
20250 Century Blvd. Suite 200
Germantown, MD 20874-1114
USA

michellef@lssi.com Link to an email address

Michelle Fiander holds an MLS and has been Manager of the Web Reference Center at LSSI since May 2001. Prior to work in virtual reference she was an academic reference and cataloging librarian.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Fiander, M. "Virtual Reference: A Letter from North America", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/reference/>

-------------------------------------------------------------

Chodovec: The New Prague Archive Building

By Borivoj Indra and Vladimíra Hradecká, photos Miloš Klimeš - February 2002

Borivoj Indra and Vladimíra Hradecká talk about a recently opened monumental archive building located in the southern area of Prague, and the facilities it will offer for Czech archives. Photos by Miloš Klimeš.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Introduction

In September 2001, a monumental archive building was completed in Prague which is attracting interest from archivists and other experts from a number of European countries. It is intended that this building will accommodate a third of the archive treasures of the Czech Republic. It is currently being jointly used by the Central State Archives, in whose care are the records of historical and current central authorities and institutions of the Czech state, and the Regional State Archives in Prague, which has has been entrusted with the care of records from the Central Bohemian region and, for commercial matters, from the Capital City of Prague as well. Situated immediately next to this building is the new residence of Archives of the Capital City of Prague, completed in 1997, which also has certain technical equipment in common. The entire complex of these three large and important archive stores create what is known as the Archive Premises of Prague 4 – Chodovec.

An Overall view of the strongrooms and the back side of the administration building
An Overall view of the strongrooms and the back side of the administration building

Having this concentration of archive collections with their excellent facilities for scientific work has considerably simplified research by the public in studies of historical sources. There is now no time lost moving between individual places of work and strongrooms, which in turn creates ideal conditions for further ways of using these records. Proof of this was the overwhelming interest from the public for the six-week exhibition Treasures of Czech History 1158 – 1990, held in Autumn 2001 to celebrate the commencement of operation of the Archive Premises. This new archive building has deservedly found its place as one of the most important and largest archive buildings in the world, alongside others such as the new National Archive USA II – Maryland, the Federal Archive in Koblenz, the new buildings of the National Archives in the Haag, Madrid, Kew in London, Vienna, Budapest and so on.

The Crowning Achievement of Czech Archivists

Through a number of favourable co-incidences a great number of archives have been preserved in the Czech Republic in a relatively complete state, even though until recently these records were stored in a wide variety of what were mostly antiquated and unsuitable buildings. These buildings frequently failed to fulfil even the basic requirements placed upon archive buildings at the end of the twentieth century. A significant change in the physical provision of security for stored records has only happened in the last ten years, after the dramatic political and social changes at the close of the 1980s and beginning of the 1990s. Since 1992, 42 buildings of the state district archives – out of a total of 74 - have been newly built or renovated. In this way, not only was the lack of storage space resolved (there used to be no place to receive records), but above all conditions for their future preservation has been radically improved. These new buildings have in general been constructed in compliance with modern requirements for the total protection of records, with a controlled interior environment and all necessary security measures. At the same time, these buildings create excellent conditions for visitors coming to the archives to study records, as well for the all-purpose use of historical documents, providing - for example - well-equipped lecture and exhibition halls. Of no less importance, the frequently sub-standard working environment for the work of archivists has also been transformed.

The Archive Premises at Chodovec is quite unique amongst these new archive buildings. It has become the crowning achievement of the labours and expertise of entire generations of Czech archivists. It is also successful due to the fact that inspiration and experience could also be drawn on from abroad, where the construction of modern archive buildings was pursued en masse as early as the 1970s. Paradoxically, this delay, in comparison with the international trend, has brought the Czech Republic a certain advantage: there were sources that could be used, inspiration that could be gained and, in addition, knowledge of what to avoid. It is clear that, in some cases and mainly through pressures from limited finances, mistakes were made and that these should be assessed and learnt from for later constructions. Experiences up to now in the use of the strongroom blocks of the Archive Premises, which were brought into operation as early as in 1996, being much earlier than the administrative and technical building, have so far shown however satisfying results.

Chodovec's Vital Statistics

The position of the premises in the Prague 4 district, 285 metres above sea level and well away from the flood plain, has been well chosen. The site of the construction is of a sufficient area (the entire Archive Premises lie on 4.37 hectares) to enable for it not only to be surrounded by green park areas, but also for it to provide possibilities for further development in the long-term future. Its proximity to Metro Route C and frequent bus connections allow the premises to be reached from the city centre within thirty to forty minutes using public transport without any great difficulty. Getting here is also made easier by road connections to the main motorway between Prague and Brno, as well as to the city circular road route.

An Overall view of the main entrance and administration building with lecture halls, study rooms, laboratories
An Overall view of the main entrance and administration building with lecture halls, study rooms, laboratories

Conceived on a vast scale, this construction takes up almost 12,000 square metres of building area and practically 286,000 cubic metres of interior space. Its three strongroom blocks (RB I, II and III) are clearly visible from far away, with a single floor below ground level and a highest thirteenth floor purely for technical equipment. The maximum height of the building is 42 metres. Whilst to the east of the strongroom blocks there stands the equally high grey building of the Archives of the Capital City of Prague, on the west side there extends the broad expanse of the outlying five-floor administrative and technical building for both archives. The strongroom blocks have been designed in such a way as to minimise the distance between the strongroom rooms and the research rooms, which are located in the administrative building. The central axis of the construction is strongroom block RB I, connected to the administrative and technical building by a five-floor joining “neck”. On the opposite side to this axis lie the strongroom blocks RB II and RB III in the shape of a letter T. All three of these strongroom block wings are connected by a main service corridor that has two service lifts and access to a fire emergency staircase joining together all the floors. On the East side of the main connecting corridor from the second to the twelfth floor above ground, there are light and spacious arrangement rooms and work rooms, forming “secondary work areas” that archivists use for making records and viewing archive materials. Here they can carry out retrieval and arrangement work without even taking the records to the work areas in the administrative block. In this way, there is no unnecessary movement of records and, also importantly, it saves time for these specialists.

The Construction of the Building

Detail of the reserves building with a fragment of the administration building. The construction design of both the strongroom blocks and the administrative and technical parts takes into account not only their functional use but also the safe storage of records, providing maximum protection against influences that could endanger them. The most important parts of the archives are the strongrooms. With only slight variations, they have a floor area of 183 square metres and ensure the necessary load-bearing capacity. In strongroom block RB I, which is equipped with mobile shelvings, on each floor there are 6 halls, each with two-sided racks which are twice 40 cm deep, 300 cm wide and 206 cm high. Each rack has 6 stacks (back-to-back in threes) and in each of these there are 6 easily adjustable shelves. On each floor of the RB II and RB III strongroom blocks, there are four strongroom sections (i.e. a total of 8), which are equipped with fixed racks of lengths 500 cm and 600 cm. Its main aisle is 120 cm wide and the aisles between individual racks are 80 cm wide. These fixed racks, which are 232 cm high, have either 5 or 6 s with sets of seven adjustable shelves. The manufacturer of all these racks, which have proved themselves well during use, is the company Ramet, s.r.o. from Kunovice near Uherské Hradiste. The theoretical storage capacity of the strongrooms is approximately 1100 metres for fixed racks and 1668 metres for moveable ones. The actual storage capacity is, however, somewhat less due to differences in the dimensions of archive boxes and books. Certain depositories are specially equipped for storing maps and plans. On the 12th floor above ground, there are halls that have been specially adapted in order to accommodate photographic and video materials. Those basement strongrooms that are designated as the quarantine zone have been designed for the temporary storage of records before dust is removed from them or if they have to be disinfected. In the depot sections of the construction, the Regional State Archives use the sixth to the eighth floors, one cold storage strongroom on the twelfth floor for the storage of photographic material and one hall in the basement. In total, there are 153 strongrooms for standard records and of these the Central State Archives use 113 and the Regional State Archives 40. There are 29 halls for non-standard records and libraries of which the Regional State Archives uses three, the rest being used by the Central State Archives. The actual capacity of all 153 strongrooms with either fixed or mobile shelvings for standard records is approximately 220,000 metres. Of this, the Central State Archives can use approximately 163,000 metres and the Regional State Archives around 57,000. At the time operation was commenced in the building, there were around 135,000 metres of records stored in the strongrooms, of which approximately 105,000 were in Central State Archives sections and around 30,000 in Regional State Archives sections. Further records continue to be added progressively.

In the strongroom wing RB II, areas designed for receiving records have been built on. Next to a small gatehouse, there are three ramps with an adjustable hydraulic loading platform for the easy transfer of records from vans directly to two spacious receiving halls. The larger of these is used for the Central State Archives and the smaller for the Regional State Archives. Adjoining these are small offices for cataloguing and other administrative matters connected with the acquisition process. Each hall is equipped with three dust extractor desks with three-stage air filtering. When these are not sufficiently effective, for example for records with high dust levels, traditional manual cleaning methods have to be employed using high-powered vacuum suction devices.

An integral and very interesting part of the ground floor section of the building are the fumigation areas with their management and control centre, which is used jointly by both archives. The unsuitable storage conditions of the previous decades mentioned above have meant that approximately a quarter of the records are affected by mildew which degrades paper. The fumigation centre is equipped with two pressure autoclaves made by the Spanish company Matachana. They have a volume of 6.2 square metres and, when in operation continuously, carry out fumigation using 10% ethylene oxide diluted in a mixture with 90% CO2 (trade name: etoxen). This capacity level is currently the largest of all archive buildings in the world. Due to the fact that ethylene oxide is classed as a substance that is harmful to health, great attention is paid to the safety of work carried out with it. Part of this fumigation line is equipment which performs sensitive measurements with great accuracy and ensures the airing of the records and the ecologically safe disposal of chemical products. This entire process lasts for seven days, during which on two lines adjacent to each other during a single 24 hour period, up to 70 metres of records held in special cages can be treated. The disinfected records are then for the following six days gradually aired in special tunnels until the residual ethylene oxide is reduced to a hygienically admissible level. The whole process of fumigation is controlled by a technically flawless monitoring system, which evaluates data about ethylene oxide concentrations. In the unlikely event of a breakdown involving a leak of ethylene oxide, it is able to react automatically and stop the process immediately. After seven days, the records are taken away from the other end of the line by service lifts to their final place of storage in the strongrooms.

Risk Reduction

The architectural and construction engineering design of the strongroom blocks was governed entirely by the need to safeguard the stored records. Its whole enormous weight rests on reinforced concrete foundation blocks 1500 mm thick, which lie in foundation basins. These are protected against any aggressive effects of groundwater by a coating on the outside and high-pressure isolating PVC inserted on the inside. As a covering for the depository building, a sandwich construction is used with an air gap covered by ceramic cladding. The development of best practice in the construction of similar buildings has shown that this ensures a high level of temperature constancy, as well as reducing operating costs for air conditioning. At the same time, however, this also preserves the diffusion properties of the walls for offsetting any fluctuations in relative humidity inside the halls. The bright colours of the large ceramic panels that create its facing give the building a unique character and provide an appropriate visual lightening to the otherwise enormous mass of the windowless strongrooms buildings. Articles in the press predominantly assess it positively as being “the most cheerful construction in Prague”. The actual strongrooms are fully air conditioned, with a separate system in use for each floor, allowing air conditioning to be regulated in accordance with immediate requirements. The internal environment of each individual hall can be finely adjusted using ventilation that is independent of the air conditioning system. The optimal constant temperature of 15·C ± 2·C with relative humidity at 55% ± 5% is ensured by an air conditioning system with measuring and control equipment supplied by the company STAEFA CZ. For colour photographic material, a temperature of 5·C ± 1·C is maintained and for black and white 12·C ± 1·C, with relative humidity in both cases at 35% ± 5%. All these parameters comply with the recommendations of the International Archives Council. Even though the building is situated in an area that has low levels of air pollution, any intake of outside air into the system is kept to a minimum. The air conditioning functions as a closed circulation system. Only occasionally, and mostly at night when air pollution is low, can approximately a 5% intake of air be made.

The fire prevention system in the strongroom blocks is unique. The Swedish company Wormald installed a fixed fire extinction system inside them that uses a gas which is not harmful to health, known by the trade name inergen (a mixture of oxygen, carbon dioxide, nitrogen and argon). Electronic fire signalling has been installed throughout the entire building equipped with self-activating and push-button fire extinguishers, connected to a main control centre. In the case of a fire alarm, after a specified time delay during which employees should leave the building, fire doors to each repository room and to central corridors of each strongroom block close automatically, separating these areas from the main service corridor and the arrangement rooms and work areas. In addition to the doors, fire proof flaps are also shut and the ventilation switched off throughout the entire floor. Fire extinguishing gas is fed under pressure through a pipe distribution system into strongrooms where the danger of fire has been detected. The oxygen content of the hall is thus reduced from 21% to 11%. This is a reliable means for putting out the fire without endangering persons who were not able for any reason to leave the strongrooms. In addition to the central fire escape staircase, there are three more emergency staircases in the repository room sections for evacuation purposes and these are connected to the end of each strongroom block. Specialised fire fighting equipment is an integral part of the fumigation line process in the airing tunnels that follow the fumigation chamber. The other areas of the site, including the administrative and technical buildings, are safeguarded using conventional fire fighting devices such as hydrants, dry risers and hand-held fire extinguishers, in addition to the fire prevention construction elements prescribed by law.

A view into the non-mobile shelvings where the documents are housed
A view into the non-mobile shelvings where the documents are housed

The expert conservation, restoration and reprographic work areas of both archives play a crucial role in the care of the records. In particular, the contribution made by the central laboratory of the Central State Archives goes beyond its internal remit and it serves as a national archive research, training and consultation centre. The smaller studio of the Regional State Archives is also equipped with advanced technical equipment and qualified staff. The workshop here serves not only its own archive, but also restores and conserves threatened record materials from twelve state district archives, with in the future this being extended to include company archives from Central Bohemia and the Capital City of Prague.

Due to the fact that the archives and the archive records stored in them are used for researches by the general public, the entire building has been designed in such a way that the possibility of damage or theft of records is as small as possible, whether this concerns the research rooms, which are always a kind of sensitive point in every archive, or in the case of entry by unauthorised persons. If this were to occur, the building is protected by an electronic system with automatic alarms. All entry points and exposed places are controlled by closed-circuit television and there are guards on duty in the building 24 hours a day. This is an absolute necessity due to the importance and irreplaceable nature of the records which are cared for here.

The new administrative operating part of the building is separated from the strongrooms by a joining “neck”. Situated next to the emergency staircase are a large-capacity service lift and two smaller passenger lifts which are used for transferring records to research rooms on trolleys. Following much consideration and on the basis of experiences with the use of automated transport systems in certain buildings abroad, the transport of records to research rooms using containers moving along small tracks was rejected.

Public Visitors

Areas accessible to the general public are strictly separated from those which are designated for the transport, access to and handling of records. This conforms with the principles which are applied in the absolute majority of large modern archive buildings. Visitors have a large entry hall (610 m2) available to them, which serves a whole range of purposes. Both visitors and staff come into the entry hall past a reception desk that is responsible for control and registration duties. At the information service desk, visitors are given initial information necessary for subsequent direction-finding, in accordance with their requirements. On the opposite wall, there is a clear map of the administrative building which marks areas used by the individual archives. Visitors can here read about the main activities of both archives and about the important archive collections that they administer. Visitors from the general public go on to the cloakrooms and then further on to certain of the research rooms or library. Other visitors are directed to places for meetings with archive staff, according to their needs. In this case, they await the arrival of the archive staff member who will look after them and who takes them to one of the discussion rooms on the ground floor, or if required to his or her own study.

The lobby in public entrance hall
The lobby in public entrance hall

From the entrance hall, visitors can reach either the exhibition or conference hall (for 235 persons), which are used jointly by both archives. For smaller events, in particular for schools and for experts from the general public, there are two lecture halls of which one is for the Central State Archives and is for 91 persons and the other for the Regional State Archives for up to 64 persons. Situated behind the large conference hall, there is the registry filing office of Central State Archives and cloakrooms for the joint use of researchers. In these, these are 170 lockers and here visitors to the study rooms can store outer garments and bags under their own lock and key. As is usual in all archives, only pencil and paper are allowed into the research rooms, as well as possibly laptop computers, and, when this is justified, digital cameras, Dictaphones, etc. The entrance hall also serves as a rest area for researchers, who frequently spend many weeks in the research rooms. A pleasant, brightly-coloured environment contributes to this with club chairs, in the middle of which is a fountain with a small water jet. Refreshments are provided in a cafeteria that has several tables. There is also a well-ventilated room here for smokers.

The research room of the Central State Archives is designed for 103 researchers. There are 81 places available to them at tables for researching standard types of archive materials and ten places at large tables for researching archive materials of atypical sizes, in particular maps and plans. There are twelve places in individual boxes for researchers who have specific technical research requirements. High levels of demand have made necessary the enlargement of the Regional State Archives research room with a total of 54 places at tables, of which seventeen of the largest are equipped with reading devices for researching microfilm. On this material there are above all copies of old registry documents (approximately from the 1780s and older). As a result of ever increasing interest from the non-professional general public in genealogical studies, these are the most often requested archive records and the physical condition of their originals would be endangered by frequent handling by the public. In the smaller research room of the regional archives, all tables are also provided with lighting which the researchers can themselves adjust according to their needs. On entry to the research room, each visitor receives necessary specialist information from staff of the information desk service, registers and requests the handing over of records, which he or she would like to research. Once handed over, these archive materials are entered into the relevant research records and on their return they are checked and only after this is their return confirmed. Lending materials outside the archives is absolutely forbidden, as is the general practice in all archives. Adjoining both research rooms are rooms designed for the reservation of archive materials and microfilm, as well as areas equipped with reprographic devices, etc. Researchers are not allowed into these areas. Next to the research rooms, there are also rooms for specialist consultation on complicated enquiries, where visitors receive required information from expert members of staff of the archives.

From the entrance hall, next to the entrance to the study room of the Central State Archives, there is a special lift and a staircase to the first floor, where the reading room of the specialist archive library is located for specific archive subjects. It is also possible to read the daily press and selected magazines here. The system of operation here is similar to that in the research rooms. In addition to this reading room, both research rooms are provided with a reference library, which contains the most commonly used literature necessary for orienting oneself in the research of archive materials. Visitors to the Central State Archives research room can study historical literature in the study area of the library in the above-mentioned reading room and in the Regional State Archives they can order them together with archive records to be sent to the research room.

In the basement, there is a pleasantly designed refectory that provides catering for staff of both archives, where at lunchtime hot food is served at special prices. Also below ground level there is a small exercise gym, as well as warehouses, garages and other services areas. Visitors and staff with vehicles are provided with three relatively spacious parking areas, with a capacity of up to 140 cars.

References

The authors have drawn on the study by Kalina, T.: The New Building of the Central State Archives and the Regional State Archives in Prague – Chodovec. In: Central State Archives and its Premises. Praha 2001.

Author Details

Borivoj Indra
President of the Commission on the New Archive Buildings of the Czech Republic

Vladimíra Hradecká
Head of Service Departement of the Regional State Archives in Prague, President of the Czech Society of Archivists, Member of the National Node of the Cultivate

Miloš Klimeš
Photographer of the Regional State Archives in Prague

Contact: Jana Souckova
NpM Prague

dir.npm@aconet.cz Link to an email address

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

For citation purposes:
Indra, B. and Hradecká, V. "Chodovec: The New Prague Archive Building", Cultivate Interactive, issue 6, 11 February 2002
URL: <http://www.cultivate-int.org/issue6/prague/>

-------------------------------------------------------------

Cultivate Interactive Competition - Spot the European City

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Time again for a bit of light relief with the Cultivate Interactive Competition.

Below are two sets of four pictures. Each set of four represents a different European city. All you have to do is decide which cities are being shown.

City 1

City 2

The answers should be sent to cultivate-editor@ukoln.ac.uk Link to an email address before the closing date of 30th May 2002. Names will be drawn out of a hat and the winner will receive a book token. Good Luck!!

Issue 5 Winner

The winner from issue 5 was Evelyne Porri from the Cultural Policies Research and Development Unit, Strasbourg, France. Congratulations!! A book is on its way to you.

The answers were:

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

-------------------------------------------------------------

Cultivate Interactive Issue 6: Jobs Section

The content on this page is current at the time of publication (February 2002), but will become out of date. To reach a more recent issue of Cultivate Interactive use the 'Current Issue' link in the top green navigational bar.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Position?: Assistant Librarian - Collection Management Electronic Resources Librarian
Company?: Glasgow Caledonian University
Closing Date?: 22 February 2002

A full-time permanent post exists for an Assistant Librarian in the Collection Management section of the University Library. The successful candidate will be responsible for the management of the Serials team (which deals with serials in both print and electronic forms) within Collection Management; managing all Collection Management aspects of electronic journal and electronic book developments and provision; and managing the provision of Reading List materials. The postholder will also be required to do Enquiry Desk duties as part of a rota.

A thorough awareness of current developments in all Collection Management aspects of eis provision in an academic library environment, including the delivery of core text materials is essential, as are excellent team management, interpersonal, and communication skills. Previous experience of journal (print and electronic) processes and management is also required. The requirement to work one evening per week until 9pm (with the morning off) is part of the post.

Information packs can be obtained from:
HR Department,
Glasgow Caledonian University, City Campus,
70 Cowcaddens Road,
Glasgow,
G4 0BA
UK
Phone: 0141 331 8890,
Fax: 0141 331 8868,
E-mail: r.graham@gcal.ac.uk Link to an email address

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

-------------------------------------------------------------

Cultivate Interactive Issue 6: Links Section

Who is linking to Cultivate Interactive Web magazine?

In this section we will review some of the sites that have chosen to link to us. If you would like to be mentioned in the next issue then please Contact Us.

If you would like to see how many sites are linking to Cultivate Interactive have a look at Link Popularity Link to external resource

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Web Site Name?: Resource News
URL?: <http://www.resource.gov.uk/news/> Link to external resource
Description?: Resource is the Council for Museums, Libraries and Archives in the UK. It provides strategic leadership, advocacy and advice to enable museums, archives and libraries to touch people's lives and inspire their imagination, education and creativity. The News section includes press releases, articles and snippets.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

Web Site Name?: Webgenz
URL?: <http://www.webgenz.com> Link to external resource
Description?: Webgenz is an object-oriented content management system and web site development tool for Windows. Webgenz generates web site documents (HTML, ASP, JSP, PHP, etc) from reusable templates and content macros. Using a practical, file-based approach, Webgenz allows you to take advantage of powerful content management features. You may find it useful if you read the article about Content management systems in our last issue.

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

-------------------------------------------------------------