Functional Requirements

From DigiRepWiki

[ Home \| Functional Requirements \| Application Model \| Application Profile \| Community Acceptance Plan \| Mapping to Simple DC \| XML Format

1 Scope
2 Stakeholders and designated community
- 2.1 Designated Community
- 2.2 Stakeholder community
3 Requirements gathering
4 Functional Requirements Specification

[edit]

Scope

The document offers a functional requirements specification for the SWAP. An analysis of the community served by the profile and an indication of the methodologies used are also included.

From the JISC specification for the work:

Metadata:
- In scope: DC elements plus any additional elements necessary
- Out of scope: other metadata formats
Identifiers:
- In scope: Use of identifiers to link from description to related files (eg full text files such as pdf, HTML, etc); also use of identifiers for the description itself, for related resources etc.
- Out of scope: Other uses of identifiers.
Controlled vocabularies (subject classification, name authority, etc):
- In scope: Ensuring the application profile is hospitable to the use of a variety of subject access solutions e.g. classification schemes, controlled vocabularies, name authority lists
- Out of scope: decisions on terminology solutions
Complex objects:
- In scope: Establishing an understanding of existing work in this area and prioritising requirements
- Out of scope: decisions on how to model complex objects
Additional search entry points e.g. Repository of origin
- In scope: inclusion of properties required to fulfil other search requirements such as institution of origin, research funder, national and regional views. These requirements will be provided by RDN.

In addition:

Citations and references
- In scope: Bibliographic citations for eprints and document references citing other works
- Out of scope: Citation analysis solutions

[edit]

Stakeholders and designated community

[edit]

Designated Community

Implementers of UK Institutional Repositories search service (http://www.intute.ac.uk/projects.html) - Intute, UKOLN, Sherpa
Managers and administrators of UK eprint repositories
Implementers of the Prospero interim repository (http://edina.ac.uk/projects/prospero/index.html) - Edina, Sherpa

[edit]

Stakeholder community

The following have a wider stake in the work and need to be engaged in order to ensure that the Designated Community is targeted.

Repositories search service
- Implementers of this service
- Users of the search service
JISC Digital Repositories Programme
JISC Capital Programme - Repositories and Preservation
Eprint repositories community in the UK
- Repository managers, administrators and technical staff
- Software developers of repository software (e.g. eprints.org, DSpace, Fedora)
JISC
DCMI
Other aggregators and search services, e.g. IRIScotland, PerX, ARROW
Other funding bodies

[edit]

Requirements gathering

[edit]

Methodology

Review conclusions from Eprints UK
Identify Issues with current use of simple DC
Review existing practice
Review existing or proposed application profiles
Discussion and input from the working group, feedback group, wider community
Gather/write scenarios and use cases

[edit]

Conclusions from Eprints UK

The final report from the Eprints UK project contained a number of conclusions relevant to the current work (Final Report).

These included the following:

Technical barriers to successful aggregation of metadata from institutional repositories
- issues with the quality of metadata
- the consistency of metadata
- the handling of complex objects
- the lack of a common approach to linking to full text

"The project addressed these to some extent by proposing a Dublin Core eprints application profile. However, adoption of this profile could not be a priority for the FAIR programme as most projects of necessity concentrated on establishing and populating their archives."

Issues with the simple Dublin Core profile:
encoding the location and type of the full-text file
the meaning of each field can be interpreted in various different ways

"Simple DC is not targeted at describing eprints specifically so there is more to the description of an eprint than simple DC will allow. To get round these limitations of simple DC, some repositories try to put more information than necessary into the Dublin Core fields. This varying use of metadata can lead to difficulties for end-users who are trying to discover eprints across multiple repositories".

Recommendations from Eprints UK

There should be further investigation into the user requirements for resource discovery services built on institutional repositories. In particular, this work should explore how an aggregation of metadata from UK repositories would interoperate with other international collections.
More effort should be made to achieve widespread agreement with and adoption of the recommendations for using simple DC to describe eprints.
Repository software suppliers and the administrators of eprint archives should be encouraged to adopt the simple recommendations for linking from the 'jump-off' page to the full text of the eprint. Work should be funded at an international level to agree how best to model eprints as 'complex objects' (e.g. as works and manifestations) and how to encode such complex objects in XML (e.g. by using METS or MPEG-21 DIDL).
There should be more investigation into the issues associated with name authority control for eprints and in particular into how best to maintain and expose authoritative name-based services and how best to integrate such services into the eprint workflow.

[edit]

Existing practice

Local practices can be seen by searching repositories, examples:

eprints.org, e.g. e-Prints Soton
DSpace, e.g. Edinburgh Research Archive
Fedora, e.g. Queensland QUT
Other, e.g. CCLRC ePubs

[edit]

Existing or proposed application profiles

Eprints UK - Using simple Dublin Core to describe eprints
DSpace - DSpace metadata
Eprints.org comes with out-of-the-box metadata
Arrow discovery service, Australia - ARROW application profile
DARE - DARE use of Dublin Core version 2.0 (NB: DARE are working on version 3.0, using 'XXQDC: Qualified Dublin Core eXtended and Extensible' following the idea of MPEG21/DIDL packages)
Canadian Repository Metadata Interest Working Group
Swedish SVEP project Metadata model
- Recommendations for harmonising metadata descriptions of electronically published scientific publications from Swedish universities and university colleges
- National format for publication databases (local registers of academic publications)
ETD-MS: an Interoperability Metadata Standard for Electronic Theses and Dissertations -- version 1.00, revision 2
DiVa metadata application profile, part of the DiVa project in Sweden

[edit]

Vocabularies

OpenURL:

Journal (http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:journal): journal, issue, article, conference, proceeding, preprint, unknown (but known to be journal related)
Book (http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:book): Book, bookitem, conference, proceeding, report, document, unknown (this is the fallback)

SWRC (Semantic Web for Research Communities)

http://ontoware.org/projects/swrc/

[edit]

Scenarios and Use case

Wherever possible, usage scenarios exist to support the requirements in the Functional Requirements Specification, as identified below.

[edit]