Automatic identification of "works" toward construction of frbrized opacs: An experiment on JAPAN/MARC bibliographic records

Research output: Contribution to journalArticle

4 Citations (Scopus)

Abstract

Purpose: Efforts have been made to improve the OPACs by collocating bibliographic records sharing the same "work" and navigating users among records under a certain "work," according to the Functional Requirements for Bibliographic Records (FRBR). This paper investigates methods of automatically identifying "works," i.e., grouping bibliographic records sharing the same work, for the JAPAN/MARC records, which are typical Japanese bibliographic records created and maintained by libraries in Japan. It reports the extent to which records can be automatically identified as members of a particular work and also which of the possible methods are effective. Methods: The method used in this study is to generate work identification keys for each work represented in a bibliographic record and then to bring the keys representing the same works together. The keys are in principle constructed as a combination of an author name and a title from the record. Several methods of generating such keys were examined and the clustering of keys was executed for each method. The clusters built automatically were evaluated by comparing them with the sample correct sets built manually. Results: The results of the experiment show that the proposed method is effective in average cases; however, the performance depends on the characteristics of works, for example, the volume of records sharing the same work, whether anonymous or not, and whether uniform titles exist. It also shows that it is effective to generate keys for every bibliographic hierarchical level with data elements such as author headings, statements of responsibility, descriptive titles, and title headings.

Original languageEnglish
Pages (from-to)119-151
Number of pages33
JournalLibrary and Information Science
Issue number61
Publication statusPublished - 2009
Externally publishedYes

Fingerprint

experiment
grouping
Japan
responsibility
performance

ASJC Scopus subject areas

  • Library and Information Sciences

Cite this

@article{04829305b8364b8eb65c95de917410c6,
title = "Automatic identification of {"}works{"} toward construction of frbrized opacs: An experiment on JAPAN/MARC bibliographic records",
abstract = "Purpose: Efforts have been made to improve the OPACs by collocating bibliographic records sharing the same {"}work{"} and navigating users among records under a certain {"}work,{"} according to the Functional Requirements for Bibliographic Records (FRBR). This paper investigates methods of automatically identifying {"}works,{"} i.e., grouping bibliographic records sharing the same work, for the JAPAN/MARC records, which are typical Japanese bibliographic records created and maintained by libraries in Japan. It reports the extent to which records can be automatically identified as members of a particular work and also which of the possible methods are effective. Methods: The method used in this study is to generate work identification keys for each work represented in a bibliographic record and then to bring the keys representing the same works together. The keys are in principle constructed as a combination of an author name and a title from the record. Several methods of generating such keys were examined and the clustering of keys was executed for each method. The clusters built automatically were evaluated by comparing them with the sample correct sets built manually. Results: The results of the experiment show that the proposed method is effective in average cases; however, the performance depends on the characteristics of works, for example, the volume of records sharing the same work, whether anonymous or not, and whether uniform titles exist. It also shows that it is effective to generate keys for every bibliographic hierarchical level with data elements such as author headings, statements of responsibility, descriptive titles, and title headings.",
author = "Shoichi Taniguchi",
year = "2009",
language = "English",
pages = "119--151",
journal = "Library and Information Science",
issn = "0373-4447",
publisher = "Mita Society for Library and Information Science",
number = "61",

}

TY - JOUR

T1 - Automatic identification of "works" toward construction of frbrized opacs

T2 - An experiment on JAPAN/MARC bibliographic records

AU - Taniguchi, Shoichi

PY - 2009

Y1 - 2009

N2 - Purpose: Efforts have been made to improve the OPACs by collocating bibliographic records sharing the same "work" and navigating users among records under a certain "work," according to the Functional Requirements for Bibliographic Records (FRBR). This paper investigates methods of automatically identifying "works," i.e., grouping bibliographic records sharing the same work, for the JAPAN/MARC records, which are typical Japanese bibliographic records created and maintained by libraries in Japan. It reports the extent to which records can be automatically identified as members of a particular work and also which of the possible methods are effective. Methods: The method used in this study is to generate work identification keys for each work represented in a bibliographic record and then to bring the keys representing the same works together. The keys are in principle constructed as a combination of an author name and a title from the record. Several methods of generating such keys were examined and the clustering of keys was executed for each method. The clusters built automatically were evaluated by comparing them with the sample correct sets built manually. Results: The results of the experiment show that the proposed method is effective in average cases; however, the performance depends on the characteristics of works, for example, the volume of records sharing the same work, whether anonymous or not, and whether uniform titles exist. It also shows that it is effective to generate keys for every bibliographic hierarchical level with data elements such as author headings, statements of responsibility, descriptive titles, and title headings.

AB - Purpose: Efforts have been made to improve the OPACs by collocating bibliographic records sharing the same "work" and navigating users among records under a certain "work," according to the Functional Requirements for Bibliographic Records (FRBR). This paper investigates methods of automatically identifying "works," i.e., grouping bibliographic records sharing the same work, for the JAPAN/MARC records, which are typical Japanese bibliographic records created and maintained by libraries in Japan. It reports the extent to which records can be automatically identified as members of a particular work and also which of the possible methods are effective. Methods: The method used in this study is to generate work identification keys for each work represented in a bibliographic record and then to bring the keys representing the same works together. The keys are in principle constructed as a combination of an author name and a title from the record. Several methods of generating such keys were examined and the clustering of keys was executed for each method. The clusters built automatically were evaluated by comparing them with the sample correct sets built manually. Results: The results of the experiment show that the proposed method is effective in average cases; however, the performance depends on the characteristics of works, for example, the volume of records sharing the same work, whether anonymous or not, and whether uniform titles exist. It also shows that it is effective to generate keys for every bibliographic hierarchical level with data elements such as author headings, statements of responsibility, descriptive titles, and title headings.

UR - http://www.scopus.com/inward/record.url?scp=70349766932&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=70349766932&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:70349766932

SP - 119

EP - 151

JO - Library and Information Science

JF - Library and Information Science

SN - 0373-4447

IS - 61

ER -