OCLC CJK Users Group 2002 Annual Meeting
Saturday, April 6, 2002
Holiday Inn Georgetown
2101 Wisconsin Avenue NW
Washington DC, 20007

(Continental Breakfast provided)

Agenda



Minutes

As the second step, the Group evaluated sample records.

    1. First sample test for conversion using local records by OCLC:
      * Most Wade-Giles fields were converted correctly
      * Errors--Capitalization (Zhen, Xiang), partially converted, etc.
      * Pinyin fields were not affected by the conversion program--duplication of title entries occurred
    2. Second sample test (comparison between 100 OSUL and OCLC records)
      * In general, OCLC records were enhanced by other institutions, and they were of better quality than local records
      * Differences between OCLC and local records are mostly the result of local practices such as different cutter number, series titles in 490/830 fields instead of 440 field
    3. Third sample test (reviewing 200 OCLC non-Chinese records)
      * Errors and issues discovered --The author name was converted in the 245 field, but not in the 100 field --Wade-Giles in 5xx fields were not converted --Some Chinese subject headings were not converted --Wade-Giles in the 246 or the 740 fields were not converted, particularly in JK (Japanese and Korean) records --Errors found in some records which included 987 fields with the status code "c" --Many English records with Wade-Giles were not converted

As the third step, the Group came up with a specification for Pinyin conversion and CJK record replacement.

    1. OSUL will send 3 files to OCLC:
      * Chinese records--For conversion
      * Non-Chinese records--For conversion
      * Japanese and Korean records--For records replacement
    2. Chinese records:
         Pinyin conversion using option 4
      1) OCLC will map the fields (035, 910, 590) from the local records to the WorldCat records
      2) 880s--If the OSUL record has 880 fields and the WorldCat record does not, convert the OSUL record to Pinyin and return the OSUL records
    3. Non-Chinese records:
         Pinyin conversion using option 1
      1) Converted records will be reviewed by staff
      2) No editing will be done during the conversion period
    4. Customized reports:
      1) List of OSUL records if publication date in 260 in OSUL record does not match with publication date in 260 in the OCLC record
      2) List of OSUL records that have duplicate OCLC numbers, no matches, or no 880s in both OSUL and OCLC records

As concluding remarks, Ms. Shin reminded the audience of the need of collaboration between individual East Asian libraries and the three principal organizations in the Pinyin conversion venture, the need of sharing information on the conversion outcomes at individual libraries, and the need of fixing errors when encountered.

OCLC Reports

Before starting the OCLC reports, Mr. Marty Withrow, Director of the Metadata Services Division, OCLC, acknowledged Mr. Karl Lo's extraordinary contributions to the Pinyin conversion and to the promotion of multi-lingual, multi-script OPAC systems with Z39.50 protocol. He then presented to Mr. Lo a plaque with a clock representing the OCLC.

OCLC Strategic Plan and Developments: Progress Report—Marty Withrow (Click here for the PowerPoint presentation)

Mr. Withrow presented the service view of the OCLC cooperative—three service areas, i.e. Cataloging & Metadata Services, Digital & Preservation Resources Services, and Discovery & Fulfillment Services. The three service areas create a self-reinforcing cycle that supports library cooperation through Extended WorldCat, while at the same time creating new opportunities for expanding cooperative activity into new areas.

WorldCat presently supports cataloging, resource sharing and reference services to libraries. Last year, member libraries cataloged 55 million items via WorldCat and arranged 8.6 million online interlibrary loans. Library users did over 64 million reference searches on WorldCat. It now contains 49 million bibliographic records in 434 languages and over 800 million location listings, and grows by over 2 million records a year. OCLC's new strategy calls for taking this extraordinary library resource to an even higher and more useful level.

The extended WorldCat will be based on Oracle, have Unicode based character sets, and be able to support multiple languages in multiple scripts. OCLC is planning to support new metadata formats like the EAD format used in archives and special collections or the VRA format used for visual resources.

In the area of Discovery Services, recent WorldCat enhancements include added evaluative content such as cover art and summaries for books, interface redesign, Z39.50 OPAC holding links so that it displays local data in WorldCat and other First Search databases, links to Web booksellers, and rank results by library count. For First Search, Japanese interface has been released on March 24, 2002, and the Chinese counterpart is expected to be released within the next few months. To promote library collections and services in the web spaces where patrons look for information, OCLC started a program to link WorldCat holdings to resources offered through library portals and on the "public" web sites. Current partners include online booksellers and library information service providers.

A virtual reference desk service will be offered by OCLC and LC, and the service will be launched in June 2002. The new OCLC ILL web interface has been well accepted, and more than 1,100 libraries are now using this new interface.

OCLC's Digital & Preservation Resources will offer a complete, integrated set of services which will help to build awareness of digitization for your staff, board, and constituents, and move you through the Digital Lifecycle to the point where you are storing your digitized collection materials and preserving them for long-term use. The OCLC Digital & Preservation Co-op provides participants with an opportunity to share knowledge, develop funding opportunities, increase access to digital resources, establish best practices, influence standards and work together to build new collections.

OCLC Cataloging & Metadata Services help libraries cost effectively obtain and manage the metadata they need for both the print and electronic resources (CatExpress, New expert systems, PromptCat, TechPro). OCLC offers ready-for-the shelf collections for Chinese, Japanese, Korean, Vietnamese, Russian, and Spanish languages. And in the near future, OCLC will offer a custom collection development program including acquisition, cataloging, and physical processing services for books, videos, and other media materials.

By 2004 or later, the New Interface will be implemented. The features of the New Interface include:

    * Single entry point to OCLC Cataloging Services
    * Combines the best of OCLC Web and Windows-based interfaces
    * Springboard into enhanced WorldCat
    * Browser-based
    * Advanced Windows-based toolset coming to complement the browser interface

Questions and Answers:

Q: Are there any users of CatExpress service? A: Yes, there are many.

Q: Does the CatExpress service support CJK records as well? A: No, this is strictly for roman-script only.

OCLC CJK Activities Including Asia Pacific CJK-Related Work--Hisako Kotaka (Click here for the PowerPoint presentation)

Ms. Hisako Kotaka's presentation focused on OCLC CJK. First she gave an overview on the OCLC CJK users today.

  • 150+ OCLC CJK libraries worldwide
  • OCLC unable to maintain "CJK Who's Who" since CJK software available at no cost—The directory maintained by OCLC CJK Users Group provide an view about the membership in the North America
  • CJK software use spreading to public and special libraries
  • Contracted CJK projects (RetroCon, TechPro, Language sets) serving users
  • 5% of WorldCat (ca. 2.4M) is CJK MARC by 1.6% of OCLC cataloging members
  • Details on the OCLC CJK users and database statistics are available from: http://www.oclc.org/oclc/cjk/statmenu.htm

    Then, she gave a status report on OCLC CJK Software:

  • CJK version 3.11 available since February 2001 for electronic download
  • CJK 3.11 and Z-Client 1.00 will be on OCLC Access Suite 5.0 CD version in 2nd quarter 2002 for easy, quick installation
  • CJK/Z-client enhancement is "on hold"
  • CJK functionality will be incorporated into the new Unicode-based OCLC Cataloging Service --projected 2004
    http://www.oclc.org/strategy/cataloging/guidetomigration.pdf
  • On OCLC CJK database projects, the following were reported:

  • OCLC WINE Project Phase II done & WINE Records Editing Guide posted: http://www.oclc.org/oclc/bit/254/01aug.htm - waseda
    http://www.oclc.org/oclc/cjk/wineproject.shtm
  • WorldCat Pinyin Conversion done: - Still need the MARC 987 pinyin marker field
    - No longer use the WG vs. PY conversion in CJK software to avoid double PY conversion
  • EACC Validation Work done:
    - Invalid EACCs cannot transform into Unicode in the new OCLC Oracle database
  • The following are the technical news particularly for CJK catalogers:
  • CJK NACO members can adopt CatME for NACO work with similar user interface
  • Passport use for NACO work will end in December 2003
  • Cataloging rules changed to use GMD $h "electronic resource" replacing "computer file" – WorldCat scan and update in process
  • New expanded keyword indexes help users focus search outputs
    http://www.oclc.org/oclc/tb/tb235/tb235.htm
  • As to the technical concerns from CJK users, Ms. Kotaka discussed the following:

  • Toll-free fax number 1-866-709-6252 to report problems to OCLC Quality Control
  • CJK cataloging charges/credits the same as for non-CJK cataloging
  • Online title scan is powerful, but could be costly as 1) scan is one charge and 2) each line select/display is another charge
  • Users can save time/cost for CJK cataloging by mixing online and batch tasks in workflow
  • Contact RSP (Network) for your cataloging price information
  • The following issues related to CJK communications and training were mentioned:

  • OCLC Internet List Manager aware of "junk mails" over the open list
  • OCLC-CJK list with 'scan' features in development to control the list contents
  • OCLC UNS and CJK Resource People support and respond to CJK users' needs http://www.oclc.org/support/
    http://www.oclc.org/oclc/cjk/resource.htm
  • OCLC provides CJK information, news, updates, and announcements via OCLC- CJK list and CJK Web home page
  • E-Learning course sponsored by the OCLC Institute to prepare for changes
  • OCLC Metadata Contracted Services (RetroCon, TechPro, and Language Sets) --Bing Yu (Click here for the PowerPoint presentation)

    Ms. Bing Yu presented an overview on contracted services. First, on the RetroCon CJK Projects, she summarized the recent contract jobs that OCLC handled: Harvard Yenching Library (255,241 records of CJK and Vietnamese languages were converted by May 2001), UC Berkeley (50,000 CJK records for Phase II, and 26,833 for Phase III were converted), University Of Chicago (31,436 CJK records for Phase I were converted), Princeton University (scheduling to convert 180,000 CJK titles by April 2004), and Yale University (53,328 CJK records were converted, and 97,000 CJK titles will be converted by June 2003).

    As to the OCLC TechPro Services, OCLC now has a new, simplified pricing structure, and will be able to begin cataloging projects for Japanese and Chinese languages in early summer. The projects that TechPro Services worked in 2001/02 include Cornell University (CJ), University of Maryland (J), Grinnell College (C), University of California San Diego (JK), Brown University (CJK), Ohio State University (CJK).

    Regarding the Language Sets Services, Ms. Yu mentioned that the services have been very successful, particularly among public libraries. As the services expanded, new staff members joined in July 2001: Lisa Elliott (Co-manager), Andy Kier (Russian Specialist), and Nancy Thomas (Spanish Specialist). The following are news and projects of Language Sets Services for 2001/2002:

  • New languages will be added to the services each year
  • Fiction and non-fiction sets are ready to order
  • Chinese videos and DVD sets will be available soon
  • There were more than 50 new libraries who signed up for Language Sets Services
  • OCLC Pinyin Conversion Project—Marty Withrow (Click here for the PowerPoint presentation)

    As Mr. Glenn Patton, Director, Metadata Standards & Quality, OCLC, was unable to attend the OCLC CJK Users Group meeting this year, Mr. Withrow reported on the OCLC Pinyin Conversion Project for Mr. Patton.

    In the past year, OCLC converted 710,000 Chinese bibliographic records during May and June, 2001 (including 8,900 CONSER serial records), and in October 2001, another 25,000 non-Chinese bibliographic records were converted. There have been several periodic clean-up scans for records added to WorldCat since initial conversions.

    The audience was requested to continue to add a 987 field (current instructions are found at http://www.oclc.org/oclc/pinyin/oclcupdate.htm

      * For new Chinese records created in OCLC--Continue to include a 987 field
      * For non-Chinese records that include Pinyin data--Include a 987 field, especially if the record also contains language code 'chi' in field 041.

    As to the future projects, OCLC will continue to do clean-up conversions of batchloaded records and work on the 100,000 records marked for review. OCLC will be doing authority control processing on bibliographic records in WorldCat. This will assist in updating the name and subject headings on more non-Chinese records as well as updating headings on Chinese records if those headings have changed since the conversion.

    OCLC CJK users are requested to replace the records marked for review, when possible. When we do this, we must change the 987 $d from "r" to "c" and remove the 987 $f. If you do not have the authorization to replace records, you should report by using the "Report Error" function in CJK, or by using the web form found at http://www.oclc.org/oclc/forms/bibchg.htm
    Also, we are asked to take another minute before we add a new record to WorldCat, and eliminate typos and other avoidable problems.

    In connection with the future development of WorldCat, OCLC encourages us to start thinking about the topics of: 1) vernacular data in authority records, and 2) bibliographic records without romanized data. There has already been some discussion at LC and MARBI about how to incorporate other scripts into authority records and how to link together records for headings created under different cataloging rules. There are also various projects in China, Hong Kong and Taiwan to build an authority file of vernacular headings. Regarding the second topic, the example of OCLC Arabic software was mentioned, which accommodates vernacular data without parallel romanized fields.

    Questions and Answers:

    Q: Until when are we going to use the field 987?
    A: No definite date can be set at present.

    Q: Can OCLC work more aggressively to include Chinese bibliographic records created by Chinese institutions into WorldCat?
    A: Members can contact the Asia Pacific for updated information.

    Q: Can OCLC convert 40,000 Chinese MARC records that were previously purchased by the University of California, San Diego?
    A: It is technically possible. OCLC is able to support such kind of conversion as it helped to convert the Tsinghua University Library bibliographic records from the China MARC with GB code to the OCLC MARC with EACC code.

    Q: Can OCLC CJK Users Group consider to form a Task Force to promote metadata and electronic resource cataloging?
    A: Please contact the Chair.

    Q: Regarding the Arabic software, can users search by romanization for bibliographic records that do not have the romanized data?
    A: It is not possible, unless users know vernacular scripts.

    The meeting adjourned at 12:00 p.m.

    Respectfully submitted, Toshie Marra University of California, Los Angeles



    APPENDIX (April 6, 2002)

    Dear Mr. Karl Kwok-Bong Lo, upon your retirement in 2002, the OCLC CJK Users Group would like to acknowledge your distinguished leadership in promoting resource sharing through advanced technologies and in serving as Chair and Vice-Chair of the Group from 1991-1995. In 1991, as one of the founding Chairs, you initiated the establishment of the OCLC CJK Users Group for all its CJK users worldwide. Your strong support of the Group extended to the succeeding Board members.

    As an advocate of technologies related to multilingual scripts, you inspired the Group with your vision of and belief in resource sharing. Because of your exemplary leadership, the Group has been able to achieve its mission and objectives as stated in the Bylaws.

    On behalf of the OCLC CJK Users Group, the Executive Board members hereby present this certificate of appreciation in recognition of your extraordinary contributions.

    ----------------------                 -----------------------                  ----------------------

    Wen-ling Liu                        Philip Melzer                        Meng-fen Su   

    ----------------------                -------------------------                 ----------------------

    Toshie Marra                       Mikyung Kang                     Vickie Fu Doll

    OCLC CJK Users Group Annual Meeting, Washington, D.C., April 6, 2002