Import
Operations can be created using manual data entry. However, it is much easier to import metadata automatically from other systems. Almost any system can be connected via an appropriate plugin if it supplies machine-readable data. Two different types of plugins are generally available for this purpose:
Catalog plugins
Be it the central library catalogue, the index in the archive or the object database in the museum. Using standardized interfaces such as z39.50, SRU or OAI, metadata can be retrieved and processed in any format, including MARCXML, Lido or DublinCore. But proprietary interfaces with their own formats can also be used (e.g. XML via a REST interface).
Import/ Mass Import Plugins
Often metadata and digital objects are available as results of earlier projects and in larger quantities. Or Excel tables, CSV files together with special folder structures, Microsoft Access databases or other proprietary developments are available in proprietary formats. Even such data is no big problem for Goobi workflow. Even in such cases, the data can be transferred and imported using individual import plug-ins.
Processing
The strength of Goobi workflow lies in the processing and control of workflows. For this purpose, the desired workflows are stored in advance in repeatedly applicable production templates. Such a production template describes any number of tasks, which in turn are subject to a predefined sequence. Typical examples of production templates include these:
- Standard workflow for digitization and publishing
- Digitization on Demand
- Mass import of a legacy data project
On the basis of such a production template, a process is created in Goobi for each object to be digitised. This then allows the tasks taken over from the production template to be processed. Each of these tasks is assigned to at least one user group or user and receives various rights, functions or plug-ins for data processing and validation. In this way, among other things, it is controlled whether such a task is executed manually by a user or fully automatically by the server.
For example, a prototypical workflow for digitizing books can consist of the following tasks:
Task | Type | Responsibility | Description |
Creation of the process via catalogue query | Manually | Librarians | check the data record in the catalog, create a process by automatic data transfer from the catalog |
Scanning | Manually | Scan-Operator | Creation of the digital copies and import into Goobi workflow |
Validation of images | Automatic | Server | Automatic validation of correct color depth, resolution and file naming |
Quality Control | Manual | Quality Assurance | Sample-based examination of the scan quality with regard to completeness and sharpness |
Producing Derivatives | Automatic | Server | Automatic creation of a compressed derivative next to the master images |
Structure and Metadata Capture | Manual | Librarians | Generation of a pagination, generation of structural data, enrichment of metadata |
OCR Generation | Automatic | Server | Automatic full-text recognition and save as ALTO files |
Generation of persistent identifier | Automatic | Server | Automatic generation of persistent identifiers and registration of identifiers with a resolver |
Exports the data to the Goobi viewer | Automatic | Server | Automatic publication of the digitized material including metadata in the Goobi viewer with configured access restrictions |
Enrich master images with XMP metadata | Automatic | Server | Automatic enhancement of master images with XMP metadata from the Goobi process metadata |
Ingest the master images into a repository | Automatic | Server | Automatic ingest of master images into a repository (e.g. Fedora) |
Within Goobi workflow, the workflows and associated tasks are visualized as follows:
In addition to the core functions of Goobi workflow, a large number of different plug-ins are available as extensions. There are currently about 210 such plug-ins of different types. Project managers can, for example, use the statistics and administration plug-ins to monitor and evaluate the exact progress of the project and generate reports. The administrators of Goobi workflow, on the other hand, have powerful tools at their side with which mass manipulations can be carried out on the database in a simple manner.
Export
At the end of a workflow there is generally the export. This is also plugin-supported. Typical export plug-ins realize the transfer of images and metadata in standardized formats such as METS/MODS or LIDO format to a presentation system such as the Goobi viewer. But export plug-ins are also available for other application scenarios, such as for generating PDF files and sending e-mails with download links, or for ingesting data into a repository or long-term archive.