Understanding search source tasks

Search source tasks are created and scheduled when you create a new data source. For each data source, Portfolio creates two or three tasks. Each task represents a stage in the process of collecting data from the source and indexing it. A data source connected to an ILS has three tasks: harvesting, importing, and indexing. Discovery data sources usually have only two tasks: importing and indexing. Federated sources do not have associated scheduled tasks. By default, these are the search source tasks that are available:

Task Description

SD_ASSET_IMPORT

Converts all harvested asset metadata to DiXML in preparation for indexing. The ending of this task initiates the SD_ASSET_INDEX task. This task is typically initiated by SD_ASSET_EXTRACTOR in System Tasks.

SD_ASSET_INDEX

Indexes the imported asset metadata. This task is typically initiated by SD_ASSET_IMPORT.

SD_CONTENT_IMPORT

Converts all rooms content data to DiXML in preparation for indexing. In the Edit Discovery Search Source page you can schedule this process to launch at specific times. However, because Portfolio launches it at the end of the SD_CONTENT_EXTRACTOR task, scheduling this task separately is not effective. For more information, see Understanding system tasks. After this task runs, Portfolio runs the SD_CONTENT_INDEX task.

SD_CONTENT_INDEX

Indexes all the rooms content that was previously harvested by the SD_CONTENT_EXTRACTOR process. Portfolio runs this task after the SD_CONTENT_IMPORT task.

SD_ILS_HARVEST

Initiates the Discovery ILS Agent on the ILS server and harvests all library catalog data from your ILS system. After this process has run, Portfolio automatically runs the SD_ILS_IMPORT and SD_ILS_INDEX processes in succession.

SD_ILS_IMPORT

Converts all harvested library catalog data to DiXML in preparation for indexing.

SD_ILS_INDEX

Indexes all library catalog data that was previously harvested by the SD_CONTENT_INDEX process. This process also rebuilds the “Did You Mean?” dictionary and cache.

SD_ILS_HARVEST_DELTA

Harvests all library catalog data from your ILS system from a certain date and time to the present. After this process has run, Portfolio automatically runs the SD_ILS_IMPORT and SD_ILS_INDEX processes in succession.

SD_DIXML_IMPORT

Imports any DiXML data in preparation for indexing.

Note: Prior to running this task, you must have first uploaded the DiXML data files to your Portfolio system. For more information, see Fields: Upload Data Files.

SD_DIXML_INDEX

Indexes any DiXML data that was previously imported by the SD_DIXML_INDEX process.

SD_PDF_IMPORT

Imports and converts any PDF data to DiXML in preparation for indexing.

Note: Prior to running this task, you must have first uploaded the PDF files to your Portfolio system. For more information, see Fields: Upload Data Files.

After this process has run, Portfolio automatically runs the SD_PDF_INDEX process.

SD_PDF_INDEX

Indexes any PDF data that was previously imported and converted by the SD_PDF_INDEX process.

SD_OTHER_MARC_IMPORT

Imports and converts any non-ILS MARC data into DiXML in preparation for indexing.

Note: Prior to running this task, you must have first uploaded the non-ILS MARC files to your Portfolio system. For more information, see Fields: Upload Data Files.

After this process has run, Portfolio automatically runs the SD_OTHER_MARC_INDEX process.

SD_OTHER_MARC_INDEX

Indexes any non-ILS MARC data that was previously imported by the SD_OTHER_MARC_IMPORT process.

SD_LIB_FAVS_IMPORT

Imports and converts any Library Favorites data into DiXML format in preparation for indexing.

Note: Prior to running this task, you must have first uploaded the Library Favorites files to your Portfolio system. For more information, see Fields: Upload Data Files.

After this process has run, Portfolio automatically runs the SD_LIB_FAVS_INDEX process.

SD_LIB_FAVS_INDEX

Indexes any Library Favorite data that was previously imported by the SD_LIB_FAVS_INDEX process.

Tasks occur in a specific order. For ILS tasks, harvesting is the first task. It launches the ILS Agent to gather data from the ILS database. Next, the import task converts the data to DiXML and delivers it to the Portfolio. The indexing task then prepares the data and indexes it to provide quick results for patron searches. Because other types of data, such as rooms content or PDF documents, is already in the Portfolio system, it does not need to be harvested. It can be directly imported and indexed, so those data sources include only two tasks.

When you create an ILS data source, Portfolio also creates a delta harvester task. The delta harvester gathers only the changes that have been made to the ILS database since the previous full or delta harvest task. Because of this, the delta harvest and subsequent import and index tasks take less time to run. Delta tasks are created only for ILS sources; they are not created for any other data source type. For more information, see Delta Indexing Schedule.

Scheduled Tasks lists all of the task created for data sources in the Portfolio system. The task names are created by appending the task type with an underscore to the Code you assigned to the search source. For example, if you created an ILS source with the code “BRANCH_Z”, you would find three tasks in Scheduled Tasks: BRANCH_Z_HARVEST, BRANCH_Z_IMPORT, and BRANCH_Z_INDEX.

Only users with system administrator rights can view and manage scheduled tasks. Profile administrators do not have access to scheduled tasks.

Related topics