Differences
This shows you the differences between two versions of the page.
phelp:helpexportoptions [2021/12/29 16:21] |
phelp:helpexportoptions [2021/12/29 16:21] (current) |
||
---|---|---|---|
Line 1: | Line 1: | ||
+ | TEXT EXPORT | ||
+ | |||
+ | These options generally apply only to the 'One Line' and ' | ||
+ | |||
+ | TAG DELIMITER | ||
+ | |||
+ | Select the character that you would like used to separate the data that is output for each tag. By default, this delimiter is set to the TAB character. If none of the delimiter characters that appear in the list meet your needs, you can enter your own. To do this you have to know the ASCII code (in decimal) for the character you want to use as a delimiter, and it must be entered as three digits (zero-filled). For example, to use '#' | ||
+ | |||
+ | IF THE DELIMITER APPEARS IN THE MARC DATA | ||
+ | |||
+ | If the character that you select for a delimiter actually appears in the data that is being output, it will break the export format (if you are going to import your data into another program). For example, if you export in tab-delimited format, and the title includes an actual TAB character, the import program will add an extra field beginning at that TAB character in the title. | ||
+ | |||
+ | This option gives you an opportunity to workaround this problem. If you select | ||
+ | Ignore: the delimiter character will be ignored if it occurs in the MARC data; | ||
+ | Preserve: the delimiter character will be preserved if it occurs in the MARC data by surrounding the data being output in quotation marks; | ||
+ | Replace: the delimiter character will be replaced with another character (usually a blank space) if it occurs in the MARC data; | ||
+ | Delete: the delimiter character will be deleted wherever it occurs in the MARC data. | ||
+ | |||
+ | If you are running a review and have selected one-line output, accept the default, which is ' | ||
+ | |||
+ | DEFAULT DATA | ||
+ | |||
+ | This option only applies to ' | ||
+ | |||
+ | NOTES: | ||
+ | 1) Default data is global (there is no way to specify different default data for different tags); | ||
+ | 2) Default data will only be consulted when the option ' | ||
+ | 3) Although the text box is small, there is no length requirement or restriction; | ||
+ | 4) The program will remove any leading or trailing whitespace from whatever is entered there. | ||
+ | |||
+ | OUTPUT OF MARC DATA ELEMENTS | ||
+ | |||
+ | In this section you can configure which MARC data elements are included in the text output for each tag. | ||
+ | |||
+ | Tags: If this option is checked, the MARC tag will be prepended (followed by a blank space) to the tag's data. For example: | ||
+ | 245 Scottish music[sound recording] :music rough guide. | ||
+ | The default is not to output tag labels. | ||
+ | |||
+ | Indicators: If this option is checked, the MARC indicators for the tag will be displayed: | ||
+ | 245 00Scottish music[sound recording] :music rough guide. | ||
+ | The default is not to output indicators. | ||
+ | |||
+ | Subfields: If this option is checked, MARC subfield delimiters (' | ||
+ | 245 00$aScottish music$h[sound recording] :$bmusic rough guide. | ||
+ | This is the only Text Export option selected by default. | ||
+ | |||
+ | Blank Around subfields: If this option is selected a blank space is added before and after each subfield: | ||
+ | 245 00 $a Scottish music $h [sound recording] : $b music rough guide. | ||
+ | The default is not add blank spaces around subfields. | ||
+ | This option is an exception in that it can be applied to the One-line, Custom, and Full Record text output types. | ||
+ | |||
+ | Occs: If this option is checked, the MARC tag's occurrence number will follow the MARC Tag. For example: | ||
+ | 245-01 00$aScottish music$h[sound recording] :$bmusic rough guide. | ||
+ | The default is not to output tag occurrences. | ||
+ | This option is an exception because it is only applicable in the ' | ||
+ | |||
+ | Fixed number of tags: This option is important if you want to export text data so that it can be imported by a program like Excel. This option will ensure that each record output will appear to have the same number of fields. This means that when a tag that is in your list of 'Tags to Output' | ||
+ | |||
+ | For example, if tag 440 is in your list but missing from a record, then the program will output '[440 not found]' | ||
+ | |||
+ | IF A DISPLAY TAG REPEATS | ||
+ | |||
+ | This option allows you to configure how the export process should deal with repeating tags (like subject headings). | ||
+ | |||
+ | Export first occurrence: If selected, this option exports only the first occurrence of any repeating tag that you have selected for display. | ||
+ | |||
+ | Export each occurrence: If selected, this option exports each occurrence of a repeating tag as a separate field. Please note that this option will not be a good choice if you expect to re-import the data. | ||
+ | |||
+ | Concatenate into one: If selected, this option will concatenate all the occurrences of a repeating tag into one export field. This is the best option to use if you are expecting to re-import the data. | ||
+ | |||
+ | KEY-VALUE FORMAT | ||
+ | |||
+ | This option is only valid if the following review criteria are met: | ||
+ | 1) Two patterns must be specified (no more, no less) | ||
+ | 2) The first pattern must be marked as the " | ||
+ | 3) No additional tags may be added to the "Tags to Output" | ||
+ | | ||
+ | In this scenario the first pattern should reference an identifier, such as a unique record control number (this is the " | ||
+ | |||
+ | If selected, and if the pattern matching criteria are valid (as defined above), then each line of output will consist of two entries: | ||
+ | 1) the output for the key | ||
+ | 2) the output for the value | ||
+ | | ||
+ | This option invokes special processing on the output. If the " | ||
+ | |||
+ | For example, with 001 specified as the first pattern, and the second pattern simply set to 650 (no subfields specified in the pattern), the output looks like this: | ||
+ | |||
+ | 1372414 $aCats$vFiction. | ||
+ | 1372414 $aShapeshifting$vFiction. | ||
+ | 1372414 $aFantasy fiction.$2sears | ||
+ | 1372414 $aCats.$2fast$0(OCoLC)fst00849374 | ||
+ | 1372414 $aShapeshifting.$2fast$0(OCoLC)fst01748560 | ||
+ | 1122393 $aAnimal attacks$vJuvenile literature. | ||
+ | 1122393 $aDangerous animals$vMiscellanea$vJuvenile literature. | ||
+ | 1122393 $aDangerous animals. | ||
+ | 1122393 $aSharks$vJuvenile literature. | ||
+ | 1122393 $aSharks. | ||
+ | 1133738 $aPolice$zCalifornia$zLos Angeles$vFiction. | ||
+ | |||
+ | | ||
+ | The example should serve to demonstrate the purpose of this output format: to generate an index on repeatable MARC data (something often difficult to achieve). The result of such a review may require post-processing to be useful; for now, we will leave that as an exercise for the user (though keep in mind the list-processing capabilities available in MARC Review and MARC Global: http:// | ||
+ | |||
+ | What if we need to specify more than one subfield for the value pattern, and/or need each subfield to be output to its own line? To do this, go to the "Text Export" | ||
+ | |||
+ | For example, to dump out all of the OCLC numbers in an 035 tag (which may appear in either $a or $z) on separate lines, follow these steps: create one pattern for the 001, and a second pattern for the 035, with subfields " | ||
+ | |||
+ | 99023142 | ||
+ | 1602940 | ||
+ | 1608626 | ||
+ | |||
+ | --but when we additionally select "Fixed number of tags" on the same page, the output becomes more amenable to subsequent processing: | ||
+ | |||
+ | 99023142 | ||
+ | 99023142 | ||
+ | 99023142 | ||
+ | 1602940 | ||
+ | 1602940 | ||
+ | 1608626 | ||
+ | 1608626 | ||
+ | 1608626 | ||
+ | |||
+ | Regarding the output itself, the lines are written in the order that they are found in the record (whether they are complete tags, or strings of subfields within the same tag). There is no attempt to sort the results. Also, if either the " | ||
+ | |||
+ | Also note that most of the options on the 'Text Export' | ||
+ | |||
+ | Finally, the option on the main "Text Output" | ||
+ | |||
+ | | ||
+ | DISPLAY '#' | ||
+ | |||
+ | This option, checked by default, has a very limited scope. It applies primarily to control tags (like the 001), that are completely empty, except for a field terminator. | ||
+ | |||
+ | Depending on what options are selected, this scenario might cause an anomaly in MARC Review' | ||
+ | |||
+ | 1680 record(s) in source file. | ||
+ | 1680 tag(s) in 1680 record(s) matched the pattern: | ||
+ | AND 001 | ||
+ | 228 text records output to: | ||
+ | D: | ||
+ | | ||
+ | So, the anomaly is: 1680 records contained the 001 and matched the (simple) pattern, but only 228 items were output? Upon investigation one discovers that the 001 in the other 1452 records was completely empty except for the field terminator, which is a non-printing character. | ||
+ | |||
+ | Hence this option, which will force something to be output in the scenario in question. | ||
+ | |||
+ | A NOTE ABOUT EXPORTING MARC DATA | ||
+ | |||
+ | Please note that the nature of MARC is such that importing a MARC record into a ' | ||
+ | |||
+ | Best results will be achieved when exporting fields that are almost always present in each record and do not repeat. For example, exporting MARC records into a ' | ||