Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
23 changes: 11 additions & 12 deletions episodes/03-working-with-MARC-files.md
Original file line number Diff line number Diff line change
Expand Up @@ -64,19 +64,19 @@ MarcEdit recognizes the following MARC file types:
</td>
<td>mrk8
</td>
<td>Legacy file format for MARC mnemonic files saved with UTF8 encoding.
<td>Legacy file format for MARC mnemonic files saved with UTF-8 encoding.
</td>
</tr>
<tr>
<td>MARCXML file
</td>
<td>xml
</td>
<td>A MARC file expressed in the eXtensible Markup format or a text-based format for representing structured information.
<td>A MARC file expressed in the eXtensible Markup Language format or a text-based format for representing structured information.
</td>
</tr>
</table>
To work with a MARC file in the MARCEditor your file needs to be in MARC mnemonic format. If you only have a binary file, then that file needs to be converted to the mnemonic format.
To work with a MARC file in the MARCEditor your file needs to be in MARC mnemonic format (.mrk). If you only have a binary file (.mrc), then that file needs to be converted to the mnemonic format.

::::::::::::::::::::::::::::::::::::::::::::::::::

Expand All @@ -93,13 +93,13 @@ To work with MARC data files or convert between metadata formats for library bib
- JSON to XML: This converts a JSON file to XML.
- XML to JSON: This converts an XML file to JSON.

The conversions from one encoding standard to another, as in MARC21 to MARC21XML, rely on eXtensible stylesheets. MarcEdit comes with several default stylesheets which come from those maintained by the Library of Congress. If you are familiar with stylesheets, you can also create your own.
The conversions from one encoding standard to another, as in MARC21 to MARC21XML, rely on extensible stylesheets. MarcEdit comes with several default stylesheets which come from those maintained by the Library of Congress. If you are familiar with stylesheets, you can also create your own.

::::::::::::::::::::::::::::::::::::::::: callout

### Character Encoding

To ensure the integrity of your data you need to select the correct character encoding for your dataset. MarcEdit does not automatically detect character encoding, however, UTF8 is set as the default encoding scheme. You can update the encoding scheme when using the MarcBreaker, or you can update the default in Preferences → MarEditor → Default Encoding. For more information on character encoding and translating from one encoding to another, see [The MarcEdit Field Guide](https://marcedit.reeset.net/learning_marcedit/9-2/dealing-with-character-encodings-in-marcedit/)
To ensure the integrity of your data you need to select the correct character encoding for your dataset. MarcEdit does not automatically detect character encoding, however, UTF-8 is set as the default encoding scheme. You can update the encoding scheme when using the MarcBreaker, or you can update the default in Preferences → MarcEditor → Default Encoding. For more information on character encoding and translating from one encoding to another, see [The MarcEdit Field Guide](https://marcedit.reeset.net/learning_marcedit/9-2/dealing-with-character-encodings-in-marcedit/).


::::::::::::::::::::::::::::::::::::::::::::::::::
Expand All @@ -114,11 +114,11 @@ To transform our binary MARC file (.mrc) into MARC mnemonic human readable forma

1. Launch MarcEdit and from the main window click on the MARC Tools Icon.
2. In the MARC Tools window, Select Operation → MARCBreaker.
3. In the field, Select Data to Process, click the file folder image to the right of the Open box to browse for the sample MARC data file (.mrc). Double click the found file to select it.
3. In the field, Select Data to Process, click the file folder image to the right of the Open box to browse for the sample MARC data file (marc_sample_data.mrc). Double click the found file to select it.
4. Next, you will need save your file in the MARC mnemonic format (.mrk) by clicking the file folder to the right of the Save As box. Select the location and name you would like to give your new file.
5. Under Character encoding select UTF8 as default character encoding.
6. Click execute.
7. Once you click execute the newly created .mrk file will be available to open in the MarcEditor. Under Results at the bottom of the window you will see a count of the records in your file. Click Edit Records to open the .mrk file in the MarcEditor.
7. Once you click execute the newly created .mrk file will be available to open in the MarcEditor. Under Results at the bottom of the window you will see a count of the records in your file (536). Click Edit Records to open the .mrk file in the MarcEditor.

**Note:** When you break a .mrc file and create a new .mrk file for editing in the MarcEditor, you are making a copy of your data in a new file format. As a result, any edits you make to the .mrk file in the MarcEditor will not automatically be reflected in the original .mrc file. We will cover saving and compiling (using the MarcMaker) to create an updated .mrc file in an upcoming lesson.

Expand Down Expand Up @@ -161,7 +161,7 @@ Records in the MarcEditor display are separated by a blank line.

### MARC syntax in the MarcEditor

Understanding the layout of MARC data in the MarcEditor is key to using the program's tools successfully. For instance, in some tools it is important to specify a field's indicators directly preceeding the first subfield. Throughout this workshop we will highlight how different tools rely on this syntax to apply edits.
Understanding the layout of MARC data in the MarcEditor is key to using the program's tools successfully. For instance, in some tools it is important to specify a field's indicators directly before the first subfield. Throughout this workshop we will highlight how different tools rely on this syntax to apply edits.


::::::::::::::::::::::::::::::::::::::::::::::::::
Expand All @@ -172,7 +172,7 @@ The MarcEditor divides a file of MARC records into 'pages' of 100 records. You c

You can adjust the number of records displayed per 'page' through the MarcEditor preferences which can be accessed through the Edit → Preferences menu option from the MarcEditor, or through the 'Settings' icon on the opening screen of MarcEdit.

Within the MarcEditor preferences, you can adjust the font and font size within the MarcEditor. You can also set your character encoding defaults. If you navigate to File Associations within the Preferences window, you can select Associate (`*.mrc`) files with the MarcBreaker and Associate (`*.mrk`) files with the MarcEditor. Setting these file associations will make it easy to break `.mrc` files and edit `.mrk` files.
Within the MarcEditor Preferences, you can adjust the font and font size used in the MarcEditor. You can also set your character encoding defaults. If you navigate to File Associations within the Preferences window, you can select Associate (`*.mrc`) files with the MarcBreaker and Associate (`*.mrk`) files with the MarcEditor. Setting these file associations will make it easy to break `.mrc` files and edit `.mrk` files.

If you change your preferences for the MarcEditor, the tool used to work with MARC data, you can always go back to the default settings.

Expand Down Expand Up @@ -207,9 +207,8 @@ Note: Some preference updates will force MarcEdit to restart and others will not
:::::::::::::::::::::::::::::::::::::::: keypoints

- MarcEdit can work with a variety of file formats
- The MARC Tools Icon allow you to convert data from one file format to another
- The MarcEditor works with a MarcEdit specific mnemonic format of MARC records (.mrk)
- It is necessary to break a MARC binary file to work with that MARC data in the MarcEditor. The extension of these easily readable MARC files are .mrk rather than the binary extension of .mrc
- The MARC Tools Icon allows you to convert data from one file format to another
- The MarcEditor works with an easy-to-read MarcEdit specific mnemonic format of MARC records (.mrk). If starting with a MARC binary file (.mrc), you must use the MarcBreaker to convert the binary file to a mnemonic file.
- Understanding the layout (syntax) of MARC records in the MarcEditor is key to working with the records.

::::::::::::::::::::::::::::::::::::::::::::::::::
Expand Down
3 changes: 3 additions & 0 deletions learners/reference.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,6 +7,9 @@ title: 'Glossary'
[Dublin Core]{#dc}
: A common metadata standard used for describing a wide range of resources across domains. It includes 15 essential elements like title, creator, subject, and date.

[Character Encoding]{#ce}
: A standardized system of numerical codes used to represent text characters—including letters, numbers, punctuation, and diacritics—within a MARC record. It functions as the "translation key" that allows a computer system to interpret binary data into human-readable text. The two common encoding schemes are 1. MARC-8: The legacy 8-bit encoding standard used to represent Latin, Greek, Arabic, Hebrew, and East Asian characters via escape sequences. 2. UTF-8 (Unicode): The modern, universal standard that assigns a unique code point to every character in every language, facilitating global data exchange.

[FAST (Faceted Application of Subject Terminology)]{#fast}
: A simplified subject heading system derived from the Library of Congress Subject Headings (LCSH), designed for faster and more consistent application.

Expand Down
Loading