What is Metadata?
Metadata is “a set of data that describes and gives information about other data.”
Or
“Information about information”
Types of Metadata
Descriptive metadata describes a resource for purposes such as discovery and identification. It can include elements such as title, abstract, author, and keywords.
Structural metadata indicates how compound objects are put together, for example, how pages are ordered to form chapters.
Administrative metadata is an umbrella term for the metadata that provides information to help manage a resource, such as when and how it was created, file type and other technical information, and who can access it.
- Technical metadata - For decoding and rendering files
- Preservation metadata - Long-term management of files
- Rights metadata - Intellectual property rights attached to content
Where to start with metadata management?
The first step to managing your metadata is figuring out what type of metadata you have, where it’s going, and how to get it there.
What is Metadata Management?
“Metadata management is the administration of data that describes other data.” – Margaret Rouse
Establishing policies and processes that ensure information can be:
- integrated
- accessed
- shared
- linked
- analyzed
- maintained
Where do we start?
- What kind of metadata are we dealing with?
- What file types are we working with?
- Where is it going?
Descriptive Metadata Scenario
Your colleague used MarcEdit to modify a collection of MARC records, and now you need to import the changes back into your catalog. The file delivered is a .mrk (mnemonic MARC file), but you know your catalog is expecting a .mrc (binary MARC File).
- What kind of metadata are we dealing with?
- What file types are we working with?
- Where is it going?
Converting metadata using MarcEdit
Technical Metadata Scenario 1
You have an Excel spreadsheet (.xsl) that you have downloaded from your institutional repository containing metadata for electronic theses and dissertations (ETDs). You decided you want to add file type and file size to the metadata, so that researchers accessing the ETDs have an idea what they are about to download.
- What kind of metadata are we dealing with?
- What file types are we working with?
- Where is it going?
Extracting Metadata using the Command Line
Technical Metadata Scenario 2
You have xml records that you have downloaded from your digital repository containing metadata for pdf files of electronic theses and dissertations (ETDs). You decided you want to see if you can add the page counts, software used to create the files, and file type for the individual pdfs.
- What kind of metadata are we dealing with?
- What file types are we working with?
- Where is it going?
Extracting Metadata using the FITS
Administrative Metadata Scenario
You have received a file with image files that you are planning to accession into your digital archive (which ingests XML). Before you make the image files available to the public, you want to record technical, preservation, and rights metadata about the files. You do not feel comfortable using command line.
- What kind of metadata are we dealing with?
- What file types are we working with?
- Where is it going?
Extracting Metadata using the Data Accessioner
Programs mentioned by order of appearance:
- MarcEdit
- Command Line - Built into your computer
- Notepad++
- Sublime Text Editor
- FITS (File Information Tool Set)
- DataAccessioner
- DataAccessioner Metadata Transformer