DISL: ISO Metadata Generator Index
Welcome to DISL Data Management's ISO Metadata Generator!
Step One: Organize Your Data
Before you can describe your dataset, first you must organize it in a clear, concise manner that will be easy to read and understand by scientists unfamiliar with your work. How would you organize it if someone in Alaska needed a presentable copy of your data? Will they understand what everything means without having to ask you any questions? Will it still make sense 30+ years from now?
If your dataset is in a Microsoft Excel workbook:
- Be ready to spell out and define all abbreviations and shorthands.
- Know what timezone you recorded time in, and whether/how you accounted for Daylight Savings.
- Know whether latitude/longitude was recorded in decimal degrees, or degree minutes, or something else.
- Know the units for every number. Was it feet or meter, etc.
- Know the scientific names for the species (or other taxa) of living things in your study.
- If you are using codes from a list hosted at NOAA or other organization, know where this list is located so that others can look it up.
- Be ready to explain in detail (or cite) complex methodologies, instrumentation, equipment, etc. Keep in mind that "standard" methods might change a lot over time.
- Know the full citations of anything you will reference, including author name(s), date of publication, title, journal or publisher, page numbers, DOIs if available, etc.
If your dataset is imagery:
- Fully explain any color coding that you use. You can do this in the feature catalogue (preferred!), within the same spreadsheet (also preferred!), and/or in a separate spreadsheet.
- Scroll each spreadsheet to the top so that cell A1 is visible. Place your cursor on A1.
- Go to the sheet that you would want people to see first when they open the file.
- Save the final version of the file.
If your dataset consists of multiple files:
- Make certain you at least have latitude/longitude for the geographic extent of the study area (northernmost, southernmost, easternmost, and westernmost points).
- For raster data, include a geoTIFF at full resolution.
- For vector data, include shape files.
- It is highly recommended to also include a PDF showing the original layer in the geodatabase.
- Give each file a name that makes clear what it contains.
- If there are more than three files, add a small text file (readme.txt) that lists all of the other files and explains what they are.
Step Two: Feature Catalogue
Now that your dataset is organized, you are ready to describe it in a feature catalogue. A feature catalogue is like the Entity/Attributes section of an FGDC metadata record, but in ISO 19139 it is a separate document (called ISO 19110) that is referenced in the main metadata document (ISO 19115-2). FGDC "entities" are called "feature types" in ISO.
Feature catalogues can be referenced by multiple metadata records with datasets that all use the same organization scheme or system - i.e. the same measured parameters in the same order with the same units, as a catalogue of all possible attributes that might appear in various specific datasets.
Example. Suppose that your dataset is in a Microsoft Excel workbook with three spreadsheets. The first two spreadsheets each have three columns of data: size, weight, and color. The third spreadsheet has: latitude, longitude, date, time, sampling depth, season.
If you have multiple MS Excel workbooks that are all organized the same way, such as for separate but similar projects over many years, you could cite the same feature catalogue for all of them.
- You would describe the MS Excel workbook in one feature catalogue.
- Each spreadsheet could be a separate feature type. However, because the first two spreadsheets contain the same type of data organized the same way, you only need to describe them once as one feature type.
- Each column of data in a spreadsheet is an attribute.
- For the attribute "color", where data are categorical, each possible color that appears in the dataset is a value. If the possible colors are red, blue, green, yellow, orange, pink, and purple, you have seven values. Likewise, "season" with possible seasons of spring, summer, fall, winter, is four values.
The feature catalogue generator creates a customized form for you based on how many different kinds of spreadsheets you have (feature types), how many columns of data are in each spreadsheet (attributes), and how many different items are in each categorical column (values).
IMPORTANT NOTE: This auto-generator does not save any of your work. We strongly recommend that you test it with fake information to find out how it works first, before filling in real information.
Step Three: The Main Metadata Document for the Dataset
Once you have a feature catalogue for your dataset, you are ready to generate the main metadata document.
Please follow these DISL-specific guidelines for ISO metadata.
IMPORTANT NOTE: This auto-generator does not save any of your work either.
Step Four: Revise and Finalize
Do you need to make edits to the auto-generated metadata, for example to add additional information not provided by the forms?
This color-coded template in Google Drive shows the correct order of the underlying code for many possible types of information you may wish to add.
XML Notepad is an xml-specific text editor that can be downloaded for free. It will allow you to more conveniently view the content of your metadata records without having to see all of the code.
For a full preview of how the metadata record will appear in DISL's metadata archive (the XSL Output tab in the above editor), you can also download the stylesheets we use. The second line of the main metadata record (set to <?xml-stylesheet type="text/xsl" href="../ISO/style/xml-to-html-ISO.xsl"?> by the auto-generator) can be temporarily edited to point to the location of the unzipped stylesheets on your computer.
Step Five: Validate and Turn It In
If you made edits, would you like to validate your ISO code? This is relatively fast and painless to do with NOAA NGDC's Record Services. Use Well Formed XML first, then ISO Validate. Note: this will only work for the main metadata record, not the feature catalogue.
Finally, send the metadata, data, maps, and other documentation to the DISL Data Management Center. We will check everything and, per DISL policy, add it to our archive.
If you'd like to make your full dataset publicly available in NOAA NCEI's Ocean Archive System, you can submit it through Send2NCEI.
Resources for learning more about ISO metadata:
NOAA NCEI Metadata Training - The NOAA NCEI Metadata Team offers webinar training courses on a regular basis.
NOAA NCEI Workbooks in PDF - MD for the basics, BIO for additional biological information, MI for more detailed spatial information.
NOAA EDM Wiki - the same information as the workbooks, but presented in a wiki.
Resources for writing and editing ISO metadata:
NOAA NCEI ATRAC System - a dataset submission system for NCEI that includes an ISO metadata editor. Free to register and use just the editor. Includes Data Quality/Lineage section. Does not include Feature Catalogue.
GoMRI GRIIDC - an ISO metadata editor good for quick, minimal metadata records. Does not include Data Quality/Lineage or Feature Catalogue.
GeoNetwork - a dataset catalog application that includes an excellent metadata writing tool. It's intended to be server software, but can be downloaded and installed onto individual PCs for just metadata editing.
Please direct any questions, comments, or other feedback to Mimi Tzeng.