GSU Library Research Guides: Data Management &amp; Sharing: Organizing your Data:Metadata, File Naming, and Data Cleaning

FAIR Principles and C-U-R-A-T-E Steps

When preparing your data for your own use and for reuse by future researchers:

Use the FAIR Principles created by the Go FAIR Initiative to approach making your data Findable, Accessible, Interoperable, and Reusable.

Also consult the C-U-R-A-T-E Steps created by the Data Curation Network to guide preparing your data for curation.

Metadata Standards

Never heard of metadata and wondering what it's all about? 'Metadata' is added information about your data (e.g., codebooks, data documentation) beyond the raw data files that will help you, your research team, and other researchers readily access, understand, and use the data - learn even more about metadata here.

Grant funding agencies and data repositories recommend using established metadata standards whenever possible. Some metadata schema and standards are discipline-specific, such as Darwin Core (biology) or DDI (social and behavioral sciences), while others are designed for a particular type of resource or may cover any discipline.

A metadata schema is a set of metadata elements with the name and meaning of each element specifically defined. A schema may also define rules for content, allowable data values, syntax, and/or other rules for recording and encoding data.

Metadata Standards by Discipline from the Digital Curation Centre
A Qualitative Data Model for DDI from the the DDI Qualitative Data Working Group
Metadata Schema list maintained by the ASERL/SURA Research Data Management Group

Image CC Attribution 2.0 Generic from commons.wikimedia.org

File Naming - Best Practices

Following standardized and consistent best practices for naming your files will help both you and your research team readily access your data while still involved in the project. It similarly will help others if/when you share your data for replication/transparency purposes or reuse by other researchers. Check out Stanford Libraries recommended best practices for file naming.

Image public domain from wpclipart.com

Data Cleaning Tools

Raw data is often messy and needs cleaned up before analysis can be performed -- linked below are video tutorials about some data cleaning tools and also links to freely download them.

Image adapted from public domain images at wpclipart.com

How to Clean Up Raw Data in Excel (video tutorial)
Excel is free for all GSU affiliates - download from http://gsu.onthehub.com/
Introduction to OpenRefine (video tutorial)
Free to download from http://openrefine.org/
Reshaping data in DataWrangler (video tutorial)
Free to download from http://vis.stanford.edu/wrangler/
How to Use SAS - Lesson 5 - Data Reduction and Data Cleaning (video tutorial)
SAS is free for all GSU affiliates - download from http://gsu.onthehub.com/

Research Guides

Data Management & Sharing: Organizing your Data:
Metadata, File Naming, and Data Cleaning

FAIR Principles and C-U-R-A-T-E Steps

Metadata Standards

File Naming - Best Practices

Data Cleaning Tools

Contact Us

University Library

Research

Services & Spaces

Special Collections

Giving to the Library

About

Research Guides

Data Management & Sharing: Organizing your Data:Metadata, File Naming, and Data Cleaning

FAIR Principles and C-U-R-A-T-E Steps

Metadata Standards

File Naming - Best Practices

Data Cleaning Tools

Contact Us

Social

University Library

Research

Services & Spaces

Special Collections

Giving to the Library

About

Data Management & Sharing: Organizing your Data:
Metadata, File Naming, and Data Cleaning