When preparing your data for your own use and for reuse by future researchers:
Never heard of metadata and wondering what it's all about? 'Metadata' is added information about your data (e.g., codebooks, data documentation) beyond the raw data files that will help you, your research team, and other researchers readily access, understand, and use the data - learn even more about metadata here.
Grant funding agencies and data repositories recommend using established metadata standards whenever possible. Some metadata schema and standards are discipline-specific, such as Darwin Core (biology) or DDI (social and behavioral sciences), while others are designed for a particular type of resource or may cover any discipline.
A metadata schema is a set of metadata elements with the name and meaning of each element specifically defined. A schema may also define rules for content, allowable data values, syntax, and/or other rules for recording and encoding data.
Following standardized and consistent best practices for naming your files will help both you and your research team readily access your data while still involved in the project. It similarly will help others if/when you share your data for replication/transparency purposes or reuse by other researchers. Check out Stanford Libraries recommended best practices for file naming and suggested software for batch renaming files.
Raw data is often messy and needs cleaned up before analysis can be performed -- linked below are video tutorials about some data cleaning tools and also links to freely download them.