The following suggestions will help you to organise your data:
Use folders - When organising your data, consider using folders to group related files in one location. The number of files or folders per group may vary depending on the nature of your data.
Apply meaningful folder names - Ensure that you use clear and appropriate folder names that relate to the area of work or study rather than the individual responsible. This will avoid confusion if group members leave and is easier for new researchers to use.
Structure folders hierarchically - Design a folder structure with broad topics at the highest level and specific folders within these. However, try to avoid nesting folders too deeply as this may cause problems with path lengths.
Separate current and completed work - you may find it helpful to move temporary drafts or completed work into separate folders. This will also make it easier to review what you need to keep as you go along.
Control access at the highest level - it is easier to set access permissions near the top of your folder structure rather than trying to control permissions for deeply nested folders. This is particularly important if you need to grant someone access to only a subset of your data, in which case you could move these data to a new, higher-level folder.
Naming conventions are rules that allow electronic and physical records to be named in a consistent and logical way.
Use of consistent and meaningful names will enable you to identify and distinguish between similar records, making data retrieval easier.
If you create large numbers of data files that would be difficult to name individually, apply your naming convention at the folder level instead.
When you agree your naming convention, consider the following suggestions:
Files in a folder are usually shown sorted by name. You can take advantage of this to have your files appear in a consistent order.
Filenames
starting with special characters such as @
will appear first, followed by numbers, then the letters A to Z
For example, you might use this to arrange your files as follows:
2012-03-07_Subject-A_Audio.mp3
2012-03-07_Subject-A_Transcript-raw.docx
2012-03-07_Subject-A_Transcript-anonymised.docx
2012-04-22_Subject-B_Audio.mp3
2012-04-22_Subject-B_Transcript-raw.docx
As you work with your data it is important to distinguish between different versions or drafts of your files. Version control can help you to easily identify the current version of your data so that you avoid working on older or outdated copies. If you are working with others it can also help to link versions of the data to the time and author of the change.
There are a number of ways that different versions of data can be managed:
File naming - a simple method of version control is to create a duplicate copy and then update version information to create a unique file or folder name.
Version control tables - these are included within documents and can capture more information than using file naming conventions. Version control tables typically include the new version number, date of the change, person who made the change and the nature or purpose of the change.
Version control systems - there are many automated systems available that can store a repository of files and monitor access to them, logging who made what change and when. Version control systems are particularly useful for collaborative development of code or software. GitHub is a useful place to share, manage and review code.
Please see below a number of useful pages on our Intranet to help you further. Please note, you will need to log in to access the Intranet.
Further guidance on organising data:
We acknowledge the work of the UK Data Service, the University of Glasgow, the University of Leicester, the University of Southampton and the University of Bath in the development of this guidance.