Thank you to Virginia Wilson at the University of Saskatchewan Library for allowing me to copy her excellent Research Data Management guide.
I have changed some of the information to reflect KPU's situation, but most almost all credit goes to Virginia.(Any mistakes are likely mine, though!)
Your data are critical to your research.
Do you have a plan in place to store the information generated during your research project?
Think about what would happen if you were to lose some or all of your data:
Proper storage of research data pays dividends throughout your project.
(from Scott Summers' presentation on data security and storage, UK Data Service, 2016)
Research data storage can be broken down into 3 phases:
Things to consider when weighing your storage options:
Storage Medium | Examples | Pros | Cons |
---|---|---|---|
Institutional cloud storage | KPU OneDrive, Teams, or Sharepoint (R: drive) |
|
|
Institutional network drive |
(K:) drive |
|
|
KPU-issued desktop or laptop computers |
|
|
|
Personal desktop or laptop computers |
|
|
|
Commercial cloud storage |
Google Drive, AWS, Dropbox |
|
|
External storage media | External hard drives, flash drives, and optical discs |
|
|
Note: In some cases, well-secured and encrypted local drives that are not connected to a network, and are backed up rigorously, are appropriate for storing very sensitive data.
Protecting your research data "requires paying attention to physical security, network security and security of computer systems and files to prevent unauthorized access or unwanted changes to data, disclosure or destruction of data."[1] The more sensitive the data, the more stringent security measures need to be.
The Government of Canada is concerned about risks of theft and espionage related to federally-funded research data and has provided some guidance on its Safeguarding your Research website.
Here are a few aspects to consider:
The UK Data Service has detailed information about encryption techniques and tools, and has tutorial videos demonstrating the most commonly used encryption software.
Sources:
[1] Louise Corti, et al., 2014. Managing and Sharing Research Data: A Guide to Good Practice, Los Angeles: Sage.
Backing up your data
Backing up data refers to making copies of files frequently, usually for short-term storage during a project's active phase; or for long-term storage during its static phase. Data files can be lost due to hardware or software failure, they can be accidentally altered or deleted, or they can become corrupted, rendering them unreadable or error-ridden. To avoid problems resulting from data loss, researchers should ensure that their data is properly backed up. A well thought out backup strategy should be an integral part of the overall data management plan. (adapted from UK Data Service)
Why back up your data?
Things to consider when backing up your data. If you're using networked storage, discuss your requirements with the administrators. Some of the questions you need to ask are:
Incremental or partial back ups copy only the files or data that have changed since the previous back up and are performed on a regular basis.
Complete back-ups, on the other hand, duplicate your project's entire data collection.
Be sure to have multiple copies of backup and archive files, in several locations, in case of software or hardware failure, theft or tampering, or natural disasters.
Other considerations:
Protecting non-digital or textual data: ideally all non-digital data should be digitized. Items that cannot be digitized need to be managed in a way that keeps them secure and permits access on request.
File formats: use open or standardized formats rather than proprietary formats for both short-term and long-term storage of data.
Organization: establish and adhere to a protocol for naming and organizing back up copies to ensure that files are easy to locate and identify.
Tools: Mac OSX has a built-in back up utility called Time Machine; Windows has built-in back up and restore utilities; Linux users have access to a variety of backup and restore utilities.
There are also many third-party backup utilities available for all platforms, some open-source, some commercial.
1. As a part of your overall data management plan, design a detailed data storage, security and back up policy for your project, and review it from time to time during the project's active phase.
2. Adhere to the 3-2-1 principle:
3. Back up data files regularly. Check backed up files manually and verify them (using checksums, etc.) to ensure the integrity of the data.
4. Use portable media -- USB drives, portable hard drives, CDs or DVDs -- only for working copies of research data, not for master copies and never for sensitive or personal data. Encrypt these devices to protect the contents in the event of loss or theft.
5. Ensure data integrity by refreshing storage media. Magnetic and optical storage media can degrade with time.
6. Employ open or standard file formats for data storage to ensure that files will be readable in the future.
7. Create meaningful file names (including version information) to aid in organizing and locating files and folders.
A good reference for all things RDM is Managing and Sharing Research Data: A Guide to Good Practice by Louise Corti, et al. (2nd ed., Sage, 2020).