Skip to Main Content
It looks like you're using Internet Explorer 11 or older. This website works best with modern browsers such as the latest versions of Chrome, Firefox, Safari, and Edge. If you continue with this browser, you may see unexpected results.
Managing your Project
Properly managing data and other research outputs starts at the beginning of the project and continues throughout. Below are tips for managing the project during three phases.
Before the Project
Plan for the project
Generate a directory for the project.
Design the structure of the directory.
An example of a directory structure could be:
Analysis (or Code)/
In designing the structure of the project directory consider the following:
Put code and data in separate subdirectories.
Plan to separate raw data from processed data.
Always apply the the 5 Cs: be Clear, Concise, Consistent, Correct, and Conformant.
Apply good practices for file naming.
Pick a scheme for organizing data.
Add ReadME file to the directory. The ReadME file should include:
Description of the project and information about the funders and collaborators (if there is any).
Goal statement of the project.
The projected input.
The expected output.
Expected environmental computations.
During the Project
On daily/weekly basis, document the steps you have accomplished.
If you have done analysis or collect dataset, try to document the workflow to do this and include the scripts you have generated for featuring the data.
For each script that does analysis, document it in another file and write what it does, the input, and the output of the script.
file formats that will ensure long-term access (e.g., .txt). Use
coding convention (e.g., module naming, comments). Use relative paths in the code (e.g., ../rawdata/example_file.csv).
Use tools that help you to automate your analysis, such as R.
Use tools that help in documenting the workflow and managing the data:
Tools that capture the experimental environment, such as
Docker, ReproZIP, and CDE. Tools that capture the sequence of the computations such as
Jupyter Notebook. Document each major step from the analysis.
Backup your data files every while.
GNU Make to automate the research process. For writing your manuscript use latex and Bibtex.
Use version control system such as Git.
Example for README Template
After the Project
Directly after submitting a scientific paper, document the workflow of the analysis in the paper, from fetching to preprocess the data, until you reach how the graphs and the tables in the paper were generated (i.e., write the steps that allow anyone to repeat the analysis):
List all the steps of the analysis in the paper.
Write a complete description of each algorithm/technique and the source code of these algorithms.
Write clearly the output and the input of each step.
Deposit your data in a repository for long-term preservation.
Institutional Repositories: Dash or Merrit.
Non-institutional repositories: Figshare.
Get your Data and your project DOIs.
Share your project on
Open Science Framework. If you have a sensitive data, check your institution policy to before sharing.
License your software. Develop a clear, accurate, and precise
user document ation for the project.