DMP_template.md 3.2 KB

Responsible people for data management

Please list here everyone who can access the data during the project (one person per line, please add role(s) for each person):

  • Governor: Data governance (co-)lead (person legally responsible)
  • Collector: data collectionr (on behalf of the responsible person(s))
  • Analyst: use of the data, analysis, processing (on behalf of the responsible person(s))
  • Steward XX: consultation and help with access to the data

    Example:  
    - Julien Colomb, Steward  
    

Data flow : list of questions (for each type of experiment):

exp1: TITLE

Data information

  • file naming and organisation conventions,
  • type of version control
  • raw data formats,
  • data analysis software
  • results format,
  • data sharing format,
  • data archive format,
  • metadata content, format and schema,
  • specific keywords,
  • vocabularies, taxonomies and ontologies used,
  • repositories used for publication

    Experimental design information

  • Experimentator blinding, and randomisation strategy:

  • Animal information sheet:

  • Study design (groups, sample type, sample size and rational for these decisions)

  • Inclusion and exclusion criteria

    Example:  
      
    Drosophila learning/memory experiments 
    ====================================== 
      
    File naming convention:  
    ----------------------- 
    YYYY-MM-DD_ShortNameExperiment 
      
    vestion control: 
    ---------------- 
    - no version suffix, version control via GIT in GIN (small text files only) 
      
    Details: 
    -------- 
    - We produce one spreadsheet for each experiment, following a template (link to template) 
    - One folder named YYYY-MM-DD_ShortName_Experiment  (data of start of experiment) contains: 
    - one readme.md file for text notation and explanations 
    - one metadata spreadsheet, indicating genotype and treatment code, age of flies at experiment 
    - The data spreadsheet, in a .tsv format 
    - It may also contain a result figure. 
      
    - Data will be published in this form (.tsv and .md files) at publication, on GIN. 
    - Data analysis is done via R scripts present in the analysis folder, figures are saved (named using the same convention) in the figure folder as `.pdf` files. 
      
    Taxonomies used: 
    - fly lines using flybase code 
    - fly genes as keywords using flybase code 
    
    Experimental design:
    
    - randomisation following table:
    - experiment done blind to genotype (code decripted at the end of the experiment/day)
    - n=15-18 per group per experiment, experiment done on at least 3 batches (3 days)
    - exclusion: if number of animal counted at the end of experiment less than 15.
    
    

other data types

Software and code:

  • Versioning
  • Publication
    • PID
    • dissemination/repository
  • licensing

  • tests

  • containerization

Hardware documentation:

  • Versioning
  • Publication
    • PID
    • dissemination/repository
  • licensing

Reagents (viruses, transgenics, ...):

  • Versioning
  • Publication
    • PID
    • dissemination/repository
  • licensing

Project management documentation, labnotebook (ELN)

  • Versioning
  • Publication
    • PID
    • dissemination/repository
  • licensing