Step 3: Processing and Analyzing Data

Introduction

For effective data analysis in research, the following aspects should be taken into account:

  1. Data Checking, Validation, and Cleaning: Careful quality assurance is considered fundamental. Data should be checked and validated to identify and correct errors. Consistent cleaning is seen as ensuring that analyses are based on accurate and reliable data.
  2. Use of Discipline-Specific Standards: In the data analysis phase, it is important to adhere to recognized and discipline-specific standards and methods of the field and to document them precisely (see excursus: Data documentation), particularly with regard to evaluation methods, controlled vocabularies, and file formats. Compliance with the principles of good scientific practice (GSP) is essential, especially guideline 11 [1], which emphasizes the application of scientifically sound and transparent methods, as well as quality assurance and the establishment of standards.
  3. Preparation of Data for Scientific Publication: Data should be carefully prepared for scientific publication. This includes, for example, the anonymization of personal data and ensuring that all relevant information is understandable to the readership.
  4. Documentation of Data Analysis: Complete documentation of all steps in the data analysis process is essential for understanding and reproducing the research (see excursus: Data documentation). Researchers must record all relevant information leading to a research result in a manner that is traceable according to subject-specific requirements, so that verification and evaluation of the results are possible [2].

Legal Aspects & Ethics

In the phase of data processing and analysis, careful consideration of legal aspects is required, especially those laid out in the General Data Protection Regulation (GDPR). A central issue concerns the processing of personal data (see excursus: legal aspects in RDM). The principle applies that personal data may only be processed if effective anonymization is possible and if a legal basis pursuant to Art. 6(1) GDPR [3] exists, either through the consent of the data subjects or a legal permission.
When analyzing data with special requirements, such as dual use, protected species, animal testing, or medical data, the various legal and ethical aspects must be considered. Furthermore, it should be critically examined whether the research objectives could also be achieved with a smaller amount of data or through the use of anonymized or pseudonymized data, to ensure compliance with the data minimization principles of data protection law.

Further Information

Digital research data management (RDM) assistant for engineering research (currently under development): Javres
Tool for anonymizing qualitative research data: QualiAnon

Tool for anonymizing qualitative research data: QualiAnon: QualiAnon

Tool für statistical analytics: R

R is a free software environment for statistical computing and graphics: https://www.r-project.org

Tool for cleaning and transforming data: OpenRefine

https://openrefine.org

A highly comprehensive introduction to the tool is offered by SODa.

Tool for data anonymization across all disciplines: Amnesia