Statistics:

Multivariate Statistics

Master essential unsupervised learning methods for dimension reduction and clustering. This course blends theory with practical exercises in R, helping you apply techniques like PCA, k-means, and advanced clustering methods to your research data.

This course is offered by HIDA

Data Science Training at HIDA

HIDA provides diverse continuing training programs in Information and Data Science, drawing from the entire Helmholtz Association.

Through specialized data science courses, AI training for administration and management, as well as lectures and events, HIDA enhances professional expertise and fosters interdisciplinary exchange.

This course introduces essential unsupervised learning methods for exploring and uncovering hidden structures in complex, high-dimensional datasets. You will learn the theory and application of dimension reduction with Principal Component Analysis (PCA) and advanced tools like t-SNE and UMAP. Additionally pattern discovery with a range of clustering algorithms is introduced. The focus is on practical implementation in R, equipping you to translate complex data into meaningful insights.

Topics

The course covers two main areas of unsupervised analysis:

  • Dimension Reduction:
    • Core principles of PCA for simplifying complex data.
    • How to select and interpret principal components.
    • Advanced tools for dimension reduction like t-SNE and UMAP and their usage.
  • Cluster Analysis for Pattern Discovery:
    • Understanding and choosing appropriate dissimilarity measures.
    • Applying core methods like k-means and hierarchical clustering.
    • Advanced clustering techniques including the Louvain method.

Methods

Each module introduces a statistical concept, followed immediately by practical exercises with best-practice solutions. We use R for the practical exercises.

Learning Goals

At the end of this course, you will be able to:

  • apply PCA to reduce data dimensionality and interpret the results.
  • select and implement appropriate clustering algorithms (k-means, hierarchical).
  • identify and characterize distinct groups within your data.
  • use advanced methods like Louvain for more complex data.
  • understand and interpret the outputs of the different approaches.

Course Dates

Register now: May 12-13, 2026 

For more information on how to register, please follow the link on the course date.

Prerequisites

Programming skills with R (as taught in the course “Introduction to R”) and basic knowledge of statistics (as taught in the course “Introduction to Statistics”).

Target group

This course is open to researchers of all career stages, or anyone interested in learning about the subject.

This course is free of charge.

Alternativ-Text

Subscribe newsletter