Welcome to our R Programming Course specifically designed for biologists, including master and PhD students. This course aims to equip participants with basic R programming skills and introduce them to statistical analysis techniques applicable in molecular biology.
Participants will learn through a mix of lectures, hands-on exercises, and interactive discussions. By the end of the course, you should be able to perform data manipulation, create visualizations, and conduct statistical analyses using R
Participants are expected to:
1- Have the following installed on their computers before the course begins:
You can find the installation guides once you click on them:
2- install the following R-libraries:
To install these packages, you can use the following R command:
install.packages(c("knitr", "markdown", "dplyr", "ggplot2", "pheatmap", "dendextend"))in the R Console.
3- Have a GitHub account.
All participants must install R, RStudio, and Git before the start of the course. These tools are essential for participating in the course exercises and for following along with the instructions.
If you encounter any issues during the installation process, please:
- Refer to the FAQs and troubleshooting guides provided on the respective software download pages.
- Post your issue on the GitHub issues section of this repository. Please provide as much detail as possible about the problem you're experiencing.
- Contact us directly via email, and we'll do our best to assist you.
We strongly recommend that you try to familiarize yourself with R and RStudio by following some basic tutorials or trying out simple exercises. This will help you hit the ground running when the course starts.
Day 1:
Introduction :: Background and History
Session 1 :: Even More Basic Concepts in R
- R Function
- R Packages
- Package repositories, package ecosystems
Session 2 :: Data Wrangling in R
- dplyr
Session 3 :: Data Visualization
- ggplot2
Day 2:
Session 5 :: Inferential Statistics I
- Chi-square Test
- Fisher’s Exact Test
Session 6 :: Inferential Statistics II
- Anova
- t-test
- Linear Regression
Session 7 :: PCA & Hierarchical Clustering
- PCA
- Hierarchical Clustering
- small project with a report
To get started with the course, follow the link below
https://cecadbioinformaticscorefacility.github.io/Intermediate_R_Course_2025/
To clone this repository using Git:
git clone https://github.com/CECADBioinformaticsCoreFacility/Intermediate_R_Course_2025.git
Navigate into the cloned directory to access all course materials, datasets, and exercises.
For further learning and exploration of R, we recommend the following resources:
We welcome contributions to improve the course materials. Please feel free to fork the repository, make changes, and submit a pull request.
For any queries regarding the course, please reach out to us at cecad-bifacility-course@uni-koeln.de
We would like to thank all contributors and participants for making this course possible. Special thanks to the R community for the comprehensive resources and support.